An investigation of K-means clustering to high and multi-dimensional biological data

Baridam, Barilee B.; Ali, M. Montaz

UPSpace Home
→
Engineering, Built Environment and Information Technology
→
Computer Science
→
Research Articles (Computer Science)
→
View Item

We are excited to announce that the repository will soon undergo an upgrade, featuring a new look and feel along with several enhanced features to improve your experience. Please be on the lookout for further updates and announcements regarding the launch date. We appreciate your support and look forward to unveiling the improved platform soon.

An investigation of K-means clustering to high and multi-dimensional biological data

Baridam, Barilee B.; Ali, M. Montaz

URI: http://hdl.handle.net/2263/32205

Date: 2013

Abstract:

PURPOSE – The K-means clustering algorithm has been intensely researched owing to its simplicity of implementation and usefulness in the clustering task. However, there have also been criticisms on its performance, in particular, for demanding the value of K before the actual clustering task. It is evident from previous researches that providing the number of clusters a priori does not in any way assist in the production of good quality clusters. The authors' investigations in this paper also confirm this finding. The purpose of this paper is to investigate further, the usefulness of the K-means clustering in the clustering of high and multi-dimensional data by applying it to biological sequence data. DESIGN/METHODOLOGY/APPROACH – The authors suggest a scheme which maps the high dimensional data into low dimensions, then show that the K-means algorithm with pre-processor produces good quality, compact and well-separated clusters of the biological data mapped in low dimensions. For the purpose of clustering, a character-to-numeric conversion was conducted to transform the nucleic/amino acids symbols to numeric values.

Description:

1 pdf file.

Show full item record

Files in this item

Name: Baridam_Investiga ...

Size: 474.1Kb

Format: PDF

Description: Postprint Article

View/Open

This item appears in the following Collection(s)

Search UPSpace

Browse

All of UPSpace
This Collection
- Issue Date
- Authors
- Titles
- Subjects
- Supervisor
- UP Author
- UP Postgraduate
- Type

An investigation of K-means clustering to high and multi-dimensional biological data

An investigation of K-means clustering to high and multi-dimensional biological data

Abstract:

Description:

Files in this item

This item appears in the following Collection(s)

Search UPSpace

Browse

All of UPSpace

This Collection

My Account

UPSpace Workspace