Gene selection for high dimensional data using k-means clustering algorithm and statistical approach

Dimensions

Ahmad, Farzana Kabir and Yusof, Yuhanis and Othman, Nor Hayati (2014) Gene selection for high dimensional data using k-means clustering algorithm and statistical approach. In: International Conference on Computational Science and Technology (ICCST), 27-28 Aug. 2014, Kota Kinabalu.

PDF
Restricted to Registered users only
Download (517kB) | Request a copy

Official URL: http://doi.org/10.1109/ICCST.2014.7045188

Abstract

Microarray technology can measure thousands of genes which are useful for biologist to study and classify the cancer cells.However, this high dimensional data consists of large number of genes to be examined in regard of small samples size. Thus, selection of relevant genes is a challenging issue in microarray data analysis and has been a central research focus.This study proposed kmeans clustering algorithm to groups the relevant genes. Several statistical techniques such as Fisher criterion, Golub signal-to-noise, Mann Whitney rank and t-test have been used in deciding the clusters are well separated from one and others. Those genes with high discriminative score will later be used to train the k-NN classifier.The experimental results showed that the proposed gene selection methods able to identify differentially expressed genes with 0.86 ROC score.

Item Type:	Conference or Workshop Item (Paper)
Uncontrolled Keywords:	Gene selection, microarray, statistical techniques
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:	School of Computing
Depositing User:	Mdm. Farzana Kabir Ahmad
Date Deposited:	20 Dec 2015 07:50
Last Modified:	27 Apr 2016 07:19
URI:	https://repo.uum.edu.my/id/eprint/16491

Actions (login required)

View Item