GeneMCL in microarray analysis |
| |
Authors: | Samuel Lattimore B van Dongen Stijn Crabbe M James C |
| |
Institution: | School of Animal and Microbial Sciences, University of Reading, Whiteknights, Reading RG6 6AJ, UK. |
| |
Abstract: | Accurately and reliably identifying the actual number of clusters present with a dataset of gene expression profiles, when no additional information on cluster structure is available, is a problem addressed by few algorithms. GeneMCL transforms microarray analysis data into a graph consisting of nodes connected by edges, where the nodes represent genes, and the edges represent the similarity in expression of those genes, as given by a proximity measurement. This measurement is taken to be the Pearson correlation coefficient combined with a local non-linear rescaling step. The resulting graph is input to the Markov Cluster (MCL) algorithm, which is an elegant, deterministic, non-specific and scalable method, which models stochastic flow through the graph. The algorithm is inherently affected by any cluster structure present, and rapidly decomposes a graph into cohesive clusters. The potential of the GeneMCL algorithm is demonstrated with a 5,730 gene subset (IGS) of the Van't Veer breast cancer database, for which the clusterings are shown to reflect underlying biological mechanisms. |
| |
Keywords: | MCL Microarrays Breast cancer |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|