首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Gene selection from microarray data for cancer classification--a machine learning approach
Authors:Wang Yu  Tetko Igor V  Hall Mark A  Frank Eibe  Facius Axel  Mayer Klaus F X  Mewes Hans W
Institution:aInstitute for Bioinformatics, German Research Center for Environment and Health, Ingolstädter Landstraβ e 1, D-85764 Neuherberg, Germany;bDepartment of Computer Science, University of Waikato, Private Bag 3105, Hamilton, New Zealand;cDepartment of Genome-Oriented Bioinformatics, Wissenschaftszentrum Weihenstephan, Technische Universität München, Alte Akademie 10, D-85354 Freising-Weihenstephan, Germany
Abstract:A DNA microarray can track the expression levels of thousands of genes simultaneously. Previous research has demonstrated that this technology can be useful in the classification of cancers. Cancer microarray data normally contains a small number of samples which have a large number of gene expression levels as features. To select relevant genes involved in different types of cancer remains a challenge. In order to extract useful gene information from cancer microarray data and reduce dimensionality, feature selection algorithms were systematically investigated in this study. Using a correlation-based feature selector combined with machine learning algorithms such as decision trees, nave Bayes and support vector machines, we show that classification performance at least as good as published results can be obtained on acute leukemia and diffuse large B-cell lymphoma microarray data sets. We also demonstrate that a combined use of different classification and feature selection approaches makes it possible to select relevant genes with high confidence. This is also the first paper which discusses both computational and biological evidence for the involvement of zyxin in leukaemogenesis.
Keywords:Microarray  Gene selection  Machine learning  Cancer classification  Feature Selection
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号