首页 | 本学科首页   官方微博 | 高级检索  
     


Ensemble methods for classification in cheminformatics
Authors:Merkwirth Christian  Mauser Harald  Schulz-Gasch Tanja  Roche Olivier  Stahl Martin  Lengauer Thomas
Affiliation:Computational Biology & Applied Algorithmics Group, Max-Planck-Institut für Informatik, Stuhlsatzenhauseg 85, 66123 Saarbrücken, Germany, and Roche Pharma Research, Basel, Switzerland. cmerk@mpi-sb.mpg.de
Abstract:We describe the application of ensemble methods to binary classification problems on two pharmaceutical compound data sets. Several variants of single and ensembles models of k-nearest neighbors classifiers, support vector machines (SVMs), and single ridge regression models are compared. All methods exhibit robust classification even when more features are given than observations. On two data sets dealing with specific properties of drug-like substances (cytochrome P450 inhibition and "Frequent Hitters", i.e., unspecific protein inhibition), we achieve classification rates above 90%. We are able to reduce the cross-validated misclassification rate for the Frequent Hitters problem by a factor of 2 compared to previous results obtained for the same data set with different modeling techniques.
Keywords:
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号