首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Active learning with support vector machines in the drug discovery process
Authors:Warmuth Manfred K  Liao Jun  Rätsch Gunnar  Mathieson Michael  Putta Santosh  Lemmen Christian
Institution:Computer Science Department, University of California, Santa Cruz, California 95064, USA. manfred@cse.ucsc.edu
Abstract:We investigate the following data mining problem from computer-aided drug design: From a large collection of compounds, find those that bind to a target molecule in as few iterations of biochemical testing as possible. In each iteration a comparatively small batch of compounds is screened for binding activity toward this target. We employed the so-called "active learning paradigm" from Machine Learning for selecting the successive batches. Our main selection strategy is based on the maximum margin hyperplane-generated by "Support Vector Machines". This hyperplane separates the current set of active from the inactive compounds and has the largest possible distance from any labeled compound. We perform a thorough comparative study of various other selection strategies on data sets provided by DuPont Pharmaceuticals and show that the strategies based on the maximum margin hyperplane clearly outperform the simpler ones.
Keywords:
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号