In silico prediction and screening of γ‐secretase inhibitors by molecular descriptors and machine learning methods |
| |
Authors: | Xue‐Gang Yang, Wei Lv, Yu‐Zong Chen, Ying Xue |
| |
Affiliation: | 1. Key Lab of Green Chemistry and Technology in Ministry of Education, College of Chemistry, Sichuan University, Chengdu 610064, People's Republic of China; 2. State Key Laboratory of Biotherapy, Chengdu 610041, People's Republic of China; 3. Bioinformatics and Drug Design Group, Department of Computational Science, National University of Singapore, Blk SOC1, Level 7, 3 Science Drive 2, Singapore 117543, Singapore |
| |
Abstract: | γ‐Secretase inhibitors have been explored for the prevention and treatment of Alzheimer's disease (AD). Methods for predicting and screening γ‐secretase inhibitors are highly desirable for facilitating the design of novel therapeutic agents against AD, especially given the incomplete knowledge of the mechanism and three‐dimensional structure of γ‐secretase. We explored two machine learning methods, support vector machine (SVM) and random forest (RF), to develop models for predicting γ‐secretase inhibitors of diverse structures. Quantitative analysis of the receiver operating characteristic (ROC) curve was performed to further examine and optimize the models. In particular, the Youden index (YI) was first introduced into the ROC curve analysis of the RF model to obtain an optimal probability threshold for prediction. The developed models were validated on an external testing set, with prediction accuracies for SVM and RF of 96.48% and 98.83% for γ‐secretase inhibitors and 98.18% and 99.27% for noninhibitors, respectively. Different feature selection methods were used to extract the physicochemical features most relevant to γ‐secretase inhibition. To the best of our knowledge, the RF model developed in this work is the first model with a broad applicability domain; on its basis, virtual screening of the ZINC database for γ‐secretase inhibitors was performed, yielding 368 potential hit candidates. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010 |
| |
Keywords: | γ‐secretase inhibitors; machine learning; support vector machine (SVM); random forest (RF); virtual screening |
|
|
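The abstract mentions using the Youden index to pick a probability threshold from the ROC curve but does not say how it was computed. The following is a minimal sketch of the general idea, not the authors' implementation: it scans candidate thresholds and keeps the one that maximizes J = sensitivity + specificity − 1. The function name `youden_threshold` and the example labels and probabilities are hypothetical and chosen only for illustration.

```python
def youden_threshold(y_true, scores):
    """Return (threshold, J) maximizing J = sensitivity + specificity - 1.

    y_true holds 1 for inhibitors and 0 for noninhibitors (assumed encoding).
    A sample is predicted positive when its score is >= threshold.
    """
    positives = sum(1 for y in y_true if y == 1)
    negatives = len(y_true) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative label")

    best_threshold, best_j = None, -1.0
    # Every distinct score is a candidate cut-off on the ROC curve.
    for t in sorted(set(scores)):
        tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= t)
        tn = sum(1 for y, s in zip(y_true, scores) if y == 0 and s < t)
        j = tp / positives + tn / negatives - 1.0
        if j > best_j:  # on ties, the lowest threshold is kept
            best_threshold, best_j = t, j
    return best_threshold, best_j


# Hypothetical class probabilities, e.g. from a random forest classifier.
labels = [1, 1, 0, 1, 1, 0, 0, 0]
probs = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
threshold, j = youden_threshold(labels, probs)
print(threshold, j)
```

With a fitted classifier, `probs` would be the predicted probability of the inhibitor class on a validation set, and the returned threshold would replace the default 0.5 cut-off when assigning class labels.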