Drug-likeness scoring based on unsupervised learning

Authors:	Kyunghoon Lee, Jinho Jang, Seonghwan Seo, Jaechang Lim, Woo Youn Kim

Institution:	Department of Chemistry, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; HITS Incorporation, 124 Teheran-ro, Gangnam-gu, Seoul 06234, Republic of Korea; KI for Artificial Intelligence, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea

Abstract:	Drug-likeness prediction is important for the virtual screening of drug candidates. It is challenging because drug-likeness is presumably associated with the whole set of properties a molecule needs to pass clinical trials, so no definite data for regression are available. Binary classification models based on graph neural networks have recently been proposed, but their performance depends strongly on the choice of the negative set used for training. Here we propose a novel unsupervised learning model that requires only known drugs for training; it is a language model based on a recurrent neural network. Unlike the classification models, it showed relatively consistent performance across different datasets. In addition, its drug-likeness scores yield well-separated distributions whose mean values increase as the datasets consist of molecules from later stages of the drug development process, whereas the classification model predicted a polarized distribution with two extreme values for all datasets, presumably because of overconfident predictions on unseen data. This new concept thus offers a pragmatic tool for drug-likeness scoring and can further be applied to other biochemical applications. In short, the method quantifies drug-likeness through unsupervised learning, using only drug molecules as the training set and no non-drug-like molecules.
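The idea of training on known drugs only and scoring a molecule by how probable a language model finds it can be illustrated with a minimal sketch. Several things here are assumptions, not the authors' method: the abstract does not give the exact score definition, so the sketch uses the length-normalised log-likelihood of a SMILES string; a character-level bigram model with add-one smoothing stands in for the recurrent neural network purely to keep the example dependency-free; and the class name `BigramSmilesModel`, the boundary tokens, and the three-molecule training list are hypothetical.

```python
import math
from collections import Counter

START, END = "^", "$"  # hypothetical sequence-boundary tokens


class BigramSmilesModel:
    """Character-level bigram language model with add-one smoothing.

    A stand-in for the RNN language model described in the abstract:
    it is fitted on drug SMILES only and scores by likelihood, but it
    is a far weaker model than a recurrent network.
    """

    def fit(self, smiles_list):
        self.pair_counts = Counter()  # counts of (previous char, next char)
        self.prev_counts = Counter()  # counts of each previous char
        vocab = {END}
        for smi in smiles_list:
            tokens = [START, *smi, END]
            vocab.update(smi)
            for prev, nxt in zip(tokens, tokens[1:]):
                self.pair_counts[(prev, nxt)] += 1
                self.prev_counts[prev] += 1
        # One extra slot reserves probability mass for unseen characters.
        self.vocab_size = len(vocab) + 1
        return self

    def score(self, smi):
        """Mean log-probability per transition; higher means more drug-like."""
        tokens = [START, *smi, END]
        log_p = 0.0
        for prev, nxt in zip(tokens, tokens[1:]):
            num = self.pair_counts[(prev, nxt)] + 1
            den = self.prev_counts[prev] + self.vocab_size
            log_p += math.log(num / den)
        return log_p / (len(tokens) - 1)


# Toy training set (assumption): aspirin, ibuprofen, paracetamol.
drugs = [
    "CC(=O)Oc1ccccc1C(=O)O",
    "CC(C)Cc1ccc(cc1)C(C)C(=O)O",
    "CC(=O)Nc1ccc(O)cc1",
]
model = BigramSmilesModel().fit(drugs)

print("aspirin:", model.score("CC(=O)Oc1ccccc1C(=O)O"))
print("silicon chain:", model.score("[Si][Si][Si]"))
```

No negative set appears anywhere in the sketch: a string built from characters and patterns absent from the drug set simply receives a lower likelihood, which is the property the abstract contrasts with binary classifiers. A real implementation would presumably replace the bigram counts with a trained recurrent network and a much larger set of known drugs.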

Keywords:
