Optimal neighbor selection in molecular similarity: comparison of arbitrary versus tailored prediction spaces |
| |
Authors: | Gute B D Basak S C |
| |
Institution: | Natural Resources Research Institute, University of Minnesota Duluth, 5013 Miller Trunk Hwy., 55811, USA. |
| |
Abstract: | Three classes of arbitrary quantitative molecular similarity analysis (QMSA) methods have been computed using atom pairs (APs), topological indices (TIs), and principal components (PCs) derived from topological indices. Tailored QMSA models have been developed from TIs selected through ridge regression. K-nearest neighbor (kNN) based estimation has been applied to all of the methods to estimate normal vapor pressure (p(vap)) and water solubility (sol) for a set of 194 chemicals. Results show that the tailored QMSA methods are superior to arbitrary similarity methods in estimating both of these properties for the given set of chemicals. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|