Accurate and interpretable computational modeling of chemical mutagenicity |
| |
Authors: | Langham James J Jain Ajay N |
| |
Institution: | Cancer Research Institute, University of California, San Francisco, 2340 Sutter Street, San Francisco, California 94143-0128, USA. james.langham@jainlab.org |
| |
Abstract: | We describe a method for modeling chemical mutagenicity in terms of simple rules based on molecular features. A classification model was built using a rule-based ensemble method called RuleFit, developed by Friedman and Popescu. We show how performance compares favorably against literature methods. Performance was measured through the use of cross-validation and testing on external test sets. All data sets used are publicly available. The method automatically generated transparent rules in terms of molecular structure that agree well with known toxicology. While we have focused on chemical mutagenicity in demonstrating this method, we anticipate that it may be more generally useful in modeling other molecular properties such as other types of chemical toxicity. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|