
1.

Quantitative Structure–activity Relationship Analysis of Pyridinone HIV-1 Reverse Transcriptase Inhibitors using the k Nearest Neighbor Method and QSAR-based Database Mining Total citations: 1 (self-citations: 0, citations by others: 1)

Medina-Franco JL Golbraikh A Oloff S Castillo R Tropsha A 《Journal of computer-aided molecular design》2005,19(4):229-242


2.

Rational selection of training and test sets for the development of validated QSAR models

Golbraikh A Shen M Xiao Z Xiao YD Lee KH Tropsha A 《Journal of computer-aided molecular design》2003,17(2-4):241-253

Quantitative Structure–Activity Relationship (QSAR) models are used increasingly to screen chemical databases and/or virtual chemical libraries for potentially bioactive molecules. These developments emphasize the importance of rigorous model validation to ensure that the models have acceptable predictive power. Using the k nearest neighbors (kNN) variable selection QSAR method for the analysis of several datasets, we have demonstrated recently that the widely accepted leave-one-out (LOO) cross-validated R² (q²) is an inadequate characteristic to assess the predictive ability of the models [Golbraikh, A., Tropsha, A. Beware of q²! J. Mol. Graphics Mod. 20, 269-276, (2002)]. Herein, we provide additional evidence that there exists no correlation between the values of q² for the training set and accuracy of prediction (R²) for the test set and argue that this observation is a general property of any QSAR model developed with LOO cross-validation. We suggest that external validation using rationally selected training and test sets provides a means to establish a reliable QSAR model. We propose several approaches to the division of experimental datasets into training and test sets and apply them in QSAR studies of 48 functionalized amino acid anticonvulsants and a series of 157 epipodophyllotoxin derivatives with antitumor activity. We formulate a set of general criteria for the evaluation of predictive power of QSAR models.
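The two statistics contrasted in this abstract can be sketched as follows. This is a minimal illustration only, not the authors' variable-selection kNN procedure: the helper names (`euclid`, `knn_predict`, `loo_q2`, `external_r2`), the toy one-descriptor data, and the choice of the squared Pearson correlation as the external R² are all assumptions made for the example.

```python
import math

def euclid(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def knn_predict(train_X, train_y, x, k):
    """Predict the activity of x as the mean activity of its k nearest training compounds."""
    order = sorted(range(len(train_X)), key=lambda i: euclid(train_X[i], x))
    nearest = order[:k]
    return sum(train_y[i] for i in nearest) / len(nearest)

def loo_q2(X, y, k):
    """Leave-one-out cross-validated q2 = 1 - PRESS / SS on the training set."""
    mean_y = sum(y) / len(y)
    press = 0.0
    for i in range(len(X)):
        # Hold out compound i and predict it from the remaining compounds.
        rest_X = X[:i] + X[i + 1:]
        rest_y = y[:i] + y[i + 1:]
        press += (y[i] - knn_predict(rest_X, rest_y, X[i], k)) ** 2
    ss = sum((v - mean_y) ** 2 for v in y)
    return 1.0 - press / ss

def external_r2(train_X, train_y, test_X, test_y, k):
    """Squared Pearson correlation between observed and predicted test-set activities."""
    pred = [knn_predict(train_X, train_y, x, k) for x in test_X]
    mo = sum(test_y) / len(test_y)
    mp = sum(pred) / len(pred)
    cov = sum((o - mo) * (p - mp) for o, p in zip(test_y, pred))
    var_o = sum((o - mo) ** 2 for o in test_y)
    var_p = sum((p - mp) ** 2 for p in pred)
    return cov * cov / (var_o * var_p)

# Hypothetical toy data: one descriptor, activity equal to the descriptor value.
train_X = [[0.0], [1.0], [2.0], [3.0]]
train_y = [0.0, 1.0, 2.0, 3.0]
test_X = [[0.1], [1.1], [2.9]]
test_y = [0.1, 1.1, 2.9]

q2 = loo_q2(train_X, train_y, k=1)
r2 = external_r2(train_X, train_y, test_X, test_y, k=1)
```

On this toy set the LOO q² is low while the external R² is close to 1, a small illustration that the two quantities need not track each other; it says nothing about the magnitude of the effect on real datasets.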

3.

Combinatorial QSAR of ambergris fragrance compounds Total citations: 4 (self-citations: 0, citations by others: 4)

Kovatcheva A Golbraikh A Oloff S Xiao YD Zheng W Wolschann P Buchbauer G Tropsha A 《Journal of chemical information and computer sciences》2004,44(2):582-595


4.

Novel approach to evolutionary neural network based descriptor selection and QSAR model development

Debeljak Z Marohnić V Srecnik G Medić-Sarić M 《Journal of computer-aided molecular design》2005,19(12):835-855


5.

Molecular dataset diversity indices and their applications to comparison of chemical databases and QSAR analysis

Golbraikh A 《Journal of chemical information and computer sciences》2000,40(2):414-425


6.

Selection of appropriate training and validation set chemicals for modelling dermal permeability by U-optimal design

G. Xu J.M. Hughes-Oliver J.D. Brooks J.L. Yeatts 《SAR and QSAR in environmental research》2013,24(2):135-156

Quantitative structure-activity relationship (QSAR) models are being used increasingly in skin permeation studies. The main idea of QSAR modelling is to quantify the relationship between biological activities and chemical properties, and thus to predict the activity of chemical solutes. As a key step, the selection of a representative and structurally diverse training set is critical to the predictive power of a QSAR model. Early QSAR models selected training sets in a subjective way, and solutes in the training set were relatively homogeneous. More recently, statistical methods such as D-optimal design or space-filling design have been applied, but such methods are not always ideal. This paper describes a comprehensive procedure to select training sets from a large candidate set of 4534 solutes. A newly proposed ‘Baynes’ rule’, which is a modification of Lipinski's ‘rule of five’, was used to screen out solutes that were not qualified for the study. U-optimality was used as the selection criterion. A principal component analysis showed that the selected training set was representative of the chemical space. Gas chromatograph amenability was verified. A model built using the training set was shown to have greater predictive power than a model built using a previous dataset [1].

7.

Predictive toxicology modeling: protocols for exploring hERG classification and Tetrahymena pyriformis end point predictions

Su BH Tu YS Esposito EX Tseng YJ 《Journal of chemical information and modeling》2012,52(6):1660-1673


8.

Combinatorial QSAR modeling of specificity and subtype selectivity of ligands binding to serotonin receptors 5HT1E and 5HT1F

Wang XS Tang H Golbraikh A Tropsha A 《Journal of chemical information and modeling》2008,48(5):997-1013


9.

Development of quantitative structure-activity relationships and classification models for anticonvulsant activity of hydantoin analogues

Sutherland JJ Weaver DF 《Journal of chemical information and computer sciences》2003,43(3):1028-1036


10.

Impact assessment of the rational selection of training and test sets on the predictive ability of QSAR models

M. F. Andrada E. G. Vega-Hissi M. R. Estrada 《SAR and QSAR in environmental research》2017,28(12):1011-1023

This study analyzed the influence of rational selection of the training and test sets on the quality and predictivity of quantitative structure–activity relationship (QSAR) models. The study was carried out on three different datasets of Influenza Neuraminidase (H1N1) inhibitors. The three datasets were divided into training and test sets using three rational selection methods (k-means, the Kennard–Stone algorithm, and selection by activity), and the results were compared with random selection. Then, a total of 31,490 mathematical models were developed, and those models that met the thresholds r²_train > 0.8, r²_loo > 0.7 and r²_test > 0.5, with minimum standard deviation (SD) and minimum root-mean-square error (RMS), were selected. The selected models were validated using the internal leave-one-out method, and the predictive capacity was evaluated with the external test set. The results indicate that random selection could lead to erroneous results, whereas rational selection allows more reliable conclusions to be obtained. The QSAR models with the greatest predictive power were found using the k-means algorithm and selection by activity.
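Of the rational selection methods named above, the Kennard–Stone algorithm is the simplest to sketch. The version below is a minimal illustration of the usual max-min formulation (seed with the two most distant compounds, then repeatedly add the candidate farthest from its nearest already-selected compound). The function name `kennard_stone`, the toy descriptors, and the use of Euclidean distance are assumptions for the example, not details taken from the study.

```python
import math

def euclid(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def kennard_stone(X, n_train):
    """Return (train_indices, test_indices) chosen by max-min distance selection.

    Assumes len(X) >= 2 and n_train >= 2.
    """
    n = len(X)
    dist = [[euclid(X[i], X[j]) for j in range(n)] for i in range(n)]
    # Seed the training set with the two most distant compounds.
    first, second = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda pair: dist[pair[0]][pair[1]],
    )
    selected = [first, second]
    remaining = [i for i in range(n) if i not in selected]
    while len(selected) < n_train and remaining:
        # Add the candidate whose nearest selected compound is farthest away.
        nxt = max(remaining, key=lambda i: min(dist[i][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected, remaining

# Hypothetical one-descriptor dataset.
X = [[0.0], [1.0], [2.0], [10.0]]
train_idx, test_idx = kennard_stone(X, n_train=3)
```

Compounds left in `test_idx` are then used as the external test set. Because the procedure depends only on descriptor distances, it spreads the training set across descriptor space without looking at the activities.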

11.

Considerations and recent advances in QSAR models for cytochrome P450-mediated drug metabolism prediction

Li H Sun J Fan X Sui X Zhang L Wang Y He Z 《Journal of computer-aided molecular design》2008,22(11):843-855

Quantitative structure–activity relationship (QSAR) methods are urgently needed for predicting ADME/T (absorption, distribution, metabolism, excretion and toxicity) properties to select lead compounds for optimization at the early stage of drug discovery, and to screen drug candidates for clinical trials. Use of suitable QSAR models ultimately results in a lower time cost and a lower attrition rate during drug discovery and development. In the case of ADME/T parameters, drug metabolism is a key determinant of metabolic stability, drug–drug interactions, and drug toxicity. QSAR models for predicting drug metabolism have undergone significant advances recently. However, most of the models used lack sufficient interpretability and offer poor predictability for novel drugs. In this review, we describe some considerations to be taken into account by QSAR for modeling drug metabolism, such as the accuracy/consistency of the entire data set, representation and diversity of the training and test sets, and variable selection. We also describe some novel statistical techniques (ensemble methods, multivariate adaptive regression splines and graph machines), which are not yet used frequently to develop QSAR models for drug metabolism. Subsequently, rational recommendations for developing predictable and interpretable QSAR models are made. Finally, the recent advances in QSAR models for cytochrome P450-mediated drug metabolism prediction, including in vivo hepatic clearance, in vitro metabolic stability, inhibitors and substrates of cytochrome P450 families, are briefly summarized.

12.

Design and development of novel focal adhesion kinase (FAK) inhibitors using Monte Carlo method with index of ideality of correlation to validate QSAR

P. Kumar A. Kumar J. Sindhu 《SAR and QSAR in environmental research》2019,30(2):63-80


13.

External validation and prediction employing the predictive squared correlation coefficient: test set activity mean vs training set activity mean

Schüürmann G Ebert RU Chen J Wang B Kühne R 《Journal of chemical information and modeling》2008,48(11):2140-2145

The external prediction capability of quantitative structure-activity relationship (QSAR) models is often quantified using the predictive squared correlation coefficient, q². This index relates the predictive residual sum of squares, PRESS, to the activity sum of squares, SS, without postprocessing of the model output, the latter of which is automatically done when calculating the conventional squared correlation coefficient, r². According to the current OECD guidelines, q² for external validation should be calculated with SS referring to the training set activity mean. Our present findings, including a mathematical proof, demonstrate that this approach yields a systematic overestimation of the prediction capability that is triggered by the difference between the training and test set activity means. Example calculations with three regression models and data sets taken from the literature show further that for external test sets, q² based on the training set activity mean may become even larger than r². As a consequence, we suggest always using the test set activity mean when quantifying the external prediction capability through q² and revising the respective OECD guidance document accordingly. The discussion includes a comparison between r² and q² value ranges and the q² statistics for cross-validation.
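The effect described in this abstract follows from q² = 1 − PRESS/SS: for the same test-set residuals, SS taken around the training set mean equals SS around the test set mean plus n·(test mean − training mean)², so it can only be larger, and q² can only rise. A minimal numeric sketch, with a hypothetical helper `q2_external` and made-up activities:

```python
def q2_external(y_obs, y_pred, reference_mean):
    """q2 = 1 - PRESS / SS, with SS taken around the supplied reference mean."""
    press = sum((o - p) ** 2 for o, p in zip(y_obs, y_pred))
    ss = sum((o - reference_mean) ** 2 for o in y_obs)
    return 1.0 - press / ss

# Hypothetical values: the test compounds are much more active than the training set.
train_mean = 2.0
test_obs = [4.0, 5.0, 6.0]
test_pred = [4.5, 5.5, 5.0]
test_mean = sum(test_obs) / len(test_obs)

q2_test_mean = q2_external(test_obs, test_pred, test_mean)    # SS around the test set mean
q2_train_mean = q2_external(test_obs, test_pred, train_mean)  # SS around the training set mean
```

With these numbers PRESS is 1.5 in both cases, but SS is 2 around the test mean and 29 around the training mean, so the same predictions score 0.25 in one convention and about 0.95 in the other.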

14.

Antitumor Agents 252. Application of validated QSAR models to database mining: discovery of novel tylophorine derivatives as potential anticancer agents

Zhang S Wei L Bastow K Zheng W Brossi A Lee KH Tropsha A 《Journal of computer-aided molecular design》2007,21(1-3):97-112


15.

Combinatorial QSAR modeling of P-glycoprotein substrates

de Cerqueira Lima P Golbraikh A Oloff S Xiao Y Tropsha A 《Journal of chemical information and modeling》2006,46(3):1245-1254


16.

Application of BCUT metrics and genetic algorithm in binary QSAR analysis

Gao H 《Journal of chemical information and computer sciences》2001,41(2):402-407


17.

Constructing optimum blood brain barrier QSAR models using a combination of 4D-molecular similarity measures and cluster analysis

Pan D Iyer M Liu J Li Y Hopfinger AJ 《Journal of chemical information and computer sciences》2004,44(6):2083-2098

A new method, using a combination of 4D-molecular similarity measures and cluster analysis to construct optimum QSAR models, is applied to a data set of 150 chemically diverse compounds to build optimum blood-brain barrier (BBB) penetration models. The complete data set is divided into subsets based on 4D-molecular similarity measures using cluster analysis. The compounds in each cluster subset are further divided into a training set and a test set. Predictive QSAR models are constructed for each cluster subset using the corresponding training sets. These QSAR models best predict test set compounds which are assigned to the same cluster subset, based on the 4D-molecular similarity measures, from which the models are derived. The results suggest that the specific properties governing blood-brain barrier permeability may vary across chemically diverse compounds. Partitioning compounds into chemically similar classes is essential to constructing predictive blood-brain barrier penetration models embedding the corresponding key physicochemical properties of a given chemical class.

18.

Local and global quantitative structure-activity relationship modeling and prediction for the baseline toxicity

Yuan H Wang Y Cheng Y 《Journal of chemical information and modeling》2007,47(1):159-169

The predictive accuracy of the model is of the greatest concern for computational chemists in quantitative structure-activity relationship (QSAR) investigations. It is hypothesized that a model based on analogous chemicals will exhibit better predictive performance than one derived from diverse compounds. This paper develops a novel scheme called "clustering first, and then modeling" to build local QSAR models for the subsets resulting from clustering of the training set according to structural similarity. For validation and prediction, the validation set and test set were first classified into the corresponding subsets, in the same way as the training set, and then the prediction was performed by the relevant local model for each subset. This approach was validated on two independent data sets by local modeling and prediction of the baseline toxicity for the fathead minnow. In this process, hierarchical clustering was employed for cluster analysis, k-nearest neighbor for classification, and partial least squares for the model generation. The statistical results indicated that the predictive performances of the local models based on the subsets were much superior to those of the global model based on the whole training set, which was consistent with the hypothesis. The approach proposed here is promising for extension to QSAR modeling for various physicochemical properties, biological activities, and toxicities.
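The "clustering first, and then modeling" scheme can be sketched in a few lines. The example below is a deliberately simplified stand-in, not the published workflow: cluster labels are assumed to be given rather than computed by hierarchical clustering, a query compound is assigned to a cluster by its single nearest training neighbor, and an ordinary one-descriptor least-squares line replaces partial least squares. All function names and data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept for one descriptor."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    return slope, my - slope * mx

def build_local_models(xs, ys, labels):
    """Fit one local model per cluster of the training set."""
    models = {}
    for lab in set(labels):
        cx = [x for x, l in zip(xs, labels) if l == lab]
        cy = [y for y, l in zip(ys, labels) if l == lab]
        models[lab] = fit_line(cx, cy)
    return models

def predict_local(x, xs, labels, models):
    """Assign x to the cluster of its nearest training compound, then apply that local model."""
    nearest = min(range(len(xs)), key=lambda i: abs(xs[i] - x))
    slope, intercept = models[labels[nearest]]
    return slope * x + intercept

# Hypothetical training set: two clusters with opposite descriptor-activity trends.
xs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
ys = [2.0, 4.0, 6.0, 5.0, 4.0, 3.0]
labels = [0, 0, 0, 1, 1, 1]

models = build_local_models(xs, ys, labels)
pred_low = predict_local(2.5, xs, labels, models)    # handled by the cluster-0 model
pred_high = predict_local(10.5, xs, labels, models)  # handled by the cluster-1 model
```

In this toy set a single global line would have to compromise between a rising and a falling trend, while each local model fits its own cluster exactly, which is the intuition behind the hypothesis stated in the abstract.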

19.

On the Use of Backpropagation Neural Networks in Modeling Environmental Degradation

E. Rorije M. C. Van Wezel W. J. G. M. Peijnenburg 《SAR and QSAR in environmental research》2013,24(4):219-235


20.

Differentiation of AmpC beta-lactamase binders vs. decoys using classification kNN QSAR modeling and application of the QSAR classifier to virtual screening

Hsieh JH Wang XS Teotico D Golbraikh A Tropsha A 《Journal of computer-aided molecular design》2008,22(9):593-609
