首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 605 毫秒
1.
2.
3.
4.
Selecting most rigorous quantitative structure-activity relationship (QSAR) approaches is of great importance in the development of robust and predictive models of chemical toxicity. To address this issue in a systematic way, we have formed an international virtual collaboratory consisting of six independent groups with shared interests in computational chemical toxicology. We have compiled an aqueous toxicity data set containing 983 unique compounds tested in the same laboratory over a decade against Tetrahymena pyriformis. A modeling set including 644 compounds was selected randomly from the original set and distributed to all groups that used their own QSAR tools for model development. The remaining 339 compounds in the original set (external set I) as well as 110 additional compounds (external set II) published recently by the same laboratory (after this computational study was already in progress) were used as two independent validation sets to assess the external predictive power of individual models. In total, our virtual collaboratory has developed 15 different types of QSAR models of aquatic toxicity for the training set. The internal prediction accuracy for the modeling set ranged from 0.76 to 0.93 as measured by the leave-one-out cross-validation correlation coefficient ( Q abs2). The prediction accuracy for the external validation sets I and II ranged from 0.71 to 0.85 (linear regression coefficient R absI2) and from 0.38 to 0.83 (linear regression coefficient R absII2), respectively. The use of an applicability domain threshold implemented in most models generally improved the external prediction accuracy but at the same time led to a decrease in chemical space coverage. Finally, several consensus models were developed by averaging the predicted aquatic toxicity for every compound using all 15 models, with or without taking into account their respective applicability domains. We find that consensus models afford higher prediction accuracy for the external validation data sets with the highest space coverage as compared to individual constituent models. Our studies prove the power of a collaborative and consensual approach to QSAR model development. The best validated models of aquatic toxicity developed by our collaboratory (both individual and consensus) can be used as reliable computational predictors of aquatic toxicity and are available from any of the participating laboratories.  相似文献   

5.
6.
7.
The blood-brain permeation of a structurally diverse set of 281 compounds was modeled using linear regression and a multivariate genetic partial least squares (G/PLS) approach. Key structural features affecting the logarithm of blood-brain partitioning (logBB) were captured through statistically significant quantitative structure-activity relationship (QSAR) models. These relationships reveal the importance of logP, polar surface area, and a variety of electrotopological indices for accurate predictions of logBB. The best models reveal an excellent correlation (r > 0.9) for a training set of 58 compounds. Likewise, the comparison of the average logBB values obtained from an ensemble of QSAR models with experimental values also verifies the statistical quality of the models (r > 0.9). The models provide good agreement (r approximately 0.7) between the predicted logBB values for 34 molecules in the external validation set and the experimental values. To further validate the models for use during the drug discovery process, a prediction set of 181 drugs with reported CNS penetration data was used. A >70% success rate is obtained by using any of the QSAR models in the qualitative prediction for CNS permeable (active) drugs. A lower success rate (approximately 60%) was obtained for the best model for CNS impermeable (inactive) drugs. Combining the predictions obtained from all the models (consensus) did not significantly improve the discrimination of CNS active and CNS inactive molecules. Finally, using the therapeutic classification as a guiding tool, the CNS penetration capability of over 2000 compounds in the Synthline database was estimated. The results were very similar to the smaller set of 181 compounds.  相似文献   

8.
9.
The main utility of QSAR models is their ability to predict activities/properties for new chemicals, and this external prediction ability is evaluated by means of various validation criteria. As a measure for such evaluation the OECD guidelines have proposed the predictive squared correlation coefficient Q(2)(F1) (Shi et al.). However, other validation criteria have been proposed by other authors: the Golbraikh-Tropsha method, r(2)(m) (Roy), Q(2)(F2) (Schu?u?rmann et al.), Q(2)(F3) (Consonni et al.). In QSAR studies these measures are usually in accordance, though this is not always the case, thus doubts can arise when contradictory results are obtained. It is likely that none of the aforementioned criteria is the best in every situation, so a comparative study using simulated data sets is proposed here, using threshold values suggested by the proponents or those widely used in QSAR modeling. In addition, a different and simple external validation measure, the concordance correlation coefficient (CCC), is proposed and compared with other criteria. Huge data sets were used to study the general behavior of validation measures, and the concordance correlation coefficient was shown to be the most restrictive. On using simulated data sets of a more realistic size, it was found that CCC was broadly in agreement, about 96% of the time, with other validation measures in accepting models as predictive, and in almost all the examples it was the most precautionary. The proposed concordance correlation coefficient also works well on real data sets, where it seems to be more stable, and helps in making decisions when the validation measures are in conflict. Since it is conceptually simple, and given its stability and restrictiveness, we propose the concordance correlation coefficient as a complementary, or alternative, more prudent measure of a QSAR model to be externally predictive.  相似文献   

10.
11.
12.
13.
Using 84 structurally diverse and experimentally validated LSD1/KDM1A inhibitors, quantitative structure–activity relationship (QSAR) models were built by OECD requirements. In the QSAR analysis, certainly significant and understated pharmacophoric features were identified as critical for LSD1 inhibition, such as a ring Carbon atom with exactly six bonds from a Nitrogen atom, partial charges of lipophilic atoms within eight bonds from a ring Sulphur atom, a non-ring Oxygen atom exactly nine bonds from the amide Nitrogen, etc. The genetic algorithm–multi-linear regression (GA-MLR) and double cross-validation criteria were used to create robust QSAR models with high predictability. In this study, two QSAR models were developed, with fitting parameters like R2 = 0.83–0.81, F = 61.22–67.96, internal validation parameters such as Q2LOO = 0.79–0.77, Q2LMO = 0.78–0.76, CCCcv = 0.89–0.88, and external validation parameters such as, R2ext = 0.82 and CCCex = 0.90. In terms of mechanistic interpretation and statistical analysis, both QSAR models are well-balanced. Furthermore, utilizing the pharmacophoric features revealed by QSAR modelling, molecular docking experiments corroborated with the most active compound’s binding to the LSD1 receptor. The docking results are then refined using Molecular dynamic simulation and MMGBSA analysis. As a consequence, the findings of the study can be used to produce LSD1/KDM1A inhibitors as anticancer leads.  相似文献   

14.
15.
16.
17.
18.
19.
梅虎  周原  廖志华  李志良 《化学学报》2006,64(9):949-952
采用VHSE氨基酸结构描述子表征HLA-A*0201限制性表位结构, 以遗传算法和偏最小二乘相结合(GA-PLS)对102个训练集进行定量构效关系建模. 剔除3个异常样本后, 据候选模型交互检验及50个外部测试集预测结果, 筛选得到最优偏最小二乘模型(A=2), 其R2, Q2和 分别为0.755, 0.621和0.680. 构效研究显示: CTL表位活性主要与1, 2, 7, 8, 9位氨基酸残基疏水、1, 2位立体及6位残基电性等性质密切相关.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号