首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
雌激素类化合物由于其对人和野生动物健康的负面影响而受到广泛关注.雌激素受体存在两种亚型(ERα和ERβ),化合物与两种受体亚型在结合活性和化合物结构特征方面存在差异.以31种与雌激素β受体亚型(ERβ)结合的化合物为研究对象,采用启发式变量筛选方法,从1524个变量中筛选出5个与化合物活性(lgRBA)最相关的变量,然后采用多元线性回归(MLR)建立最佳预测模型.模型相关性显著,而且具有良好的稳健性和预测能力(r2=0.829,q2LOO=0.742,r2pred=0.772,q2ext=0.724,RMSEE=0.395).同时揭示了影响化合物与ERβ受体结合的配体化合物分子的结构特征,并对模型的应用域进行了研究.  相似文献   

2.
QSAR models using a large diverse set of estrogens   总被引:12,自引:0,他引:12  
Endocrine disruptors (EDs) have a variety of adverse effects in humans and animals. About 58,000 chemicals, most having little safety data, must be tested in a group of tiered assays. As assays will take years, it is important to develop rapid methods to help in priority setting. For application to large data sets, we have developed an integrated system that contains sequential four phases to predict the ability of chemicals to bind to the estrogen receptor (ER), a prevalent mechanism for estrogenic EDs. Here we report the results of evaluating two types of QSAR models for inclusion in phase III to quantitatively predict chemical binding to the ER. Our data set for the relative binding affinities (RBAs) to the ER consists of 130 chemicals covering a wide range of structural diversity and a 6 orders of magnitude spread of RBAs. CoMFA and HQSAR models were constructed and compared for performance. The CoMFA model had a r2 = 0.91 and a q2LOO = 0.66. HQSAR showed reduced performance compared to CoMFA with r2 = 0.76 and q2LOO = 0.59. A number of parameters were examined to improve the CoMFA model. Of these, a phenol indicator increased the q2LOO to 0.71. When up to 50% of the chemicals were left out in the leave-N-out cross-validation, the q2 remained significant. Finally, the models were tested by using two test sets; the q2pred for these were 0.71 and 0.62, a significant result which demonstrates the utility of the CoMFA model for predicting the RBAs of chemicals not included in the training set. If used in conjunction with phases I and II, which reduced the size of the data set dramatically by eliminating most inactive chemicals, the current CoMFA model (phase III) can be used to predict the RBA of chemicals with sufficient accuracy and to provide quantitative information for priority setting.  相似文献   

3.
Quantitative spectroscopic data-activity relationship (QSDAR) models for polychlorinated dibenzofurans (PCDFs), dibenzodioxins (PCDDs), and biphenyls (PCBs) binding to the aryl hydrocarbon receptor (AhR) have been developed based on simulated (13)C nuclear magnetic resonance (NMR) data. All the models were based on multiple linear regression of comparative spectral analysis (CoSA) between compounds. A 1.0 ppm resolution CoSA model for 26 PCDF compounds based on chemical shifts in five bins had an explained variance (r(2)) of 0.93 and a leave-one-out (LOO) cross-validated variance (q(2)) of 0.90. A 2.0 ppm resolution CoSA model for 14 PCDD compounds based on chemical shifts in five bins had an r(2) of 0.91 and a q(2) of 0.81. The 1.0 ppm resolution CoSA model for 12 PCB compounds based on chemical shifts in five bins had an r(2) of 0.87 and a q(2) of 0.45. The models with more compounds had a better q(2) because there are more multiple chemical shift populated bins available on which to base the linear regression. A 1.0 ppm resolution CoSA model for all 52 compounds that was based on chemical shifts in 12 bins had an r(2) of 0.85 and q(2) of 0.71. A canonical variance analysis of the 1.0 ppm CoSA model for all 52 compounds when they were separated into 27 strong binding and 25 weak binding compounds was 98% correct. Conventional quantitative structure-activity relationship (QSAR) modeling suffer from errors introduced by the assumptions and approximations involved in calculated electrostatic potentials and the molecular alignment process. QSDAR modeling is not limited by such errors since electrostatic potential calculations and molecular alignment are not done. The QSDAR models provide a rapid, simple and valid way to model the PCDF, PCDD, and PCB binding activity in relation to the aryl hydrocarbon receptor (AhR).  相似文献   

4.
5.
取代喹啉类化合物抗菌活性的定量构效关系及分子设计   总被引:1,自引:0,他引:1  
采用密度泛函理论(DFT)和逐步回归分析法对15种新合成的取代喹啉类化合物进行了定量构效关系(QSAR)研究. 在B3LYP/6-31G(d,p)水平上计算了取代喹啉的量子化学参数, 通过逐步多元回归分析筛选出影响抗菌活性的主要因素, 建立了定量构效关系方程, 并用留一法交叉分析了模型的稳定性及预测能力. 结果表明, C5的亲核电子密度fNC5及C9-N1的键级BC9-N1是影响喹啉类化合物抗金黄色葡萄球菌活性的主要因素, 所得模型对该类化合物抗菌活性有较好的预测效果. 同时基于QSAR研究结果设计了4个活性较高的新喹啉衍生物.  相似文献   

6.
On QSAR Study of Stereoselectivity for Wittig Reaction   总被引:1,自引:0,他引:1  
1INTRODUCTION Witting reaction is an important and well-known organic reaction.In this reaction system,phospho-nium ylides react with aldehydes or ketones to gene-rate olefins and phosphonium oxides.Obviously,the position of double bond in olefins is exactly the po-sition of carbonyl group in the reactants,so there are no other position isomers in products.Due to this advantage,Witting reaction has been widely used in organic synthesis[1].Witting reaction could introduce double bond to c…  相似文献   

7.
The two-dimensional Quantitative Structure-Activity Relationship (2D-QSAR) models have been developed to estimate and predict the inhibitory activities of a series of HEPT analogues against HIV-1 by using quantum chemical parameters and physicochemical parameters. The best model of three parameters yields r = 0.908, r^2A = 0.800 and s = 0.467 based on stepwise multiple regression (SMR) method. The stability of the model has been verified by t-test, and the results show that the model has perfect robustness. The predictive power of QSAR models has been tested by Leave-One-Out (LOO) and Leave-Group(regularly random set)-Out(LGO) procedure Cross-Validation methodology. The r^2cv of 0.755 and r^2pred of 0.759 were obtained, respectively.  相似文献   

8.
9.
运用三维全息原子场作用矢量(3D-HoVAIF)对33个Nevirapine类抗艾滋病药物进行了定量构效关系(QSAR)研究。采用偏最小二乘回归(PLSR)建立定量构效关系模型,同时采用内部及外部双重验证的方法对所得模型稳定性能进行深入分析和检验,所建模型的复相关系数(Rcum2)、留一法(LOO)交互校验(CV)复相关系数(Qcum2)和外部样本校验复相关系数(Qext2)分别为0·835、0·530和0·518。结果表明,3D-HoVAIF能较好表征Nevirapine类抗艾滋病药物分子结构信息,且所建模型具有较好稳定性能和预测能力。  相似文献   

10.
苯并咪唑类缓蚀剂的3D-QSAR研究及分子设计   总被引:1,自引:0,他引:1  
采用比较分子场分析法(CoMFA)和比较分子相似性指数分析法(CoMSIA), 对苯并咪唑衍生物抗盐酸腐蚀的缓蚀性能进行了三维定量构效关系研究, 并使用留一法交叉验证手段对3D-QSAR模型的稳定性及预测能力进行了分析. 结果表明, 立体场、静电场和氢键供体场(电子给体)是影响苯并咪唑缓蚀剂缓蚀性能的主要因素; 所构建的CoMFA模型(q2=0.541, R2=0.996)和CoMSIA模型(q2=0.581, R2=0.987)均具有较好的统计学稳定性和预测能力. 基于3D-QSAR等势图设计出了几种具有较好缓蚀性能的苯并咪唑化合物, 为油气田新型缓蚀剂的研发提供了一种新思路.  相似文献   

11.
A novel three-dimensional holographic vector of atomic interaction field(3D-HoVAIF) was used to describe the chemical structures of 23 benzoxazinone derivatives as antithrombotic drugs.Here a quantitative structure activity relationship(QSAR) model was built by partial least-squares(PLS) regression.The estimation stability and prediction ability of the model were strictly analyzed by both internal and external validations.The correlation coefficients of established PLS model,leave-one-out(LOO) cross-validation,and predicted values versus experimental ones of external samples were R2=0.899,RCV2=0.854 and Qext2=0.868,respectively.These values indicated that the built PLS model had both favorable estimation stability and good prediction capabilities.Furthermore,the satisfactory results showed that 3D-HoVAIF could preferably express the information related to the biological activity of benzoxazinone derivatives.  相似文献   

12.
Five quantitative spectroscopic data-activity relationships (QSDAR) models for 50 steroidal inhibitors binding to aromatase enzyme have been developed based on simulated (13)C nuclear magnetic resonance (NMR) data. Three of the models were based on comparative spectral analysis (CoSA), and the two other models were based on comparative structurally assigned spectral analysis (CoSASA). A CoSA QSDAR model based on five principal components had an explained variance (r(2)) of 0.78 and a leave-one-out (LOO) cross-validated variance (q(2)) of 0.71. A CoSASA model that used the assigned (13)C NMR chemical shifts from a steroidal backbone at five selected positions gave an r(2) of 0.75 and a q(2) of 0.66. The (13)C NMR chemical shifts from atoms in the steroid template position 9, 6, 3, and 7 each had correlations greater than 0.6 with the relative binding activity to the aromatase enzyme. All five QSDAR models had explained and cross-validated variances that were better than the explained and cross-validated variances from a five structural parameter quantitative structure-activity relationship (QSAR) model of the same compounds. QSAR modeling suffers from errors introduced by the assumptions and approximations used in partial charges, dielectric constants, and the molecular alignment process of one structural conformation. One postulated reason that the variances of QSDAR models are better than the QSAR models is that (13)C NMR spectral data, based on quantum mechanical principles, are more reflective of binding than the QSAR model's calculated electrostatic potentials and molecular alignment process. The QSDAR models provide a rapid, simple way to model the steroid inhibitor activity in relation to the aromatase enzyme.  相似文献   

13.
A quantitative structure–activity relationship (QSAR) of 3‐(9‐acridinylamino)‐5‐hydroxymethylaniline (AHMA) derivatives and their alkylcarbamates as potent anticancer agents has been studied using density functional theory (DFT), molecular mechanics (MM+), and statistical methods. In the best established QSAR equation, the energy (ENL) of the next lowest unoccupied molecular orbital (NLUMO) and the net charges (QFR) of the first atom of the substituent R, as well as the steric parameter (MR2) of subsituent R2 are the main independent factors contributing to the anticancer activity of the compounds. A new scheme determining outliers by “leave‐one‐out” (LOO) cross‐validation coefficient (q) was suggested and successfully used. The fitting correlation coefficient (R2) and the “LOO” cross‐validation coefficient (q2) values for the training set of 25 compounds are 0.881 and 0.829, respectively. The predicted activities of 5 compounds in the test set using this QSAR model are in good agreement with their experimental values, indicating that this model has excellent predictive ability. Based on the established QSAR equation, 10 new compounds with rather high anticancer activity much greater than that of 34 compounds have been designed and await experimental verification. © 2006 Wiley Periodicals, Inc. Int J Quantum Chem, 2007  相似文献   

14.
二肽肽酶IV是一类用于治疗II型糖尿病具有潜在价值的关键酶, 很多此类酶的抑制剂用于处理此病具有相当好的有效性. 一系列N-取代的甘氨酰氰基吡咯烷衍生物对于二肽肽酶具有高的活性和选择性. 我们使用比较分子力场分析方法建立DPP-IV抑制剂——N-取代的甘氨酰氰基吡咯衍生物的三维定量构效关系, 该模型为设计用于治疗II型糖尿病的高效DPP-IV抑制剂提供结构信息. CoMFA模型的交叉验证相关系数q2=0.575, 非交叉验证相关系数r2=0.981, 绝对误差S=0.184, F9.68=388.5. 使用七个预测集检验了模型的预测能力. 所得的模型解释了已有的构效关系, 并对同类化合物有较好的预测能力, 该模型可用于指导新型的DPP-IV抑制剂的设计与优化.  相似文献   

15.
Validation is a crucial aspect for quantitative structure–activity relationship (QSAR) model development. External validation is considered, in general, as the most conclusive proof of predictive capacity of a QSAR model. In the absence of truly external data set, external validation is usually performed on test set compounds, which are members of the original data set but not used in model development exercise. In the case of small data sets, QSAR researchers experience problem in model development due to the fact that the developed models may be less reliable on account of the small number of training set compounds and such models may also show poor external predictability because the models may not have captured all necessary features required for the particular structure–activity relationships. The present paper attempts to show that ‘true r(LOO)’ statistic calculated based on the model derived from the undivided data set with application of variable selection strategy at each cycle of leave‐one‐out (LOO) validation may reflect external validation characteristics of the developed model thus obviating the requirement of splitting of the data set into training and test sets. This approach may be helpful in the case of small data sets as it uses all available data for model development and validation thus making the resulting model more reliable. Copyright © 2009 John Wiley & Sons, Ltd.  相似文献   

16.
Quantitative Structure–Activity Relationship (QSAR) models are used increasingly to screen chemical databases and/or virtual chemical libraries for potentially bioactive molecules. These developments emphasize the importance of rigorous model validation to ensure that the models have acceptable predictive power. Using k nearest neighbors (kNN) variable selection QSAR method for the analysis of several datasets, we have demonstrated recently that the widely accepted leave-one-out (LOO) cross-validated R2 (q2) is an inadequate characteristic to assess the predictive ability of the models [Golbraikh, A., Tropsha, A. Beware of q2! J. Mol. Graphics Mod. 20, 269-276, (2002)]. Herein, we provide additional evidence that there exists no correlation between the values of q 2 for the training set and accuracy of prediction (R 2) for the test set and argue that this observation is a general property of any QSAR model developed with LOO cross-validation. We suggest that external validation using rationally selected training and test sets provides a means to establish a reliable QSAR model. We propose several approaches to the division of experimental datasets into training and test sets and apply them in QSAR studies of 48 functionalized amino acid anticonvulsants and a series of 157 epipodophyllotoxin derivatives with antitumor activity. We formulate a set of general criteria for the evaluation of predictive power of QSAR models.  相似文献   

17.
In the quantitative structure‐activity relationship (QSAR) study, local lazy regression (LLR) can predict the activity of a query molecule by using the information of its local neighborhood without need to produce QSAR models a priori. When a prediction is required for a query compound, a set of local models including different number of nearest neighbors are identified. The leave‐one‐out cross‐validation (LOO‐CV) procedure is usually used to assess the prediction ability of each model, and the model giving the lowest LOO‐CV error or highest LOO‐CV correlation coefficient is chosen as the best model. However, it has been proved that the good statistical value from LOO cross‐validation appears to be the necessary, but not the sufficient condition for the model to have a high predictive power. In this work, a new strategy is proposed to improve the predictive ability of LLR models and to access the accuracy of a query prediction. The bandwidth of k neighbor value for LLR is optimized by considering the predictive ability of local models using an external validation set. This approach was applied to the QSAR study of a series of thienopyrimidinone antagonists of melanin‐concentrating hormone receptor 1. The obtained results from the new strategy shows evident improvement compared with the commonly used LOO‐CV LLR methods and the traditional global linear model. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010  相似文献   

18.
19.
采用三维全息原子场作用矢量(3D-HoVAIF)对32个吡咯类抗艾滋病药物进行结构参数化表征,并与其活性建立定量构效关系。分别采用多元线性回归(MLR)和偏最小二乘(PLS)进行建模,建模的复相关系数(R2cum)、交互校验复相关系数(Q2cum)和模型的标准偏差(SD)分别为R2cum=0.914、Q2cum=0.812、SD=0.236(MLR);R2cum=0.836、Q2cum=0.719、SD=0.314(PLS),结果均优于文献值(R2cum=0.667,Q2cum=0.581,SD=0.420)。所建模型具有良好的稳定性和预测能力,表明3D-HoVAIF能够较好地表征该类分子的结构,值得进一步推广应用。  相似文献   

20.
在DFT-B3LYP/6-311+G(d,p)水平对60种非环状亚硝胺分子结构进行几何全优化,通过多元逐步线性回归(MSR)分析筛选出9个量子化学描述符作为自变量,log LD50(lethal dose 50%,LD50:大鼠口服急性毒性)作为因变量,采用人工神经网络(ANN)方法构建QSAR模型.经Levenber...  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号