首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 906 毫秒
1.
Membrane transport proteins are essential for cellular uptake of numerous salts, nutrients and drugs. Bilitranslocase is a transporter, specific for water-soluble organic anions, and is the only known carrier of nucleotides and nucleotide-like compounds. Experimental data of bilitranslocase ligand specificity for 120 compounds were used to construct classification models using counter-propagation artificial neural networks (CP-ANNs) and support vector machines (SVMs). A subset of active compounds with experimentally determined transport rates was used to build predictive QSAR models for estimation of transport rates of unknown compounds. Several modelling methods and techniques were applied, i.e. CP-ANN, genetic algorithm, self-organizing mapping and multiple linear regression method. The best predictions were achieved using CP-ANN coupled with a genetic algorithm, with the external validation parameter QV2 of 0.96. The applicability domains of the models were defined to determine the chemical space in which reliable predictions can be obtained. The models were applied for the estimation of bilitranslocase transport activity for two sets of pharmaceutically interesting compounds, antioxidants and antiprions. We found that the relative planarity and a high potential for hydrogen bond formation are the common structural features of anticipated substrates of bilitranslocase. These features may serve as guidelines in the design of new pharmaceuticals transported by bilitranslocase.  相似文献   

2.
3.
Alongside the validation, the concept of applicability domain (AD) is probably one of the most important aspects which determine the quality as well as reliability of the established quantitative structure–activity relationship (QSAR) models. To date, a variety of approaches for AD estimation have been devised which can be applied to particular type of QSAR models and their practical utilization is extensively elaborated in the literature. The present study introduces a novel, simple, and effective distance-based method for estimation of the AD in case of developed and validated predictive counter-propagation artificial neural network (CP ANN) models through a proficient exploitation of the Euclidean distance (ED) metric in the structure-representation vector space. The performance of the method was evaluated and explained in a case study by using a pre-built and validated CP ANN model for prediction of the transport activity of the transmembrane protein bilitranslocase for a diverse set of compounds. The method was tested on two more datasets in order to confirm its performance for evaluation of the applicability domain in CP ANN models. The chemical compounds determined as potential outliers, i.e., outside of the CP ANN model AD, were confirmed in a comparative AD assessment by using the leverage approach. Moreover, the method offers a graphical depiction of the AD for fast and simple determination of the extreme points.  相似文献   

4.
The predictive accuracy of the model is of the most concern for computational chemists in quantitative structure-activity relationship (QSAR) investigations. It is hypothesized that the model based on analogical chemicals will exhibit better predictive performance than that derived from diverse compounds. This paper develops a novel scheme called "clustering first, and then modeling" to build local QSAR models for the subsets resulted from clustering of the training set according to structural similarity. For validation and prediction, the validation set and test set were first classified into the corresponding subsets just as those of the training set, and then the prediction was performed by the relevant local model for each subset. This approach was validated on two independent data sets by local modeling and prediction of the baseline toxicity for the fathead minnow. In this process, hierarchical clustering was employed for cluster analysis, k-nearest neighbor for classification, and partial least squares for the model generation. The statistical results indicated that the predictive performances of the local models based on the subsets were much superior to those of the global model based on the whole training set, which was consistent with the hypothesis. This approach proposed here is promising for extension to QSAR modeling for various physicochemical properties, biological activities, and toxicities.  相似文献   

5.
6.
7.
8.
9.
10.
11.
Selecting most rigorous quantitative structure-activity relationship (QSAR) approaches is of great importance in the development of robust and predictive models of chemical toxicity. To address this issue in a systematic way, we have formed an international virtual collaboratory consisting of six independent groups with shared interests in computational chemical toxicology. We have compiled an aqueous toxicity data set containing 983 unique compounds tested in the same laboratory over a decade against Tetrahymena pyriformis. A modeling set including 644 compounds was selected randomly from the original set and distributed to all groups that used their own QSAR tools for model development. The remaining 339 compounds in the original set (external set I) as well as 110 additional compounds (external set II) published recently by the same laboratory (after this computational study was already in progress) were used as two independent validation sets to assess the external predictive power of individual models. In total, our virtual collaboratory has developed 15 different types of QSAR models of aquatic toxicity for the training set. The internal prediction accuracy for the modeling set ranged from 0.76 to 0.93 as measured by the leave-one-out cross-validation correlation coefficient ( Q abs2). The prediction accuracy for the external validation sets I and II ranged from 0.71 to 0.85 (linear regression coefficient R absI2) and from 0.38 to 0.83 (linear regression coefficient R absII2), respectively. The use of an applicability domain threshold implemented in most models generally improved the external prediction accuracy but at the same time led to a decrease in chemical space coverage. Finally, several consensus models were developed by averaging the predicted aquatic toxicity for every compound using all 15 models, with or without taking into account their respective applicability domains. We find that consensus models afford higher prediction accuracy for the external validation data sets with the highest space coverage as compared to individual constituent models. Our studies prove the power of a collaborative and consensual approach to QSAR model development. The best validated models of aquatic toxicity developed by our collaboratory (both individual and consensus) can be used as reliable computational predictors of aquatic toxicity and are available from any of the participating laboratories.  相似文献   

12.
We describe the use of Bayesian regularized artificial neural networks (BRANNs) coupled with automatic relevance determination (ARD) in the development of quantitative structure-activity relationship (QSAR) models. These BRANN-ARD networks have the potential to solve a number of problems which arise in QSAR modeling such as the following: choice of model; robustness of model; choice of validation set; size of validation effort; and optimization of network architecture. The ARD method ensures that irrelevant or highly correlated indices used in the modeling are neglected as well as showing which are the most important variables in modeling the activity data. The application of the methods to QSAR of compounds active at the benzodiazepine and muscarinic receptors as well as some toxicological data of the effect of substituted benzenes on Tetetrahymena pyriformis is illustrated.  相似文献   

13.
14.
15.
Support vector machines in water quality management   总被引:1,自引:0,他引:1  
Support vector classification (SVC) and regression (SVR) models were constructed and applied to the surface water quality data to optimize the monitoring program. The data set comprised of 1500 water samples representing 10 different sites monitored for 15 years. The objectives of the study were to classify the sampling sites (spatial) and months (temporal) to group the similar ones in terms of water quality with a view to reduce their number; and to develop a suitable SVR model for predicting the biochemical oxygen demand (BOD) of water using a set of variables. The spatial and temporal SVC models rendered grouping of 10 monitoring sites and 12 sampling months into the clusters of 3 each with misclassification rates of 12.39% and 17.61% in training, 17.70% and 26.38% in validation, and 14.86% and 31.41% in test sets, respectively. The SVR model predicted water BOD values in training, validation, and test sets with reasonably high correlation (0.952, 0.909, and 0.907) with the measured values, and low root mean squared errors of 1.53, 1.44, and 1.32, respectively. The values of the performance criteria parameters suggested for the adequacy of the constructed models and their good predictive capabilities. The SVC model achieved a data reduction of 92.5% for redesigning the future monitoring program and the SVR model provided a tool for the prediction of the water BOD using set of a few measurable variables. The performance of the nonlinear models (SVM, KDA, KPLS) was comparable and these performed relatively better than the corresponding linear methods (DA, PLS) of classification and regression modeling.  相似文献   

16.
17.
18.
Pharmacophore modeling and parallel screening for PPAR ligands   总被引:1,自引:0,他引:1  
We describe the generation and validation of pharmacophore models for PPARs, as well as a large scale validation of the parallel screening approach by screening PPAR ligands against a large database of structure-based models. A large test set of 357 PPAR ligands was screened against 48 PPAR models to determine the best models for agonists of PPAR-alpha, PPAR-delta, and PPAR-gamma. Afterwards, a parallel screen was performed using the 357 PPAR ligands and 47 structure-based models for PPARs, which were integrated into a 1537 models comprising in-house pharmacophore database, to assess the enrichment of PPAR ligands within the PPAR hypotheses. For these purposes, we categorized the 1537 database models into 181 protein targets and developed a score that ranks the retrieved targets for each ligand. Thus, we tried to find out if the concept of parallel screening is able to predict the correct pharmacological target for a set of compounds. The PPAR target was ranked first more often than any other target. This confirms the ability of parallel screening to forecast the pharmacological active target for a set of compounds.  相似文献   

19.
20.
Pharmacophore hypotheses were developed for six structurally diverse series of cholecystokinin-B/gastrin receptor (CCK-BR) antagonists. A training set consisting of 33 compounds was carefully selected. The activity spread of the training set molecules was from 0.1 to 2100 nM. The most predictive pharmacophore model (hypothesis 1), consisting of four features, namely, two hydrogen bond donors, one hydrophobic aliphatic, and one hydrophobic aromatic feature, had a correlation (r) of 0.884 and a root-mean-square deviation of 1.1526, and the cost difference between null cost and fixed cost was 81.5 bits. The model was validated on a test set consisting of six different series of 27 structurally diverse compounds and performed well in classifying active and inactive molecules correctly. This validation approach provides confidence in the utility of the predictive pharmacophore model developed in this work as a 3D query tool in the virtual screening of drug-like molecules to retrieve new chemical entities as potent CCK-BR antagonists. The model can also be used to predict the biological activities of compounds prior to their costly and time-consuming synthesis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号