首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Protein–peptide interactions are essential for all cellular processes including DNA repair, replication, gene‐expression, and metabolism. As most protein – peptide interactions are uncharacterized, it is cost effective to investigate them computationally as the first step. All existing approaches for predicting protein – peptide binding sites, however, are based on protein structures despite the fact that the structures for most proteins are not yet solved. This article proposes the first machine‐learning method called SPRINT to make Sequence‐based prediction of Protein – peptide Residue‐level Interactions. SPRINT yields a robust and consistent performance for 10‐fold cross validations and independent test. The most important feature is evolution‐generated sequence profiles. For the test set (1056 binding and non‐binding residues), it yields a Matthews’ Correlation Coefficient of 0.326 with a sensitivity of 64% and a specificity of 68%. This sequence‐based technique shows comparable or more accurate than structure‐based methods for peptide‐binding site prediction. SPRINT is available as an online server at: http://sparks-lab.org/ . © 2016 Wiley Periodicals, Inc.  相似文献   

3.
Analyses of known protein–ligand interactions play an important role in designing novel and efficient drugs, contributing to drug discovery and development. Recently, machine learning methods have proven useful in the design of novel drugs, which utilize intelligent techniques to predict the outcome of unknown protein–ligand interactions by learning from the physical and geometrical properties of known protein–ligand interactions. The aim of this study is to work through a specific example of a novel computational method, namely compressed images for affinity prediction (CIFAP), in which binding affinities for structurally related ligands in complexes with human checkpoint kinase 1 (CHK1) are predicted. The CIFAP algorithm presented here relates published pIC 50 values of 57 compounds, derived from a thienopyridine pharmacophore, in complexes with CHK1 to their two‐dimensional (2D) electrostatic potential images compressed in orthogonal dimensions. Patterns obtained from the 2D images are then used as inputs in regression and learning algorithms such as support vector regression (SVR) and adaptive neuro‐fuzzy inference system (ANFIS) methods to validate the experimental pIC 50 values. This study revealed that the 2D image pixels in the vicinity of bound ligand surfaces provide more relevant information to make correlations with the empirical pIC 50 values. As compared with ANFIS, SVR gave rise to the lowest root mean square errors and the greatest correlations, suggesting that SVR could be a plausible choice of machine learning methods in predicting binding affinities by CIFAP. Copyright © 2013 John Wiley & Sons, Ltd.  相似文献   

4.
Malonylation is a recently discovered post‐translational modification (PTM) in which a malonyl group attaches to a lysine (K) amino acid residue of a protein. In this work, a novel machine learning model, SPRINT‐Mal, is developed to predict malonylation sites by employing sequence and predicted structural features. Evolutionary information and physicochemical properties are found to be the two most discriminative features whereas a structural feature called half‐sphere exposure provides additional improvement to the prediction performance. SPRINT‐Mal trained on mouse data yields robust performance for 10‐fold cross validation and independent test set with Area Under the Curve (AUC) values of 0.74 and 0.76 and Matthews’ Correlation Coefficient (MCC) of 0.213 and 0.20, respectively. Moreover, SPRINT‐Mal achieved comparable performance when testing on H. sapiens proteins without species‐specific training but not in bacterium S. erythraea. This suggests similar underlying physicochemical mechanisms between mouse and human but not between mouse and bacterium. SPRINT‐Mal is freely available as an online server at: http://sparks-lab.org/server/SPRINT-Mal/ . © 2018 Wiley Periodicals, Inc.  相似文献   

5.
Self‐organizing maps (SOMs) (in particular, Matrix reOrganization Layout to Map Analytical Patterns (MOLMAP)) were used to unravel the main patterns in a three‐way dataset after a preliminary unfolding of the cube. Eleven sites of the ría of Vigo (NW of Spain) were monitored during the last decade (from 2000 to 2010) to assess pollution trends in this area. Twelve trace metals (Hg, Pb, Cd, Cu, Zn, Cr, As, Li, Fe, Al, Ni and Mn), the total organic carbon and the percentage of fine particles were measured. Results from MOLMAP, the SOM‐based approach, were compared to those of three established alternatives: parallel factor analysis, matrix‐augmented principal component analysis and generalized Procrustes rotation, the latter two employing unfolding as well. MOLMAP showed the best capabilities to differentiate groups of samples. The spatial and temporal trends, as well as the analytical variables causing them, were almost the same for all methods, which confirms MOLMAP as a simple and reliable methodology to treat three‐way environmental datasets. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

6.
Carbohydrate‐binding proteins (CBPs) are potential biomarkers and drug targets. However, the interactions between carbohydrates and proteins are challenging to study experimentally and computationally because of their low binding affinity, high flexibility, and the lack of a linear sequence in carbohydrates as exists in RNA, DNA, and proteins. Here, we describe a structure‐based function‐prediction technique called SPOT‐Struc that identifies carbohydrate‐recognizing proteins and their binding amino acid residues by structural alignment program SPalign and binding affinity scoring according to a knowledge‐based statistical potential based on the distance‐scaled finite‐ideal gas reference state (DFIRE). The leave‐one‐out cross‐validation of the method on 113 carbohydrate‐binding domains and 3442 noncarbohydrate binding proteins yields a Matthews correlation coefficient of 0.56 for SPalign alone and 0.63 for SPOT‐Struc (SPalign + binding affinity scoring) for CBP prediction. SPOT‐Struc is a technique with high positive predictive value (79% correct predictions in all positive CBP predictions) with a reasonable sensitivity (52% positive predictions in all CBPs). The sensitivity of the method was changed slightly when applied to 31 APO (unbound) structures found in the protein databank (14/31 for APO versus 15/31 for HOLO). The result of SPOT‐Struc will not change significantly if highly homologous templates were used. SPOT‐Struc predicted 19 out of 2076 structural genome targets as CBPs. In particular, one uncharacterized protein in Bacillus subtilis (1oq1A) was matched to galectin‐9 from Mus musculus. Thus, SPOT‐Struc is useful for uncovering novel carbohydrate‐binding proteins. SPOT‐Struc is available at http://sparks‐lab.org . © 2014 Wiley Periodicals, Inc.  相似文献   

7.
The fast pulling ligand (FPL) out of binding cavity using non‐equilibrium molecular dynamics (MD) simulations was demonstrated to be a rapid, accurate and low CPU demand method for the determination of the relative binding affinities of a large number of HIV‐1 protease (PR) inhibitors. In this approach, the ligand is pulled out of the binding cavity of the protein using external harmonic forces, and the work of pulling force corresponds to the relative binding affinity of HIV‐1 PR inhibitor. The correlation coefficient between the pulling work and the experimental binding free energy of shows that FPL results are in good agreement with experiment. It is thus easier to rank the binding affinities of HIV‐1 PR inhibitors, that have similar binding affinities because the mean error bar of pulling work amounts to . The nature of binding is discovered using the FPL approach. © 2016 Wiley Periodicals, Inc.  相似文献   

8.
9.
10.
We have developed a two‐dimensional replica‐exchange method for the prediction of protein–ligand binding structures. The first dimension is the umbrella sampling along the reaction coordinate, which is the distance between a protein binding pocket and a ligand. The second dimension is the solute tempering, in which the interaction between a ligand and a protein and water is weakened. The second dimension is introduced to make a ligand follow the umbrella potential more easily and enhance the binding events, which should improve the sampling efficiency. As test cases, we applied our method to two protein‐ligand complex systems (MDM2 and HSP 90‐alpha). Starting from the configuration in which the protein and the ligand are far away from each other in each system, our method predicted the ligand binding structures in excellent agreement with the experimental data from Protein Data Bank much faster with the improved sampling efficiency than the replica‐exchange umbrella sampling method that we have previously developed. © 2013 Wiley Periodicals, Inc.  相似文献   

11.
12.
13.
In the drug discovery process, accurate methods of computing the affinity of small molecules with a biological target are strongly needed. This is particularly true for molecular docking and virtual screening methods, which use approximated scoring functions and struggle in estimating binding energies in correlation with experimental values. Among the various methods, MM‐PBSA and MM‐GBSA are emerging as useful and effective approaches. Although these methods are typically applied to large collections of equilibrated structures of protein‐ligand complexes sampled during molecular dynamics in water, the possibility to reliably estimate ligand affinity using a single energy‐minimized structure and implicit solvation models has not been explored in sufficient detail. Herein, we thoroughly investigate this hypothesis by comparing different methods for the generation of protein‐ligand complexes and diverse methods for free energy prediction for their ability to correlate with experimental values. The methods were tested on a series of structurally diverse inhibitors of Plasmodium falciparum DHFR with known binding mode and measured affinities. The results showed that correlations between MM‐PBSA or MM‐GBSA binding free energies with experimental affinities were in most cases excellent. Importantly, we found that correlations obtained with the use of a single protein‐ligand minimized structure and with implicit solvation models were similar to those obtained after averaging over multiple MD snapshots with explicit water molecules, with consequent save of computing time without loss of accuracy. When applied to a virtual screening experiment, such an approach proved to discriminate between true binders and decoy molecules and yielded significantly better enrichment curves. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010  相似文献   

14.
15.
New empirical scoring functions have been developed to estimate the binding affinity of a given protein-ligand complex with known three-dimensional structure. These scoring functions include terms accounting for van der Waals interaction, hydrogen bonding, deformation penalty, and hydrophobic effect. A special feature is that three different algorithms have been implemented to calculate the hydrophobic effect term, which results in three parallel scoring functions. All three scoring functions are calibrated through multivariate regression analysis of a set of 200 protein-ligand complexes and they reproduce the binding free energies of the entire training set with standard deviations of 2.2 kcal/mol, 2.1 kcal/mol, and 2.0 kcal/mol, respectively. These three scoring functions are further combined into a consensus scoring function, X-CSCORE. When tested on an independent set of 30 protein-ligand complexes, X-CSCORE is able to predict their binding free energies with a standard deviation of 2.2 kcal/mol. The potential application of X-CSCORE to molecular docking is also investigated. Our results show that this consensus scoring function improves the docking accuracy considerably when compared to the conventional force field computation used for molecular docking.  相似文献   

16.
17.
Macroporous epoxy cryogels can be used as an alternative for classical matrices in affinity chromatography. Due to the structural properties of cryogels, with pores of up to 100 μm, crude samples can be processed at high speed without previous manipulations such as clarification or centrifugation. Also, we previously used a peptide‐expressing M13 bacteriophage as an affinity ligand. These ligands show high specificity toward the target to be purified. Combination of both, leads to a relative cost‐effective one‐step chromatographic set‐up delivering a high purity sample (>95%), however, so far with limited capacity. To increase the binding capacity of the affinity columns, we now inserted spacers between the chromatographic matrix and the phage ligand. Both linear spacers, di‐amino‐alkanes (C2–C10), and branched polyethyleneimine spacers with different molecular weights (800 Da–10 kDa) were analyzed. Two types of peptide expressing phage ligands, a linear 15‐mer and a cyclic 6‐mer, were used for screening. Up to a tenfold increase in binding capacity was observed depending on the combination of phage ligand and spacer type.  相似文献   

18.
19.
γ‐Secretase inhibitors have been explored for the prevention and treatment of Alzheimer's disease (AD). Methods for prediction and screening of γ‐secretase inhibitors are highly desired for facilitating the design of novel therapeutic agents against AD, especially when incomplete knowledge about the mechanism and three‐dimensional structure of γ‐secretase. We explored two machine learning methods, support vector machine (SVM) and random forest (RF), to develop models for predicting γ‐secretase inhibitors of diverse structures. Quantitative analysis of the receiver operating characteristic (ROC) curve was performed to further examine and optimize the models. Especially, the Youden index (YI) was initially introduced into the ROC curve of RF so as to obtain an optimal threshold of probability for prediction. The developed models were validated by an external testing set with the prediction accuracies of SVM and RF 96.48 and 98.83% for γ‐secretase inhibitors and 98.18 and 99.27% for noninhibitors, respectively. The different feature selection methods were used to extract the physicochemical features most relevant to γ‐secretase inhibition. To the best of our knowledge, the RF model developed in this work is the first model with a broad applicability domain, based on which the virtual screening of γ‐secretase inhibitors against the ZINC database was performed, resulting in 368 potential hit candidates. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010  相似文献   

20.
Supermolecular complexes formed by oligophenyleneethynylene derivatives and isophthalic acid were studied using AM1 method to obtain binding energy. Electronic spectra and IR spectra of the complexes were calculated by INDO/CIS and AM1 methods based on AM1 geometries. Results indicated that the dimer could be formed by the monomers via hydrogen bonding because of the negative binding energy. Binding energy of the complexes was affected by electronegativity and steric effects of the substituents. The first UV absorptions and IR frequencies of N-H bonds of the complexes were both red-shifted compared with those of the monomers. The complexes could bind small molecules via hydrogen bonds, resulting in the change in UV absorptions and an increase in IR frequencies of N-H bonds.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号