首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The three-dimensional structures of proteins provide their functions and incorrect folding of its β-strands can be the cause of many diseases. There are two major approaches for determining protein structures: computational prediction and experimental methods that employ technologies such as Cryo-electron microscopy. Due to experimental methods’s high costs, extended wait times for its lengthy processes, and incompleteness of results, computational prediction is an attractive alternative. As the focus of the present paper, β-sheet structure prediction is a major portion of overall protein structure prediction. Prediction of other substructures, such as α-helices, is simpler with lower computational time complexities. Brute force methods are the most common approach and dynamic programming is also utilized to generate all possible conformations. The current study introduces the Subset Sum Approach (SSA) for the direct search space generation method, which is shown to outperform the dynamic programming approach in terms of both time and space. For the first time, the present work has calculated both the state space cardinality of the dynamic programming approach and the search space cardinality of the general brute force approaches. In regard to a set of pruning rules, SSA has demonstrated higher efficiency with respect to both time and accuracy in comparison to state-of-the-art methods.  相似文献   

2.

Computational tools can bridge the gap between sequence and protein 3D structure based on the notion that information is to be retrieved from the databases and that knowledge-based methods can help in approaching a solution of the protein-folding problem. To this aim our group has implemented neural network-based predictors capable of performing with some success in different tasks, including predictions of the secondary structure of globular and membrane proteins, the topology of membrane proteins and porins and stable f -helical segments suited for protein design. Moreover we have developed methods for predicting contact maps in proteins and the probability of finding a cysteine in a disulfide bridge, tools which can contribute to the goal of predicting the 3D structure starting from the sequence (the so called ab initio prediction). All our predictors take advantage of evolution information derived from the structural alignments of homologous (evolutionary related) proteins and taken from the sequence and structure databases. When it is necessary to build models for proteins of unknown spatial structure, which have very little homology with other proteins of known structure, non-standard techniques need to be developed and the tools for protein structure predictions may help in protein modeling. The results of a recent simulation performed in our lab highlights the role of high performing computing technology and of tools of computational biology in protein modeling and peptidomimetic design.  相似文献   

3.
Studies on protein–protein interaction are important in proteome research. How to build more effective models based on sequence information, structure information and physicochemical characteristics, is the key technology in protein–protein interface prediction. In this paper, we study the protein–protein interface prediction problem. We propose a novel method for identifying residues on interfaces from an input protein with both sequence and 3D structure information, based on hexagon structure similarity. Experiments show that our method achieves better results than some state-of-the-art methods for identifying protein–protein interface. Comparing to existing methods, our approach improves F-measure value by at least 0.03. On a common dataset consisting of 41 complexes, our method has overall precision and recall values of 63% and 57%. On Benchmark v4.0, our method has overall precision and recall values of 55% and 56%. On CAPRI targets, our method has overall precision and recall values of 52% and 55%.  相似文献   

4.
Pathogen–host interactions are very important to figure out the infection process at the molecular level, where pathogen proteins physically bind to human proteins to manipulate critical biological processes in the host cell. Data scarcity and data unavailability are two major problems for computational approaches in the prediction of pathogen–host interactions. Developing a computational method to predict pathogen–host interactions with high accuracy, based on protein sequences alone, is of great importance because it can eliminate these problems. In this study, we propose a novel and robust sequence based feature extraction method, named Location Based Encoding, to predict pathogen–host interactions with machine learning based algorithms. In this context, we use Bacillus Anthracis and Yersinia Pestis data sets as the pathogen organisms and human proteins as the host model to compare our method with sequence based protein encoding methods, which are widely used in the literature, namely amino acid composition, amino acid pair, and conjoint triad. We use these encoding methods with decision trees (Random Forest, j48), statistical (Bayesian Networks, Naive Bayes), and instance based (kNN) classifiers to predict pathogen–host interactions. We conduct different experiments to evaluate the effectiveness of our method. We obtain the best results among all the experiments with RF classifier in terms of F1, accuracy, MCC, and AUC.  相似文献   

5.
6.
A series of three fluorine containing and three non-fluorinated Diels–Alder step-growth polyarylene polymers and copolymers was synthesized via conventional oil bath heating (days/weeks). A drastic time reduction was realized with a microwave-assisted polymerization (hours). The polymers were characterized by multinuclear (1H, 13C, and 19F) NMR and attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy, thermal analysis (thermogravimetric analysis [TGA], differential scanning calorimetry [DSC], and dynamic mechanical analysis [DMA]), gel permeation chromatography, X-ray diffraction (XRD), water contact analysis, and refractive index (RI) measurements. The NMR spectra indicated a mixture of para and meta conformations through the polymer backbone increasing to more para with greater fluorine content. TGA revealed the fluorine-containing polyarylenes possessed the highest char yields at almost 80% at 1000°C under nitrogen, and all the polyarylenes possessed onset of degradation temperatures above 550°C under nitrogen and air atmospheres. XRD analysis indicated more ordering for the fluorine-containing polyarylenes which afforded the high char yields. DMA gave storage moduli values in the range of 1–10 GPa for the polyarylenes. Molecular weights for all samples were above 100 kg/mol. Water contact angles did not change with fluorine content due to the shielding effect of the pendant phenyl groups. However, the RI decreased to 1.6497 at 632.8 nm for the polyarylene with the highest fluorine content.  相似文献   

7.
Quantitative structure–activity relationship (QSAR), a regression modeling methodology that establishes statistical correlation between structure feature and apparent behavior for a series of congeneric molecules quantitatively, has been widely used to evaluate the activity, toxicity and property of various small-molecule compounds such as drugs, toxicants and surfactants. However, it is surprising to see that such useful technique has only very limited applications to biomacromolecules, albeit the solved 3D atom-resolution structures of proteins, nucleic acids and their complexes have accumulated rapidly in past decades. Here, we present a proof-of-concept paradigm for the modeling, prediction and interpretation of the binding affinity of 144 sequence-nonredundant, structure-available and affinity-known protein complexes (Kastritis et al. Protein Sci 20:482–491, 2011) using a biomacromolecular QSAR (BioQSAR) scheme. We demonstrate that the modeling performance and predictive power of BioQSAR are comparable to or even better than that of traditional knowledge-based strategies, mechanism-type methods and empirical scoring algorithms, while BioQSAR possesses certain additional features compared to the traditional methods, such as adaptability, interpretability, deep-validation and high-efficiency. The BioQSAR scheme could be readily modified to infer the biological behavior and functions of other biomacromolecules, if their X-ray crystal structures, NMR conformation assemblies or computationally modeled structures are available.  相似文献   

8.
9.
There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some researches utilize the proteins interaction information to predict function for un-annotated proteins. In this paper, we propose a novel approach called “Neighbor Relativity Coefficient” (NRC) based on interaction network topology which estimates the functional similarity between two proteins. NRC is calculated for each pair of proteins based on their graph-based features including distance, common neighbors and the number of paths between them. In order to ascribe function to an un-annotated protein, NRC estimates a weight for each neighbor to transfer its annotation to the unknown protein. Finally, the unknown protein will be annotated by the top score transferred functions. We also investigate the effect of using different coefficients for various types of functions. The proposed method has been evaluated on Saccharomyces cerevisiae and Homo sapiens interaction networks. The performance analysis demonstrates that NRC yields better results in comparison with previous protein function prediction approaches that utilize interaction network.  相似文献   

10.
11.
12.
13.
Identifying protein–RNA binding residues is essential for understanding the mechanism of protein–RNA interactions. So far, rigid distance thresholds are commonly used to define protein–RNA binding residues. However, after investigating 182 non-redundant protein–RNA complexes, we find that it would be unsuitable for a certain amount of complexes since the distances between proteins and RNAs vary widely. In this work, a novel definition method was proposed based on a flexible distance cutoff. This method can fully consider the individual differences among complexes by setting a variable tolerance limit of protein–RNA interactions, i.e. the double minimum-distance by which different distance thresholds are achieved for different complexes. In order to validate our method, a comprehensive comparison between our flexible method and traditional rigid methods was implemented in terms of interface structure, amino acid composition, interface area and interaction force, etc. The results indicate that this method is more reasonable because it incorporates the specificity of different complexes by extracting the important residues lost by rigid distance methods and discarding some redundant residues. Finally, to further test our double minimum-distance definition strategy, we developed a classifier to predict those binding sites derived from our new method by using structural features and a random forest machine learning algorithm. The model achieved a satisfactory prediction performance and the accuracy on independent data sets reaches to 85.0%. To the best of our knowledge, it is the first prediction model to define positive and negative samples using a flexible cutoff. So the comparison analysis and modeling results have demonstrated that our method would be a very promising strategy for more precisely defining protein–RNA binding sites.  相似文献   

14.
15.
16.
One of the most promising anticancer and recent antimalarial targets is the heterodimeric zinc-containing protein farnesyltransferase (FT). In this work, we studied a highly diverse series of 192 Abbott-initiated imidazole-containing compounds and their FT inhibitory activities using 3D-QSAR and docking, in order to gain understanding of the interaction of these inhibitors with FT to aid development of a rational strategy for further lead optimization. We report several highly significant and predictive CoMFA and CoMSIA models. The best model, composed of CoMFA steric and electrostatic fields combined with CoMSIA hydrophobic and H-bond acceptor fields, had r 2 = 0.878, q 2 = 0.630, and r pred2 = 0.614. Docking studies on the statistical outliers revealed that some of them had a different binding mode in the FT active site based on steric bulk and available active site space, explaining why the predicted activities differed from the experimental activities. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

17.
Heparanase (HPSE) is an endo-β-d-glucuronidase that has diverse functions in mammals which includes cell survival, cell adhesion and cell migration. HPSE features both enzymatic and non-enzymatic functionalities in a pH dependent manner. Hence, in this study, an extensive molecular dynamics simulation, molecular docking, protein Angular dispersion analysis were performed for apo form and holo forms to understand its conformational changes at varied pH conditions. On comparative conformational analysis of apo and holo forms, it was inferred that the HSPE has undergone pH dependent structural changes, thereby affecting the binding of Heparan sulfate proteoglycan (HSPG). Moreover, HPSE also showed favourable structural changes for optimal binding of HSPG at pH 5.0 and 6.0, as inferred from functional flap displacements within HPSE. Thus, this study provides significant insights on optimal pH for HPSE to exhibit its enzymatic activity. The outcome of this study shall aid in ideal lead generation for targeting HPSE mediated disease conditions.  相似文献   

18.
19.
Wang  Q.  Chen  H. M.  Li  M. X.  Geng  D. L.  Wang  H. P. 《Journal of Thermal Analysis and Calorimetry》2021,145(5):2287-2294
Journal of Thermal Analysis and Calorimetry - The knowledge of specific heat for Ag–Si–Ge alloys in a broad temperature range would facilitate their practical applications in various...  相似文献   

20.
Glucocerebrosidase (GCase, acid β-Glucosidase) hydrolyzes the sphingolipid glucosylceramide into glucose and ceramide. Mutations in this enzyme lead to a lipid metabolism disorder known as Gaucher disease. The design of competitive inhibitors of GCase is a promising field of research for the design of pharmacological chaperones as new therapeutic agents. Using a series of recently reported molecules with experimental binding affinities for GCase in the nanomolar to micromolar range, we here report an extensive theoretical analysis of their binding mode. On the basis of molecular docking, molecular dynamics, and binding free energy calculations using the linear interaction energy method (LIE), we provide details on the molecular interactions supporting ligand binding in the different families of compounds. The applicability of other computational approaches, such as the COMBINE methodology, is also investigated. The results show the robustness of the standard parametrization of the LIE method, which reproduces the experimental affinities with a mean unsigned error of 0.7 kcal/mol. Several structure-activity relationships are established using the computational models here provided, including the identification of hot spot residues in the binding site. The models derived are envisaged as important tools in ligand-design programs for GCase inhibitors.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号