首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Coarse‐grained protein structure models offer increased efficiency in structural modeling, but these must be coupled with fast and accurate methods to revert to a full‐atom structure. Here, we present a novel algorithm to reconstruct mainchain models from C traces. This has been parameterized by fitting Gaussian mixture models (GMMs) to short backbone fragments centered on idealized peptide bonds. The method we have developed is statistically significantly more accurate than several competing methods, both in terms of RMSD values and dihedral angle differences. The method produced Ramachandran dihedral angle distributions that are closer to that observed in real proteins and better Phaser molecular replacement log‐likelihood gains. Amino acid residue sidechain reconstruction accuracy using SCWRL4 was found to be statistically significantly correlated to backbone reconstruction accuracy. Finally, the PD2 method was found to produce significantly lower energy full‐atom models using Rosetta which has implications for multiscale protein modeling using coarse‐grained models. A webserver and C++ source code is freely available for noncommercial use from: http://www.sbg.bio.ic.ac.uk/phyre2/PD2_ca2main/ . © 2013 Wiley Periodicals, Inc.  相似文献   

2.
We introduce PULCHRA, a fast and robust method for the reconstruction of full-atom protein models starting from a reduced protein representation. The algorithm is particularly suitable as an intermediate step between coarse-grained model-based structure prediction and applications requiring an all-atom structure, such as molecular dynamics, protein-ligand docking, structure-based function prediction, or assessment of quality of the predicted structure. The accuracy of the method was tested on a set of high-resolution crystallographic structures as well as on a set of low-resolution protein decoys generated by a protein structure prediction algorithm TASSER. The method is implemented as a standalone program that is available for download from http://cssb.biology.gatech.edu/skolnick/files/PULCHRA.  相似文献   

3.
Recent development of nuclear magnetic resonance (NMR) techniques provided new types of structural restraints that can be successfully used in fast and low‐cost global protein fold determination. Here, we present CABS‐NMR, an efficient protein modeling tool, which takes advantage of such structural restraints. The restraints are converted from original NMR data to fit the coarse grained protein representation of the C‐Alpha‐Beta‐Side‐group (CABS) algorithm. CABS is a Monte Carlo search algorithm that uses a knowledge‐based force field. Its versatile structure enables a variety of protein‐modeling protocols, including purely de novo folding, folding guided by restraints derived from template structures or, structure assembly based on experimental data. In particular, CABS‐NMR uses the distance and angular restraints set derived from various NMR experiments. This new modeling technique was successfully tested in structure determination of 10 globular proteins of size up to 216 residues, for which sparse NMR data were available. Additional detailed analysis was performed for a S100A1 protein. Namely, we successfully predicted Nuclear Overhauser Effect signals on the basis of low‐energy structures obtained from chemical shifts by CABS‐NMR. It has been observed that utility of chemical shifts and other types of experimental data (i.e. residual dipolar couplings and methyl‐methyl Nuclear Overhauser Effect signals) in the presented modeling pipeline depends mainly on size of a protein and complexity of its topology. In this work, we have provided tools for either post‐experiment processing of various kinds of NMR data or fast and low‐cost structural analysis in the still challenging field of new fold predictions. © 2010 Wiley Periodicals, Inc. J Comput Chem, 2011  相似文献   

4.
Protein modeling tools utilize many kinds of structural information that may be predicted from amino acid sequence of a target protein or obtained from experiments. Such data provide geometrical constraints in a modeling process. The main aim is to generate the best possible consensus structure. The quality of models strictly depends on the imposed conditions. In this work we present an algorithm, which predicts short-range distances between Cα atoms as well as a set of short structural fragments that possibly share structural similarity with a query sequence. The only input of the method is a query sequence profile. The algorithm searches for short protein fragments with high sequence similarity. As a result a statistics of distances observed in the similar fragments is returned. The method can be used also as a scoring function or a short-range knowledge-based potential based on the computed statistics.  相似文献   

5.
The conversion of polymer parameterization from internal coordinates (bond lengths, angles, and torsions) to Cartesian coordinates is a fundamental task in molecular modeling, often performed using the natural extension reference frame (NeRF) algorithm. NeRF can be parallelized to process multiple polymers simultaneously, but is not parallelizable along the length of a single polymer. A mathematically equivalent algorithm, pNeRF, has been derived that is parallelizable along a polymer's length. Empirical analysis demonstrates an order-of-magnitude speed up using modern GPUs and CPUs. In machine learning-based workflows, in which partial derivatives are backpropagated through NeRF equations and neural network primitives, switching to pNeRF can reduce the fractional computational cost of coordinate conversion from over two-thirds to around 10%. An optimized TensorFlow-based implementation of pNeRF is available on GitHub at https://github.com/aqlaboratory/pnerf © 2018 Wiley Periodicals, Inc.  相似文献   

6.
We present a computational protein design algorithm for finding low-energy sequences of fixed amino acid composition. The search algorithms used in protein design typically do not restrict amino acid composition. However, the random energy model of Shakhnovich suggests that the use of fixed-composition sequences may circumvent defects in the modeling of the denatured state. Our algorithm, FC_FASTER, links fixed-composition versions of Monte Carlo and the FASTER algorithm. As proof of principle, FC_FASTER was tested on an experimentally validated, full-sequence design of the beta1 domain of protein G. For the wild-type composition, FC_FASTER found a lower energy sequence than the experimentally validated sequence. Also, for a different composition, FC_FASTER found the hypothetical lowest-energy sequence in 14 out of 32 trials.  相似文献   

7.
We present a highly parallel algorithm to convert internal coordinates of a polymeric molecule into Cartesian coordinates. Traditionally, converting the structures of polymers (e.g., proteins) from internal to Cartesian coordinates has been performed serially, due to an inherent linear dependency along the polymer chain. We show this dependency can be removed using a tree-based concatenation of coordinate transforms between segments, and then parallelized efficiently on graphics processing units (GPUs). The conversion algorithm is applicable to protein engineering and fitting protein structures to experimental data, and we observe an order of magnitude speedup using parallel processing on a GPU compared to serial execution on a CPU.  相似文献   

8.
Proteins of both hyperthermophilic and mesophilic microorganisms generally constitute from the same 20 amino acids; however, the extent of thermal tolerance of any given protein is an inherent property of its amino acid sequence. The present study is the first to report a rapid method for predicting Tm (melting temperature), the temperature at which 50% of the protein is unfolded, directly from protein sequences (the Tm Index program is available at http://tm.life.nthu.edu.tw/). We examined 75 complete microbial genomes using the Tm Index, and the analysis clearly differentiated hyperthermophilic from mesophilic microorganisms on this global genomic basis. These results are consistent with the previous hypothesis that hyperthermophiles express a greater number of high Tm proteins compared with mesophiles. The Tm Index will be valuable for modifying existing proteins (enzymes, protein drugs and vaccines) or designing novel proteins having a desired melting temperature.  相似文献   

9.
The metabolic pathway of butylated hydroxytoluene (BHT) to the ring-oxygenated metabolites 2,6-di-tert-butylhydroquinone (BHQ) and 2,6-di-tert-butyl-p-benzoquinone (BBQ) was examined in rats. After intraperitoneal administration of 3,5-di-tert-butyl-4-hydroxybenzoic acid (BHT-acid), which had been regarded as one of the major metabolic end-products of BHT, 2,6-di-tert-butylphenol (DBP) and BBQ were identified in the feces by gas chromatography and gas chromatography-mass spectrometry (GC-MS). The biliary excretion of BBQ, BHQ glucuronide and BHT-acid glucuronide was also confirmed by GC-MS and high-performance liquid chromatography methods. The excretion rate of BHQ glucuronide for 24h after dosing with BHT-acid was about 9-fold higher than that after dosing with BHT. In addition, the formation of BBQ was confirmed in the urine after dosing with DBP. These results suggest that BHT is metabolized to BHQ and BBQ through DBP formed by decarboxylation of BHT-acid.  相似文献   

10.
In this article, we present a new approach to expand the range of application of protein‐ligand docking methods in the prediction of the interaction of coordination complexes (i.e., metallodrugs, natural and artificial cofactors, etc.) with proteins. To do so, we assume that, from a pure computational point of view, hydrogen bond functions could be an adequate model for the coordination bonds as both share directionality and polarity aspects. In this model, docking of metalloligands can be performed without using any geometrical constraints or energy restraints. The hard work consists in generating the convenient atom types and scoring functions. To test this approach, we applied our model to 39 high‐quality X‐ray structures with transition and main group metal complexes bound via a unique coordination bond to a protein. This concept was implemented in the protein‐ligand docking program GOLD. The results are in very good agreement with the experimental structures: the percentage for which the RMSD of the simulated pose is smaller than the X‐ray spectra resolution is 92.3% and the mean value of RMSD is < 1.0 Å. Such results also show the viability of the method to predict metal complexes–proteins interactions when the X‐ray structure is not available. This work could be the first step for novel applicability of docking techniques in medicinal and bioinorganic chemistry and appears generalizable enough to be implemented in most protein‐ligand docking programs nowadays available. © 2017 Wiley Periodicals, Inc.  相似文献   

11.
12.
Recent investigations on the stability of proteins have demonstrated various structural factors, but few have considered sequence factors such as protein motifs. These motifs represent highly conserved regions and describe critical regions that may only exist on proteins that remain functional at high temperatures. This investigation presents a method for identifying and comparing corresponding mesophilic and thermophilic sequence motifs between protein families. Discriminative motifs that are conserved only in the mesophilic or thermophilic subfamily are identified. Analysis of the results shows that, although the subfamilies of most protein families share similar motifs, some discriminative motifs are present in particular thermophilic/mesophilic subfamilies. The thermophilic discriminative motifs are conserved only in thermophilic organisms, revealing that physiochemical principles support thermostability.  相似文献   

13.
The function of protein, RNA, and DNA is modulated by fast, dynamic exchanges between three‐dimensional conformations. Conformational sampling of biomolecules with exact and nullspace inverse kinematics, using rotatable bonds as revolute joints and noncovalent interactions as holonomic constraints, can accurately characterize these native ensembles. However, sampling biomolecules remains challenging owing to their ultra‐high dimensional configuration spaces, and the requirement to avoid (self‐) collisions, which results in low acceptance rates. Here, we present two novel mechanisms to overcome these limitations. First, we introduce temporary constraints between near‐colliding links. The resulting constraint varieties instantaneously redirect the search for collision‐free conformations, and couple motions between distant parts of the linkage. Second, we adapt a randomized Poisson‐disk motion planner, which prevents local oversampling and widens the search, to ultra‐high dimensions. Tests on several model systems show that the sampling acceptance rate can increase from 16% to 70%, and that the conformational coverage in loop modeling measured as average closeness to existing loop conformations doubled. Correlated protein motions identified with our algorithm agree with those from MD simulations. © 2018 Wiley Periodicals, Inc.  相似文献   

14.
Nanodiscs are discoidal protein–lipid complexes that have wide applications in membrane protein studies. Modeling and simulation of nanodiscs are challenging due to the absence of structures of many membrane scaffold proteins (MSPs) that wrap around the membrane bilayer. We have developed CHARMM-GUI Nanodisc Builder ( http://www.charmm-gui.org/input/nanodisc ) to facilitate the setup of nanodisc simulation systems by modeling the MSPs with defined size and known structural features. A total of 11 different nanodiscs with a diameter from 80 to 180 Å are made available in both the all-atom CHARMM and two coarse-grained (PACE and Martini) force fields. The usage of the Nanodisc Builder is demonstrated with various simulation systems. The structures and dynamics of proteins and lipids in these systems were analyzed, showing similar behaviors to those from previous all-atom and coarse-grained nanodisc simulations. We expect the Nanodisc Builder to be a convenient and reliable tool for modeling and simulation of nanodisc systems. © 2019 Wiley Periodicals, Inc.  相似文献   

15.
A method for generating a full backbone protein structure from the coordinates of α-carbons, is presented. The method extracts information from known protein structures to generate statistical positions for the reconstructed atoms. Tests on a set of proteins structures show the algorithm to be of comparable accuracy to existing procedures. However, the basic advantage of the presented method is its simplicity and speed. In a test run, the present program is shown to be much faster than existing database searching algorithms, and reconstructs about 8000 residues per second. Thus, it may be included as an independent procedure in protein folding algorithms to rapidly generate approximate coordinates of backbone atoms. © 1997 by John Wiley & Sons, Inc.  相似文献   

16.
The ProBiS algorithm performs a local structural comparison of the query protein surface against the nonredundant database of protein structures. It finds proteins that have binding sites in common with the query protein. Here, we present a new parallelized algorithm, Parallel‐ProBiS, for detecting similar binding sites on clusters of computers. The obtained speedups of the parallel ProBiS scale almost ideally with the number of computing cores up to about 64 computing cores. Scaling is better for larger than for smaller query proteins. For a protein with almost 600 amino acids, the maximum speedup of 180 was achieved on two interconnected clusters with 248 computing cores. Source code of Parallel‐ProBiS is available for download free for academic users at http://probis.cmm.ki.si/download . © 2012 Wiley Periodicals, Inc.  相似文献   

17.
蛋白质磷酸化修饰是一种重要的蛋白质翻译后修饰,在细胞代谢过程中发挥着重要作用。当蛋白质的正常磷酸化调节发生异常时,会导致癌症、糖尿病、心脏病等各种疾病的发生。因此,蛋白磷酸化分析对于疾病的早期快速诊断、药物筛选和治疗等方面具有重大的意义。由于蛋白质磷酸化过程是动态的,并且磷酸化肽段或蛋白在生物样品中的含量较低,因此高灵敏的蛋白磷酸化分析面临着巨大的挑战。该文依据在检测过程中,选择性识别或捕获磷酸化的肽段或蛋白的主要机理,综述了近几年纳米材料对磷酸化肽段的富集和信号放大作用在蛋白磷酸化分析中的研究进展,并对其未来研究方向进行了展望。  相似文献   

18.
We previously reported a quinoxalin-2-one compound (Compound 1) that had inhibitory activity equivalent to existing platelet-derived growth factor-beta receptor (PDGFbeta R) inhibitors. Lead optimization of Compound 1 to increase its activity and selectivity, using structural information regarding PDGFbeta R-ligand interactions, is urgently needed. Here we present models of the PDGFbeta R kinase domain complexed with quinoxalin-2-one derivatives. The models were constructed using comparative modeling, molecular dynamics (MD) and ligand docking. In particular, conformations derived from MD, and ligand binding site information presented by alpha-spheres in the pre-docking processing, allowed us to identify optimal protein structures for docking of target ligands. By carrying out molecular modeling and MD of PDGFbeta R in its inactive state, we obtained two structural models having good Compound 1 binding potentials. In order to distinguish the optimal candidate, we evaluated the structural activity relationships (SAR) between the ligand-binding free energies and inhibitory activity values (IC50 values) for available quinoxalin-2-one derivatives. Consequently, a final model with a high SAR was identified. This model included a molecular interaction between the hydrophobic pocket behind the ATP binding site and the substitution region of the quinoxalin-2-one derivatives. These findings should prove useful in lead optimization of quinoxalin-2-one derivatives as PDGFb R inhibitors.  相似文献   

19.
The conversion of proteins between internal and cartesian coordinates is a limiting step in many pipelines, such as molecular dynamics simulations and machine learning models. This conversion is typically carried out by sequential or parallel applications of the Natural extension of Reference Frame (NeRF) algorithm. This work proposes a massively parallel NeRF implementation which, depending on the polymer length, achieves speedups between 400 and 1200× over the previous state-of-the-art. It accomplishes this by dividing the conversion into three main phases: parallel composition of the monomer backbone, assembly of backbone subunits, and parallel elongation of sidechains; and by batching these computations into a minimal number of efficient matrix operations. Special emphasis is placed on reusability and ease of use. We open source the code (available at https://github.com/EleutherAI/mp_nerf ) and provide a corresponding python package.  相似文献   

20.
Protein complex detection from protein–protein interaction (PPI) network has received a lot of focus in recent years. A number of methods identify protein complexes as dense sub-graphs using network information while several other methods detect protein complexes based on topological information. While the methods based on identifying dense sub-graphs are more effective in identifying protein complexes, not all protein complexes have high density. Moreover, existing methods focus more on static PPI networks and usually overlook the dynamic nature of protein complexes. Here, we propose a new method, Weighted Edge based Clustering (WEC), to identify protein complexes based on the weight of the edge between two interacting proteins, where the weight is defined by the edge clustering coefficient and the gene expression correlation between the interacting proteins. Our WEC method is capable of detecting highly inter-connected and co-expressed protein complexes. The experimental results of WEC on three real life data shows that our method can detect protein complexes effectively in comparison with other highly cited existing methods.Availability: The WEC tool is available at http://agnigarh.tezu.ernet.in/~rosy8/shared.html.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号