首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到9条相似文献,搜索用时 0 毫秒
1.
The fully polarizable, multipolar, and atomistic force field protein FFLUX is being built from machine learning (i.e., kriging) models, each of which predicts an atomic property. Each atom of a given protein geometry needs to be assigned such a kriging model. Such a knowledgeable atom needs to be informed about a sufficiently large environment around it. The resulting complexity can be tackled by collecting the 20 natural amino acids into a few groups. Using substituted deca‐alanines, we present the proof‐of‐concept that a given atom's charge can be modeled by a few kriging models only. © 2017 Wiley Periodicals, Inc.  相似文献   

2.
The Quantum Chemical Topological Force Field (QCTFF) uses the machine learning method kriging to map atomic multipole moments to the coordinates of all atoms in the molecular system. It is important that kriging operates on relevant and realistic training sets of molecular geometries. Therefore, we sampled single amino acid geometries directly from protein crystal structures stored in the Protein Databank (PDB). This sampling enhances the conformational realism (in terms of dihedral angles) of the training geometries. However, these geometries can be fraught with inaccurate bond lengths and valence angles due to artefacts of the refinement process of the X‐ray diffraction patterns, combined with experimentally invisible hydrogen atoms. This is why we developed a hybrid PDB/nonstationary normal modes (NM) sampling approach called PDB/NM. This method is superior over standard NM sampling, which captures only geometries optimized from the stationary points of single amino acids in the gas phase. Indeed, PDB/NM combines the sampling of relevant dihedral angles with chemically correct local geometries. Geometries sampled using PDB/NM were used to build kriging models for alanine and lysine, and their prediction accuracy was compared to models built from geometries sampled from three other sampling approaches. Bond length variation, as opposed to variation in dihedral angles, puts pressure on prediction accuracy, potentially lowering it. Hence, the larger coverage of dihedral angles of the PDB/NM method does not deteriorate the predictive accuracy of kriging models, compared to the NM sampling around local energetic minima used so far in the development of QCTFF. © 2015 The Authors. Journal of Computational Chemistry Published by Wiley Periodicals, Inc.  相似文献   

3.
Continuing the development of the FFLUX, a multipolar polarizable force field driven by machine learning, we present a modern approach to atom‐typing and building transferable models for predicting atomic properties in proteins. Amino acid atomic charges in a peptide chain respond to the substitution of a neighboring residue and this response can be categorized in a manner similar to atom‐typing. Using a machine learning method called kriging, we are able to build predictive models for an atom that is defined, not only by its local environment, but also by its neighboring residues, for a minimal additional computational cost. We found that prediction errors were up to 11 times lower when using a model specific to the correct group of neighboring residues, with a mean prediction of ∼0.0015 au. This finding suggests that atoms in a force field should be defined by more than just their immediate atomic neighbors. When comparing an atom in a single alanine to an analogous atom in a deca‐alanine helix, the mean difference in charge is 0.026 au. Meanwhile, the same difference between a trialanine and a deca‐alanine helix is only 0.012 au. When compared to deca‐alanine models, the transferable models are up to 20 times faster to train, and require significantly less ab initio calculation, providing a practical route to modeling large biological systems. © 2016 The Authors. Journal of Computational Chemistry Published by Wiley Periodicals, Inc.  相似文献   

4.
Machine learning algorithms have been demonstrated to predict atomistic properties approaching the accuracy of quantum chemical calculations at significantly less computational cost. Difficulties arise, however, when attempting to apply these techniques to large systems, or systems possessing excessive conformational freedom. In this article, the machine learning method kriging is applied to predict both the intra‐atomic and interatomic energies, as well as the electrostatic multipole moments, of the atoms of a water molecule at the center of a 10 water molecule (decamer) cluster. Unlike previous work, where the properties of small water clusters were predicted using a molecular local frame, and where training set inputs (features) were based on atomic index, a variety of feature definitions and coordinate frames are considered here to increase prediction accuracy. It is shown that, for a water molecule at the center of a decamer, no single method of defining features or coordinate schemes is optimal for every property. However, explicitly accounting for the structure of the first solvation shell in the definition of the features of the kriging training set, and centring the coordinate frame on the atom‐of‐interest will, in general, return better predictions than models that apply the standard methods of feature definition, or a molecular coordinate frame. © 2016 The Authors. Journal of Computational Chemistry Published by Wiley Periodicals, Inc.  相似文献   

5.
We propose a general coupling of the Smooth Particle Mesh Ewald SPME approach for distributed multipoles to a short‐range charge penetration correction modifying the charge‐charge, charge‐dipole and charge‐quadrupole energies. Such an approach significantly improves electrostatics when compared to ab initio values and has been calibrated on Symmetry‐Adapted Perturbation Theory reference data. Various neutral molecular dimers have been tested and results on the complexes of mono‐ and divalent cations with a water ligand are also provided. Transferability of the correction is adressed in the context of the implementation of the AMOEBA and SIBFA polarizable force fields in the TINKER‐HP software. As the choices of the multipolar distribution are discussed, conclusions are drawn for the future penetration‐corrected polarizable force fields highlighting the mandatory need of non‐spurious procedures for the obtention of well balanced and physically meaningful distributed moments. Finally, scalability and parallelism of the short‐range corrected SPME approach are addressed, demonstrating that the damping function is computationally affordable and accurate for molecular dynamics simulations of complex bio‐ or bioinorganic systems in periodic boundary conditions. © 2016 Wiley Periodicals, Inc.  相似文献   

6.
FFLUX is a novel force field based on quantum topological atoms, combining multipolar electrostatics with IQA intraatomic and interatomic energy terms. The program FEREBUS calculates the hyperparameters of models produced by the machine learning method kriging. Calculation of kriging hyperparameters ( θ and p ) requires the optimization of the concentrated log‐likelihood . FEREBUS uses Particle Swarm Optimization (PSO) and Differential Evolution (DE) algorithms to find the maximum of . PSO and DE are two heuristic algorithms that each use a set of particles or vectors to explore the space in which is defined, searching for the maximum. The log‐likelihood is a computationally expensive function, which needs to be calculated several times during each optimization iteration. The cost scales quickly with the problem dimension and speed becomes critical in model generation. We present the strategy used to parallelize FEREBUS, and the optimization of through PSO and DE. The code is parallelized in two ways. MPI parallelization distributes the particles or vectors among the different processes, whereas the OpenMP implementation takes care of the calculation of , which involves the calculation and inversion of a particular matrix, whose size increases quickly with the dimension of the problem. The run time shows a speed‐up of 61 times going from single core to 90 cores with a saving, in one case, of ~98% of the single core time. In fact, the parallelization scheme presented reduces computational time from 2871 s for a single core calculation, to 41 s for 90 cores calculation. © 2016 The Authors. Journal of Computational Chemistry Published by Wiley Periodicals, Inc.  相似文献   

7.
8.
9.
Accurate electrostatics necessitates the use of multipole moments centered on nuclei or extra point charges centered away from the nuclei. Here, we follow the former alternative and investigate the convergence behavior of atom‐atom electrostatic interactions in the pilot protein crambin. Amino acids are cut out from a Protein Data Bank structure of crambin, as single amino acids, di, or tripeptides, and are then capped with a peptide bond at each side. The atoms in the amino acids are defined through Quantum Chemical Topology (QCT) as finite volume electron density fragments. Atom‐atom electrostatic energies are computed by means of a multipole expansion with regular spherical harmonics, up to a total interaction rank of L = ?A+ ?B + 1 = 10. The minimum internuclear distance in the convergent region of all the 15 possible types of atom‐atom interactions in crambin that were calculated based on single amino acids are close to the values calculated from di and tripeptides. Values obtained at B3LYP/aug‐cc‐pVTZ and MP2/aug‐cc‐pVTZ levels are only slightly larger than those calculated at HF/6‐31G(d,p) level. This convergence behavior is transferable to the well‐known amyloid beta polypeptide Aβ1–42. Moreover, for a selected central atom, the influence of its neighbors on its multipole moments is investigated, and how far away this influence can be ignored is also determined. Finally, the convergence behavior of AMBER becomes closer to that of QCT with increasing internuclear distance. © 2013 Wiley Periodicals, Inc.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号