首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 93 毫秒
1.
In general, the docking scoring tends to have a size dependence related to the ranking of compounds. In this paper, we describe a novel method of parameter optimization for docking scores which reduce the size dependence and can efficiently discriminate active compounds from chemical databases. This method is based on a simplified theoretical model of docking scores which enables us to utilize large amounts of data of known active and inactive compounds for a particular target without requiring large computational resources or a complicated procedure. This method is useful for making scoring functions for the identification of novel scaffolds using the knowledge of active compounds for a particular target or a customized scoring function for an interesting family of drug targets.  相似文献   

2.
Virtual screening by molecular docking has become a widely used approach to lead discovery in the pharmaceutical industry when a high-resolution structure of the biological target of interest is available. The performance of three widely used docking programs (Glide, GOLD, and DOCK) for virtual database screening is studied when they are applied to the same protein target and ligand set. Comparisons of the docking programs and scoring functions using a large and diverse data set of pharmaceutically interesting targets and active compounds are carried out. We focus on the problem of docking and scoring flexible compounds which are sterically capable of docking into a rigid conformation of the receptor. The Glide XP methodology is shown to consistently yield enrichments superior to the two alternative methods, while GOLD outperforms DOCK on average. The study also shows that docking into multiple receptor structures can decrease the docking error in screening a diverse set of active compounds.  相似文献   

3.
4.
The low accuracy of predicted docking scores is critical at in silico drug screening. In order to improve the accuracy of docking scores, we approximated the protein-compound binding free energy as a linear combination of the raw docking scores of a target compound with many different protein pockets. The coefficients of the linear combination were estimated by the similarities among proteins, simply by using the amino-acid sequence similarities or identities of the proteins. This method was applied to in silico screening of the active compounds of five target proteins, and it increased the hit ratio by approximately four to five times compared to that given only by the raw docking scores in every case. The hit ratio also became robust against differences of target proteins.  相似文献   

5.
We developed a new method to improve the accuracy of molecular interaction data using a molecular interaction matrix. This method was applied to enhance the database enrichment of in silico drug screening and in silico target protein screening using a protein-compound affinity matrix calculated by a protein-compound docking software. Our assumption was that the protein-compound binding free energy of a compound could be improved by a linear combination of its docking scores with many different proteins. We proposed two approaches to determine the coefficients of the linear combination. The first approach is based on similarity among the proteins, and the second is a machine-learning approach based on the known active compounds. These methods were applied to in silico screening of the active compounds of several target proteins and in silico target protein screening.  相似文献   

6.
In silico target fishing, whose aim is to identify possible protein targets for a query molecule, is an emerging approach used in drug discovery due its wide variety of applications. This strategy allows the clarification of mechanism of action and biological activities of compounds whose target is still unknown. Moreover, target fishing can be employed for the identification of off targets of drug candidates, thus recognizing and preventing their possible adverse effects. For these reasons, target fishing has increasingly become a key approach for polypharmacology, drug repurposing, and the identification of new drug targets. While experimental target fishing can be lengthy and difficult to implement, due to the plethora of interactions that may occur for a single small-molecule with different protein targets, an in silico approach can be quicker, less expensive, more efficient for specific protein structures, and thus easier to employ. Moreover, the possibility to use it in combination with docking and virtual screening studies, as well as the increasing number of web-based tools that have been recently developed, make target fishing a more appealing method for drug discovery. It is especially worth underlining the increasing implementation of machine learning in this field, both as a main target fishing approach and as a further development of already applied strategies. This review reports on the main in silico target fishing strategies, belonging to both ligand-based and receptor-based approaches, developed and applied in the last years, with a particular attention to the different web tools freely accessible by the scientific community for performing target fishing studies.  相似文献   

7.
Reliable and effective virtual high-throughput screening (vHTS) methods are desperately needed to minimize the expenses involved in drug discovery projects. Here, we present an improvement to the negative image-based (NIB) screening: the shape, the electrostatics, and the solvation state of the target protein's ligand-binding site are included into the vHTS. Additionally, the initial vHTS results are postprocessed with molecular mechanics/generalized Born surface area (MMGBSA) calculations to estimate the favorability of ligand-protein interactions. The results show that docking produces very good early enrichment for phosphodiesterase-5 (PDE-5); however, in general, the NIB and the ligand-based screening performed better with or without the added electrostatics. Furthermore, the postprocessing of the NIB screening results using MMGBSA calculations improved the early enrichment for the PDE-5 considerably, thus, making hit discovery affordable.  相似文献   

8.
Products from combinatorial libraries generally share a common core structure that can be exploited to improve the efficiency of virtual high-throughput screening (vHTS). In general, it is more efficient to find a method that scales with the total number of reagents (Sigma growth) rather with the number of products (Pi growth). The OptiDock methodology described herein entails selecting a diverse but representative subset of compounds that span the structural space encompassed by the full library. These compounds are docked individually using the FlexX program (Rarey, M.; Kramer, B.; Lengauer, T.; Klebe, G. J. Mol. Biol. 1995, 251, 470-489) to define distinct docking modes in terms of reference placements for combinatorial core atoms. Thereafter, substituents in R-cores (consisting of the core structure substituted at a single variation site) are docked, keeping the core atoms fixed at the coordinates dictated by each reference placement. Interaction energies are calculated for each docked R-core with respect to the target protein, and energies for whole compounds are calculated by finding the reference core placement for which the sum of corresponding R-core energies is most negative. The use of diverse whole compounds to define binding modes is a key advantage of the protocol over other combinatorial docking programs. As a result, OptiDock returns better-scoring conformers than does serially applied FlexX. OptiDock is also better able to find a viable docked pose for each library member than are other combinatorial approaches.  相似文献   

9.
The three-dimensional (3D) structures of most protein targets have not been determined so far, with many of them not even having a known ligand, a truly general method to predict ligand-protein interactions in the absence of three-dimensional information would be of great potential value in drug discovery. Using the support vector machine (SVM) approach, we constructed a model for predicting ligand-protein interaction based only on the primary sequence of proteins and the structural features of small molecules. The model, trained by using 15,000 ligand-protein interactions between 626 proteins and over 10,000 active compounds, was successfully used in discovering nine novel active compounds for four pharmacologically important targets (i.e., GPR40, SIRT1, p38, and GSK-3β). To our knowledge, this is the first example of a successful sequence-based virtual screening campaign, demonstrating that our approach has the potential to discover, with a single model, active ligands for any protein.  相似文献   

10.
11.
A molecular docking method designated as ADDock, anchor- dependent molecular docking process for docking small flexible molecules into rigid protein receptors, is presented in this article. ADDock makes the bond connection lists for atoms based on anchors chosen for building molecular structures for docking small flexible molecules or ligands into rigid active sites of protein receptors. ADDock employs an extended version of piecewise linear potential for scoring the docked structures. Since no translational motion for small molecules is implemented during the docking process, ADDock searches the best docking result by systematically changing the anchors chosen, which are usually the single-edge connected nodes or terminal hydrogen atoms of ligands. ADDock takes intact ligand structures generated during the docking process for computing the docked scores; therefore, no energy minimization is required in the evaluation phase of docking. The docking accuracy by ADDock for 92 receptor-ligand complexes docked is 91.3%. All these complexes have been docked by other groups using other docking methods. The receptor-ligand steric interaction energies computed by ADDock for some sets of active and inactive compounds selected and docked into the same receptor active sites are apparently separated. These results show that based on the steric interaction energies computed between the docked structures and receptor active sites, ADDock is able to separate active from inactive compounds for both being docked into the same receptor.  相似文献   

12.
A new method has been developed to design a focused library based on available active compounds using protein-compound docking simulations. This method was applied to the design of a focused library for cytochrome P450 (CYP) ligands, not only to distinguish CYP ligands from other compounds but also to identify the putative ligands for a particular CYP. Principal component analysis (PCA) was applied to the protein-compound affinity matrix, which was obtained by thorough docking calculations between a large set of protein pockets and chemical compounds. Each compound was depicted as a point in the PCA space. Compounds that were close to the known active compounds were selected as candidate hit compounds. A machine-learning technique optimized the docking scores of the protein-compound affinity matrix to maximize the database enrichment of the known active compounds, providing an optimized focused library.  相似文献   

13.
Dipeptidyl peptidase-4 (DPP-4) inhibitors are becoming an essential drug in the treatment of type 2 diabetes mellitus; however, some classes of these drugs exert side effects, including joint pain and pancreatitis. Studies suggest that these side effects might be related to secondary inhibition of DPP-8 and DPP-9. In this study, we identified DPP-4-inhibitor hit compounds selective against DPP-8 and DPP-9. We built a virtual screening workflow using a quantitative structure–activity relationship (QSAR) strategy based on artificial intelligence to allow faster screening of millions of molecules for the DPP-4 target relative to other screening methods. Five regression machine learning algorithms and four classification machine learning algorithms were applied to build virtual screening workflows, with the QSAR model applied using support vector regression (R2pred 0.78) and the classification QSAR model using the random forest algorithm with 92.2% accuracy. Virtual screening results of > 10 million molecules obtained 2 716 hits compounds with a pIC50 value of > 7.5. Additionally, molecular docking results of several potential hit compounds for DPP-4, DPP-8, and DPP-9 identified CH0002 as showing high inhibitory potential against DPP-4 and low inhibitory potential for DPP-8 and DPP-9 enzymes. These results demonstrated the effectiveness of this technique for identifying DPP-4-inhibitor hit compounds selective for DPP-4 and against DPP-8 and DPP-9 and suggest its potential efficacy for applications to discover hit compounds of other targets.  相似文献   

14.
We developed a new structure-based in-silico screening method using a negative image of a ligand-binding pocket and a multi-protein–compound interaction matrix. Based on the structure of the ligand pocket of the target protein, we designed a negative image, which consists of virtual atoms whose radii are close to those of carbon atoms. The virtual atoms fit the pocket ideally and achieve an optimal Coulomb interaction. A protein–compound docking program calculates the protein–compound interaction matrix for many proteins and many compounds including the negative image, which can be treated as a virtual compound. With specific attention to a vector of docking scores for a single compound with many proteins, we selected a compound whose score vector was similar to that of the negative image as a candidate hit compound. This method was applied to representative target proteins and showed high database enrichment with a relatively quick procedure.  相似文献   

15.
16.
17.
The performance of the site-features docking algorithm LibDock has been evaluated across eight GlaxoSmithKline targets as a follow-up to a broad validation study of docking and scoring software (Warren, G. L.; Andrews, W. C.; Capelli, A.; Clarke, B.; Lalonde, J.; Lambert, M. H.; Lindvall, M.; Nevins, N.; Semus, S. F.; Senger, S.; Tedesco, G.; Walls, I. D.; Woolven, J. M.; Peishoff, C. E.; Head, M. S. J. Med. Chem. 2006, 49, 5912-5931). Docking experiments were performed to assess both the accuracy in reproducing the binding mode of the ligand and the retrieval of active compounds in a virtual screening protocol using both the DJD (Diller, D. J.; Merz, K. M., Jr. Proteins 2001, 43, 113-124) and LigScore2 (Krammer, A. K.; Kirchoff, P. D.; Jiang, X.; Venkatachalam, C. M.; Waldman, M. J. Mol. Graphics Modell. 2005, 23, 395-407) scoring functions. This study was conducted using DJD scoring, and poses were rescored using all available scoring functions in the Accelrys LigandFit module, including LigScore2. For six out of eight targets at least 30% of the ligands were docked within a root-mean-square difference (RMSD) of 2.0 A for the crystallographic poses when the LigScore2 scoring function was used. LibDock retrieved at least 20% of active compounds in the top 10% of screened ligands for four of the eight targets in the virtual screening protocol. In both studies the LigScore2 scoring function enhanced the retrieval of crystallographic poses or active compounds in comparison with the results obtained using the DJD scoring function. The results for LibDock accuracy and ligand retrieval in virtual screening are compared to 10 other docking and scoring programs. These studies demonstrate the utility of the LigScore2 scoring function and that LibDock as a feature directed docking method performs as well as docking programs that use genetic/growing and Monte Carlo driven algorithms.  相似文献   

18.
ABSTRACT

Existing data on structures and biological activities are limited and distributed unevenly across distinct molecular targets and chemical compounds. The question arises if these data represent an unbiased sample of the general population of chemical-biological interactions. To answer this question, we analyzed ChEMBL data for 87,583 molecules tested against 919 protein targets using supervised and unsupervised approaches. Hierarchical clustering of the Murcko frameworks generated using Chemistry Development Toolkit showed that the available data form a big diffuse cloud without apparent structure. In contrast hereto, PASS-based classifiers allowed prediction whether the compound had been tested against the particular molecular target, despite whether it was active or not. Thus, one may conclude that the selection of chemical compounds for testing against specific targets is biased, probably due to the influence of prior knowledge. We assessed the possibility to improve (Q)SAR predictions using this fact: PASS prediction of the interaction with the particular target for compounds predicted as tested against the target has significantly higher accuracy than for those predicted as untested (average ROC AUC are about 0.87 and 0.75, respectively). Thus, considering the existing bias in the data of the training set may increase the performance of virtual screening.  相似文献   

19.
We developed a new protocol for in silico drug screening for G-protein-coupled receptors (GPCRs) using a set of "universal active probes" (UAPs) with an ensemble docking procedure. UAPs are drug-like compounds, which are actual active compounds of a variety of known proteins. The current targets were nine human GPCRs whose three-dimensional (3D) structures are unknown, plus three GPCRs, namely β(2)-adrenergic receptor (ADRB2), A(2A) adenosine receptor (A(2A)), and dopamine D3 receptor (D(3)), whose 3D structures are known. Homology-based models of the GPCRs were constructed based on the crystal structures with careful sequence inspection. After subsequent molecular dynamics (MD) simulation taking into account the explicit lipid membrane molecules with periodic boundary conditions, we obtained multiple model structures of the GPCRs. For each target structure, docking-screening calculations were carried out via the ensemble docking procedure, using both true active compounds of the target proteins and the UAPs with the multiple target screening (MTS) method. Consequently, the multiple model structures showed various screening results with both poor and high hit ratios, the latter of which could be identified as promising for use in in silico screening to find candidate compounds to interact with the proteins. We found that the hit ratio of true active compounds showed a positive correlation to that of the UAPs. Thus, we could retrieve appropriate target structures from the GPCR models by applying the UAPs, even if no active compound is known for the GPCRs. Namely, the screening result that showed a high hit ratio for the UAPs could be used to identify actual hit compounds for the target GPCRs.  相似文献   

20.
Zika virus (ZIKV) infection has been associated with Guillain-Barre syndrome in adults and microcephaly in infants. The existence of insufficient structural data in most of the protein databases hinders the synthesis of anti-ZIKV pharmaceutics. In this work, we attempted to model the catalytic domain of the ZIKV RNA polymerase (RdRpC) along with a detailed assessment of conserved aspartates in ZIKV RdRpC palm domain as potential drug targets. The conserved and catalytically active aspartate residues present in the predicted RdRpC protein were virtually screened against a ZINC database for inhibitors, and the selected potential drug candidates were further filtered based on their ADMET profiles. One of the pharmacokinetically active compounds (Ligand 6) showed a remarkable docking profile against the strictly conserved aspartate residues of the RdRpC active site. We hypothesize that the Ligand 6 may form a potential drug candidate for RdRpC inhibition in the clinical treatment of ZIKV infection.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号