首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The Flexible Ligand Unified Force Field (FLUFF) is a molecular mechanistic superposition algorithm utilizing a template structure, on top of which the ligand(s) are superimposed. FLUFF enables a flexible semiautomatic superimposition in which the ligand and the template are allowed to seek the best common conformation, which can then be used to predict the biological activity by Boundless Adaptive Localized Ligand (BALL). In BALL, the similarity of the electrostatic and van der Waals volumes of the template and ligand is evaluated using the template-based coordinate system which makes the FLUFF-BALL invariant as to the rotations and translations of the global coordinate system. When tested using the CBG (corticosteroid binding globulin) affinities of 31 benchmark steroids, the FLUFF-BALL technique produced results comparable to standard 3D-QSAR methods. Supplementary test calculations were performed with five additional data sets. Due to its high level of automation and high throughput, the FLUFF-BALL is highly suitable for use in drug design and in scanning of large molecular libraries.  相似文献   

2.
This paper presents a theoretical model of how data fusion can be used to combine the results of multiple similarity searches of chemical databases. The model is based on frequency distributions of similarity values that are fused using a multiple integration over regions defined by the particular fusion rule that is being applied. For pairwise fusion, the resulting double integrals are straightforward to evaluate for simple model distributions. Similarity values for recovered-active and recovered-nonactive frequency distributions are independently modeled using a constant background, linearly biased terms, and a first-order correlated term. The model shows that two standard fusion rules can give performance enhancements in some cases but that the results of fusion are dependent on many factors that, taken together, can lead to seemingly inconsistent levels of enhancement.  相似文献   

3.
FieldScreen, a ligand-based Virtual Screening (VS) method, is described. Its use of 3D molecular fields makes it particularly suitable for scaffold hopping, and we have rigorously validated it for this purpose using a clustered version of the Directory of Useful Decoys (DUD). Using thirteen pharmaceutically relevant targets, we demonstrate that FieldScreen produces superior early chemotype enrichments, compared to DOCK. Additionally, hits retrieved by FieldScreen are consistently lower in molecular weight than those retrieved by docking. Where no X-ray protein structures are available, FieldScreen searches are more robust than docking into homology models or apo structures.  相似文献   

4.
In recent years, many virtual screening (VS) tools have been developed that employ different molecular representations and have different speed and accuracy characteristics. In this paper, we compare ten popular ligand-based VS tools using the publicly available Directory of Useful Decoys (DUD) data set comprising over 100?000 compounds distributed across 40 protein targets. The DUD was developed initially to evaluate docking algorithms, but our results from an operational correlation analysis show that it is also well suited for comparing ligand-based VS tools. Although it is conventional wisdom that 3D molecular shape is an important determinant of biological activity, our results based on permutational significance tests of several commonly used VS metrics show that the 2D fingerprint-based methods generally give better VS performance than the 3D shape-based approaches for surprisingly many of the DUD targets. To help understand this finding, we have analyzed the nature of the scoring functions used and the composition of the DUD data set itself. We propose that to improve the VS performance of current 3D methods, it will be necessary to devise screening queries that can represent multiple possible conformations and which can exploit knowledge of known actives that span multiple scaffold families.  相似文献   

5.
In a recent companion paper we have related the operation of simple data fusion rules used in virtual screening to a multiple integral formalism. In this paper we extend these ideas to the analysis of data fusion methods applied to real data. We examine several cases of similarity fusion using different coefficients and different representations and consider the reasons for positive or negative results in terms of the similarity distributions. Results are obtained using the SUM-, MAX- MIN-, and CombMNZ-fusion rules. We also develop a customized fusion rule, which provides an estimate of the optimal possible result for fusing multiple searches of a specific database; this shows that similarity fusion can, in principle, achieve retrieval enhancements even if this is not achieved in practice with current fusion rules. The methods are extended to analyze the comparatively successful results of group fusion with multiple actives, and we provide a rationale for the observed superiority of the MAX-rule over the SUM-rule in this context.  相似文献   

6.
7.
Despite the growing importance of qualitative screening tests in routine laboratories involved in the EU official control, their validation is not as deeply explained in Commission Decision 2002/657/EC as the validation of quantitative confirmatory methods. At the same time, the issue of quality assurance of screening assays defining internal quality control (IQC) procedures as required by accreditation bodies is undoubtedly less developed in this analytical field. As an example the present study describes the development, the validation and the IQC implemented for a commercial enzyme linked immunosorbent assay (ELISA) able to detect 17-α-19-nortestosterone (α-NT) and 17-β-19-nortestosterone (β-NT) isomers in bullock urine. In order to select a suitable sample treatment, two SPE purification protocols were preliminary compared. The chosen method was therefore fully validated determining the mandatory parameters required by Commission Decision 2002/657/EC: specificity, detection capability and robustness. An in-depth discussion was carried out illustrating the possible validation approaches and their implications especially in the assessment of the key performance characteristic: detection capability. Finally, the control charts implemented for continuous method verification during analyses of real samples were reported.  相似文献   

8.
The goal of this paper is to present and describe a novel 2D- and 3D-QSAR (quantitative structure-activity relationship) binary classification data set for the inhibition of c-Jun N-terminal kinase-3 with previously unpublished activities for a diverse set of compounds. JNK3 is an important pharmaceutical target because it is involved in many neurological disorders. Accordingly, the development of JNK3 inhibitors has gained increasing interest. 2D and 3D versions of the data set were used, consisting of 313 (70 actives) and 249 (60 actives) compounds, respectively. All compounds, for which activity was only determined for the racemate, were removed from the 3D data set. We investigated the diversity of the data sets by an agglomerative clustering with feature trees and show that the data set contains several different scaffolds. Furthermore, we show that the benchmarks can be tackled with standard supervised learning algorithms with a convincing performance. For the 2D problem, a random decision forest classifier achieves a Matthew's correlation coefficient of 0.744, the 3D problem could be modeled with a Matthew's correlation coefficient of 0.524 with 3D pharmacophores and a support vector machine. The performance of both data sets was evaluated within a nested 10-fold cross-validation. We therefore suggest that the data set is a reasonable basis for generating QSAR models for JNK3 because of its diverse composition and the performance of the classifiers presented in this study.  相似文献   

9.
A novel software (VSDMIP) for the virtual screening (VS) of chemical libraries integrated within a MySQL relational database is presented. Two main features make VSDMIP clearly distinguishable from other existing computational tools: (i) its database, which stores not only ligand information but also the results from every step in the VS process, and (ii) its modular and pluggable architecture, which allows customization of the VS stages (such as the programs used for conformer generation or docking), through the definition of a detailed workflow employing user-configurable XML files. VSDMIP, therefore, facilitates the storage and retrieval of VS results, easily adapts to the specific requirements of each method and tool used in the experiments, and allows the comparison of different VS methodologies. To validate the usefulness of VSDMIP as an automated tool for carrying out VS several experiments were run on six protein targets (acetylcholinesterase, cyclin-dependent kinase 2, coagulation factor Xa, estrogen receptor alpha, p38 MAP kinase, and neuraminidase) using nine binary (actives/inactive) test sets. The performance of several VS configurations was evaluated by means of enrichment factors and receiver operating characteristic plots. ángel R. Ortiz deceased on May 5, 2008.  相似文献   

10.
Consideration of stereochemistry early in the identification and optimization of lead compounds can improve the efficiency and efficacy of the drug discovery process and reduce the time spent on subsequent drug development. These improvements can result by focusing on specific enantiomers that have the desired potential therapeutic effect (eutomers), while removing from consideration enantiomers that may have no, or even undesirable, effects (distomers). A virtual screening campaign that correctly takes stereochemical information into account can, in theory, be utilized to provide information about the relative binding affinities of enantiomers. Thus, the proper enumeration of the relevant stereoisomers in general, and enantiomeric pairs in particular, of chiral compounds is crucial if one is to use virtual screening as an effective drug discovery tool. As is obvious, in cases where no stereochemical information is provided for chiral compounds in a 2D chemical database, then each possible stereoisomer should be generated for construction of the subsequent 3D database to be used for virtual screening. However, acute problems can arise in 3D database construction when relative stereochemistry is encoded in a 2D database for a chiral compound containing multiple stereogenic atoms but absolute stereochemistry is not implied. In this case, we report that generation of enantiomeric pairs is imperative in database development if one is to obtain accurate docking results. A study is described on the impact of the neglect of enantiomeric pairs on virtual screening using the human homolog of murine double minute 2 (MDM2) protein, the product of a proto-oncogene, as the target. Docking in MDM2 with GLIDE 4.0 was performed using the NCI Diversity Set 3D database and, for comparison, a set of enantiomers we created corresponding to mirror image structures of the single enantiomers of chiral compounds present in the NCI Diversity Set. Our results demonstrate that potential lead candidates may be overlooked when databases contain 3D structures representing only a single enantiomer of racemic chiral compounds.  相似文献   

11.
Receptor flexibility is a critical issue in structure-based virtual screening methods. Although a multiple-receptor conformation docking is an efficient way to account for receptor flexibility, it is still too slow for large molecular libraries. It was reported that a fast ligand-centric, shape-based virtual screening was more consistent for hit enrichment than a typical single-receptor conformation docking. Thus, we designed a "distributed docking" method that improves virtual high throughput screening by combining a shape-matching method with a multiple-receptor conformation docking. Database compounds are classified in advance based on shape similarities to one of the crystal ligands complexed with the target protein. This classification enables us to pick the appropriate receptor conformation for a single-receptor conformation docking of a given compound, thereby avoiding time-consuming multiple docking. In particular, this approach utilizes cross-docking scores of known ligands to all available receptor structures in order to optimize the algorithm. The present virtual screening method was tested for reidentification of known PPARgamma and p38 MAP kinase active compounds. We demonstrate that this method improves the enrichment while maintaining the computation speed of a typical single-receptor conformation docking.  相似文献   

12.
13.
The performance of a classification model is often assessed in terms of how well it separates a set of known observations into appropriate classes. If the validation sets used for such analyses are redundant due to bias in sampling, the relevance of the conclusions drawn to prospective work in which new kinds of positives are sought may be compromised. In the case of the various virtual screening techniques used in modern drug discovery, such bias generally appears as over‐representation of particular structural subclasses in the test set. We show how clustering by substructural similarity, followed by applying arithmetic and harmonic weighting schemes to receiver operating characteristic (ROC) curves, can be used to identify validation sets that are biased due to such redundancies. This can be accomplished qualitatively by direct examination or quantitatively by comparing the areas under the respective linear or semilog curves (AUCs or pAUCs). Copyright © 2009 John Wiley & Sons, Ltd.  相似文献   

14.
Flexible ligand docking is a routine part of a modern structure-based lead discovery process. As of today, there are quite a number of commercial docking programs that can be used to screen large databases (hundreds of thousands to millions of compounds). However, limiting factors such as the number of commercial software licenses needed to perform docking simultaneously on multiple processors ("software cost") and the relatively long time required per molecule to get good results ("quality-to-speed") should be taken into account when planning a large docking run. How can we optimize the efficiency of selecting lead candidates by docking, in respect to the quality of the results, search speed, and software cost? We present a combination of two methods, our "fast-free-approximate" in-house docking program and the "slow-costly-accurate" ICM-Dock, as an example of one solution to the problem. Our proposed protocol is illustrated by a series of virtual screening experiments aimed at identifying active compounds in the MDL Drug Data Report database. In more than half of the 20 cases examined, at least several actives per protein target were identified in approximately 24 hours per target.  相似文献   

15.
Similarity searching using a single bioactive reference structure is a well-established technique for accessing chemical structure databases. This paper describes two extensions of the basic approach. First, we discuss the use of group fusion to combine the results of similarity searches when multiple reference structures are available. We demonstrate that this technique is notably more effective than conventional similarity searching in scaffold-hopping searches for structurally diverse sets of active molecules; conversely, the technique will do little to improve the search performance if the actives are structurally homogeneous. Second, we make the assumption that the nearest neighbors resulting from a similarity search, using a single bioactive reference structure, are also active and use this assumption to implement approximate forms of group fusion, substructural analysis, and binary kernel discrimination. This approach, called turbo similarity searching, is notably more effective than conventional similarity searching.  相似文献   

16.
17.
The use of depleted uranium in military ordinance has led to an increasing need to determine isotope-specific uranium concentrations in environmental matrices. To this end, gamma-ray spectrometry, ICP-MS and INAA methods have been validated, in accordance with the ISO 17025 standard. Reporting limits of 0.21 (U-235) and 0.91 (U-238) ng/L were obtained by ICP-MS analysis of water. Higher reporting limits were obtained for INAA (U-238 only) validations of water and gamma-ray spectrometric validations of soil and water. Accredited methods have been used to determine uranium concentration and isotope ratio of samples obtained from the Defence Research and Development Canada Valcartier, Quebec.  相似文献   

18.
With the emergence of combinatorial chemistry, whether based on parallel, mixture, solution, or solid phase chemistry, it is now possible to generate large numbers of diverse or focused compound libraries. In this paper we aim to demonstrate that it is possible to design targeted libraries by applying nonparametric statistical methods, recursive partitioning in particular, to large data sets containing thousands of compounds and their associated biological data. Moreover, when applied to an experimental high-throughput screening (HTS) data set, our data strongly suggest that this method can improve the hit rate of our primary screens (about 4- to 5-fold) while increasing screening efficiency: less than one-fifth of the complete selection needs to be screened in order to identify about 75% of all actives present.  相似文献   

19.
20.
Current systems for similarity-based virtual screening use similarity measures in which all the fragments in a fingerprint contribute equally to the calculation of structural similarity. This paper discusses the weighting of fragments on the basis of their frequencies of occurrence in molecules. Extensive experiments with sets of active molecules from the MDL Drug Data Report and the World of Molecular Bioactivity databases, using fingerprints encoding Tripos holograms, Pipeline Pilot ECFC_4 circular substructures and Sunset Molecular keys, demonstrate clearly that frequency-based screening is generally more effective than conventional, unweighted screening. The results suggest that standardising the raw occurrence frequencies by taking the square root of the frequencies will maximise the effectiveness of virtual screening. An upper-bound analysis shows the complex interactions that can take place between representations, weighting schemes and similarity coefficients when similarity measures are computed, and provides a rationalisation of the relative performance of the various weighting schemes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号