期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Fighting high molecular weight in bioactive molecules with sub-pharmacophore-based virtual screening

Korff Mv Freyss J Sander T Boss C Ciana CL 《Journal of chemical information and modeling》2012,52(2):380-390

相似文献

2.

The reduced graph descriptor in virtual screening and data-driven clustering of high-throughput screening data 总被引：3，自引：0，他引：3

Harper G Bravi GS Pickett SD Hussain J Green DV 《Journal of chemical information and computer sciences》2004,44(6):2145-2156

相似文献

3.

Electronic van der Waals surface property descriptors and genetic algorithms for developing structure-activity correlations in olfactory databases

Lavine BK Davidson CE Breneman C Katt W Sundling CM 《Journal of chemical information and computer sciences》2003,43(6):1890-1905

相似文献

4.

Efficient substructure searching of large chemical libraries: the ABCD chemical cartridge

Agrafiotis DK Lobanov VS Shemanarev M Rassokhin DN Izrailev S Jaeger EP Alex S Farnum M 《Journal of chemical information and modeling》2011,51(12):3113-3130

Efficient substructure searching is a key requirement for any chemical information management system. In this paper, we describe the substructure search capabilities of ABCD, an integrated drug discovery informatics platform developed at Johnson & Johnson Pharmaceutical Research & Development, L.L.C. The solution consists of several algorithmic components: 1) a pattern mapping algorithm for solving the subgraph isomorphism problem, 2) an indexing scheme that enables very fast substructure searches on large structure files, 3) the incorporation of that indexing scheme into an Oracle cartridge to enable querying large relational databases through SQL, and 4) a cost estimation scheme that allows the Oracle cost-based optimizer to generate a good execution plan when a substructure search is combined with additional constraints in a single SQL query. The algorithm was tested on a public database comprising nearly 1 million molecules using 4,629 substructure queries, the vast majority of which were submitted by discovery scientists over the last 2.5 years of user acceptance testing of ABCD. 80.7% of these queries were completed in less than a second and 96.8% in less than ten seconds on a single CPU, while on eight processing cores these numbers increased to 93.2% and 99.7%, respectively. The slower queries involved extremely generic patterns that returned the entire database as screening hits and required extensive atom-by-atom verification. 相似文献

5.

Representing clusters using a maximum common edge substructure algorithm applied to reduced graphs and molecular graphs

Gardiner EJ Gillet VJ Willett P Cosgrove DA 《Journal of chemical information and modeling》2007,47(2):354-366

相似文献

6.

Schuffenhauer A Floersheim P Acklin P Jacoby E 《Journal of chemical information and computer sciences》2003,43(2):391-405

In this study we evaluate how far the scope of similarity searching can be extended to identify not only ligands binding to the same target as the reference ligand(s) but also ligands of other homologous targets without initially known ligands. This "homology-based similarity searching" requires molecular representations reflecting the ability of a molecule to interact with target proteins. The Similog keys, which are introduced here as a new molecular representation, were designed to fulfill such requirements. They are based only on the molecular constitution and are counts of atom triplets. Each triplet is characterized by the graph distances and the types of its atoms. The atom-typing scheme classifies each atom by its function as H-bond donor or acceptor and by its electronegativity and bulkiness. In this study the Similog keys are investigated in retrospective in silico screening experiments and compared with other conformation independent molecular representations. Studied were molecules of the MDDR database for which the activity data was augmented by standardized target classification information from public protein classification databases. The MDDR molecule set was split randomly into two halves. The first half formed the candidate set. Ligands of four targets (dopamine D2 receptor, opioid delta-receptor, factor Xa serine protease, and progesterone receptor) were taken from the second half to form the respective reference sets. Different similarity calculation methods are used to rank the molecules of the candidate set by their similarity to each of the four reference sets. The accumulated counts of molecules binding to the reference target and groups of targets with decreasing homology to it were examined as a function of the similarity rank for each reference set and similarity method. In summary, similarity searching based on Unity 2D-fingerprints or Similog keys are found to be equally effective in the identification of molecules binding to the same target as the reference set. However, the application of the Similog keys is more effective in comparison with the other investigated methods in the identification of ligands binding to any target belonging to the same family as the reference target. We attribute this superiority to the fact that the Similog keys provide a generalization of the chemical elements and that the keys are counted instead of merely noting their presence or absence in a binary form. The second most effective molecular representation are the occurrence counts of the public ISIS key fragments, which like the Similog method, incorporates key counting as well as a generalization of the chemical elements. The results obtained suggest that ligands for a new target can be identified by the following three-step procedure: 1. Select at least one target with known ligands which is homologous to the new target. 2. Combine the known ligands of the selected target(s) to a reference set. 3. Search candidate ligands for the new targets by their similarity to the reference set using the Similog method. This clearly enlarges the scope of similarity searching from the classical application for a single target to the identification of candidate ligands for whole target families and is expected to be of key utility for further systematic chemogenomics exploration of previously well explored target families. 相似文献

7.

Relationships between Molecular Complexity, Biological Activity, and Structural Diversity

Schuffenhauer A Brown N Selzer P Ertl P Jacoby E 《Journal of chemical information and modeling》2006,46(2):525-535

相似文献

8.

Evaluation of descriptors and mini-fingerprints for the identification of molecules with similar activity

Xue L Godden JW Bajorath J 《Journal of chemical information and computer sciences》2000,40(5):1227-1234

相似文献

9.

Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks

Maged Nasser Naomie Salim Hentabli Hamza Faisal Saeed Idris Rabiu 《Molecules (Basel, Switzerland)》2021,26(1)

相似文献

10.

Text Influenced Molecular Indexing (TIMI): a literature database mining approach that handles text and chemistry

Singh SB Hull RD Fluder EM 《Journal of chemical information and computer sciences》2003,43(3):743-752

相似文献

11.

Mini-fingerprints for virtual screening: Design principles and generation of novel prototypes based on information theory

L. Xue J.W. Godden J. Bajorath 《SAR and QSAR in environmental research》2013,24(1):27-40

相似文献

12.

Profile scaling increases the similarity search performance of molecular fingerprints containing numerical descriptors and structural keys

Xue L Godden JW Stahura FL Bajorath J 《Journal of chemical information and computer sciences》2003,43(4):1218-1225

相似文献

13.

Mini-fingerprints for virtual screening: design principles and generation of novel prototypes based on information theory

Xue L Godden JW Bajorath J 《SAR and QSAR in environmental research》2003,14(1):27-40

相似文献

14.

Efficient generation,storage, and manipulation of fully flexible pharmacophore multiplets and their use in 3-D similarity searching

Abrahamian E Fox PC Naerum L Christensen IT Thøgersen H Clark RD 《Journal of chemical information and computer sciences》2003,43(2):458-468

Pharmacophore triplets and quartets have been used by many groups in recent years, primarily as a tool for molecular diversity analysis. In most cases, slow processing speeds and the very large size of the bitsets generated have forced researchers to compromise in terms of how such multiplets were stored, manipulated, and compared, e.g., by using simple unions to represent multiplets for sets of molecules. Here we report using bitmaps in place of bitsets to reduce storage demands and to improve processing speed. Here, a bitset is taken to mean a fully enumerated string of zeros and ones, from which a compressed bitmap is obtained by replacing uniform blocks ("runs") of digits in the bitset with a pair of values identifying the content and length of the block (run-length encoding compression). High-resolution multiplets involving four features are enabled by using 64 bit executables to create and manipulate bitmaps, which "connect" to the 32 bit executables used for database access and feature identification via an extensible mark-up language (XML) data stream. The encoding system used supports simple pairs, triplets, and quartets; multiplets in which a privileged substructure is used as an anchor point; and augmented multiplets in which an additional vertex is added to represent a contingent feature such as a hydrogen bond extension point linked to a complementary feature (e.g., a donor or an acceptor atom) in a base pair or triplet. It can readily be extended to larger, more complex multiplets as well. Database searching is one particular potential application for this technology. Consensus bitmaps built up from active ligands identified in preliminary screening can be used to generate hypothesis bitmaps, a process which includes allowance for differential weighting to allow greater emphasis to be placed on bits arising from multiplets expected to be particularly discriminating. Such hypothesis bitmaps are shown to be useful queries for database searching, successfully retrieving active compounds across a range of structural classes from a corporate database. The current implementation allows multiconformer bitmaps to be obtained from pregenerated conformations or by random perturbation on-the-fly. The latter application involves random sampling of the full range of conformations not precluded by steric clashes, which limits the usefulness of classical fingerprint similarity measures. A new measure of similarity, The Stochastic Cosine, is introduced here to address this need. This new similarity measure uses the average number of bits common to independently drawn conformer sets to normalize the cosine coefficient. Its use frees the user from having to ensure strict comparability of starting conformations and having to use fixed torsional increments, thereby allowing fully flexible characterization of pharmacophoric patterns. 相似文献

15.

High-throughput structure-based pharmacophore modelling as a basis for successful parallel virtual screening

Steindl TM Schuster D Wolber G Laggner C Langer T 《Journal of computer-aided molecular design》2006,20(12):703-715

相似文献

16.

Design and evaluation of a molecular fingerprint involving the transformation of property descriptor values into a binary classification scheme

Xue L Godden JW Stahura FL Bajorath J 《Journal of chemical information and computer sciences》2003,43(4):1151-1157

相似文献

17.

Chemical database mining through entropy-based molecular similarity assessment of randomly generated structural fragment populations

Batista J Bajorath J 《Journal of chemical information and modeling》2007,47(1):59-68

相似文献