首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Total order ranking (TOR) strategies, which are mathematically based on elementary methods of discrete mathematics, seem to be attractive and simple tools for performing data analysis. Moreover order-ranking strategies seem to be a very useful tool not only to perform data exploration but also to develop order ranking models, a possible alternative to conventional quantitative structure–activity relationship (QSAR) methods. In fact, when data material is characterised by uncertainties, order methods can be used as alternative to statistical methods such as multilinear regression (MLR), because they do not require specific functional relationships between the independent and dependent variables (responses). A ranking model is a relationship between a set of dependent attributes, experimentally investigated, and a set of independent attributes, i.e. model attributes, which are calculated attributes. As in regression and classification models, the variable selection model is one of the main steps in finding predictive models. In this work the genetic algorithm–variable subset selection (GA–VSS) approach is proposed as the variable selection method for searching for the best ranking models within a wide set of variables. The models based on the selected subsets of variables are compared with the experimental ranking and evaluated by the Spearmans rank index. A case study application is presented on a TOR model developed for polychlorinated biphenyl (PCB) compounds, which have been analysed according to some of their physicochemical properties which play an important role in their environmental impact.  相似文献   

2.
We developed a new protocol for in silico drug screening for G-protein-coupled receptors (GPCRs) using a set of "universal active probes" (UAPs) with an ensemble docking procedure. UAPs are drug-like compounds, which are actual active compounds of a variety of known proteins. The current targets were nine human GPCRs whose three-dimensional (3D) structures are unknown, plus three GPCRs, namely β(2)-adrenergic receptor (ADRB2), A(2A) adenosine receptor (A(2A)), and dopamine D3 receptor (D(3)), whose 3D structures are known. Homology-based models of the GPCRs were constructed based on the crystal structures with careful sequence inspection. After subsequent molecular dynamics (MD) simulation taking into account the explicit lipid membrane molecules with periodic boundary conditions, we obtained multiple model structures of the GPCRs. For each target structure, docking-screening calculations were carried out via the ensemble docking procedure, using both true active compounds of the target proteins and the UAPs with the multiple target screening (MTS) method. Consequently, the multiple model structures showed various screening results with both poor and high hit ratios, the latter of which could be identified as promising for use in in silico screening to find candidate compounds to interact with the proteins. We found that the hit ratio of true active compounds showed a positive correlation to that of the UAPs. Thus, we could retrieve appropriate target structures from the GPCR models by applying the UAPs, even if no active compound is known for the GPCRs. Namely, the screening result that showed a high hit ratio for the UAPs could be used to identify actual hit compounds for the target GPCRs.  相似文献   

3.
Increasing evidence that many pharmaceutically relevant compounds elicit their effects through binding to multiple targets, so-called polypharmacology, is beginning to change conventional drug discovery and design strategies. In light of this paradigm shift, we have mined publicly available compound and bioactivity data for promiscuous chemotypes. For this purpose, a hierarchy of active compounds, atomic property based scaffolds, and unique molecular topologies were generated, and activity annotations were analyzed using this framework. Starting from ~35?000 compounds active against human targets with at least 1 μM potency, 33 chemotypes with distinct topology were identified that represented molecules active against at least 3 different target families. Network representations were utilized to study scaffold-target family relationships and activity profiles of scaffolds corresponding to promiscuous chemotypes. A subset of promiscuous chemotypes displayed a significant enrichment in drugs over bioactive compounds. A total of 190 drugs were identified that had on average only 2 known target annotations but belonged to the 7 most promiscuous chemotypes that were active against 8-15 target families. These drugs should be attractive candidates for polypharmacological profiling.  相似文献   

4.
Virtual database screening allows for millions of chemical compounds to be computationally selected based on structural complimentary to known inhibitors or to a target binding site on a biological macromolecule. Compound selection in virtual database screening when targeting a biological macromolecule is typically based on the interaction energy between the chemical compound and the target macromolecule. In the present study it is shown that this approach is biased toward the selection of high molecular weight compounds due to the contribution of the compound size to the energy score. To account for molecular weight during energy based screening, we propose normalization strategies based on the total number of heavy atoms in the chemical compounds being screened. This approach is computationally efficient and produces molecular weight distributions of selected compounds that can be selected to be (1) lower than that of the original database used in the virtual screening, which may be desirable for selection of leadlike compounds or (2) similar to that of the original database, which may be desirable for the selection of drug-like compounds. By eliminating the bias in target-based database screening toward higher molecular weight compounds it is anticipated that the proposed procedure will enhance the success rate of computer-aided drug design.  相似文献   

5.
6.
We describe a novel method for ligand-based virtual screening, based on utilizing Self-Organizing Maps (SOM) as a novelty detection device. Novelty detection (or one-class classification) refers to the attempt of identifying patterns that do not belong to the space covered by a given data set. In ligand-based virtual screening, chemical structures perceived as novel lie outside the known activity space and can therefore be discarded from further investigation. In this context, the concept of "novel structure" refers to a compound, which is unlikely to share the activity of the query structures. Compounds not perceived as "novel" are suspected to share the activity of the query structures. Nowadays, various databases contain active structures but access to compounds which have been found to be inactive in a biological assay is limited. This work addresses this problem via novelty detection, which does not require proven inactive compounds. The structures are described by spatial autocorrelation functions weighted by atomic physicochemical properties. Different methods for selecting a subset of targets from a larger set are discussed. A comparison with similarity search based on Daylight fingerprints followed by data fusion is presented. The two methods complement each other to a large extent. In a retrospective screening of the WOMBAT database novelty detection with SOM gave enrichment factors between 105 and 462-an improvement over the similarity search based on Daylight fingerprints between 25% and 100%, when the 100 top ranked structures were considered. Novelty detection with SOM is applicable (1) to improve the retrieval of potentially active compounds also in concert with other virtual screening methods; (2) as a library design tool for discarding a large number of compounds, which are unlikely to possess a given biological activity; and (3) for selecting a small number of potentially active compounds from a large data set.  相似文献   

7.
There is growing interest in computational chemogenomics, which aims to identify all possible ligands of all target families using in silico prediction models. In particular, kernel methods provide a means of integrating compounds and proteins in a principled manner and enable the exploration of ligand-target binding on a genomic scale. To better understand the link between ligands and targets, it is of fundamental interest to identify molecular interaction features that contribute to prediction of ligand-target binding. To this end, we describe a feature selection approach based on kernel dimensionality reduction (KDR) that works in a ligand-target space defined by kernels. We further propose an efficient algorithm to overcome a computational bottleneck and thereby provide a useful general approach to feature selection for chemogenomics. Our experiment on cytochrome P450 (CYP) enzymes has shown that the algorithm is capable of identifying predictive features, as well as prioritizing features that are indicative of ligand preference for a given target family. We further illustrate its applicability on the mutation data of HIV protease by identifying influential mutated positions within protease variants. These results suggest that our approach has the potential to uncover the molecular basis for ligand selectivity and off-target effects.  相似文献   

8.
We developed a new method to improve the accuracy of molecular interaction data using a molecular interaction matrix. This method was applied to enhance the database enrichment of in silico drug screening and in silico target protein screening using a protein-compound affinity matrix calculated by a protein-compound docking software. Our assumption was that the protein-compound binding free energy of a compound could be improved by a linear combination of its docking scores with many different proteins. We proposed two approaches to determine the coefficients of the linear combination. The first approach is based on similarity among the proteins, and the second is a machine-learning approach based on the known active compounds. These methods were applied to in silico screening of the active compounds of several target proteins and in silico target protein screening.  相似文献   

9.
10.
In the absence of X‐ray data, the exploration of compound binding modes continues to be a challenging task. For structure‐based design, specific features of active sites in different targets play a major role in rationalizing ligand binding characteristics. For example, dibasic compounds have been reported as potent inhibitors of various trypsin‐like serine proteases, the active sites of which contain several binding pockets that can be targeted by cationic moieties. This results in several possible orientations within the active site, complicating the binding mode prediction of such compounds by docking tools. Therefore, we introduced symmetry in bi‐ and tribasic compounds to reduce conformational space in docking calculations and to simplify binding mode selection by limiting the number of possible pocket occupations. Asymmetric bisbenzamidines were used as starting points for a multistage and structure‐guided optimization. A series of 24 final compounds with either two or three benzamidine substructures was ultimately synthesized and evaluated as inhibitors of five serine proteases, leading to potent symmetric inhibitors for the pharmaceutical drug targets matriptase, matriptase‐2, thrombin and factor Xa. This study underlines the relevance of ligand symmetry for chemical biology.  相似文献   

11.
Genetic algorithms have properties which make them attractive in de novo drug design. Like other de novo design programs, genetic algorithms require a method to reduce the enormous search space of possible compounds. Most often this is done using information from known ligands. We have developed the ADAPT program, a genetic algorithm which uses molecular interactions evaluated with docking calculations as a fitness function to reduce the search space. ADAPT does not require information about known ligands. The program takes an initial set of compounds and iteratively builds new compounds based on the fitness scores of the previous set of compounds. We describe the particulars of the ADAPT algorithm and its application to three well-studied target systems. We also show that the strategies of enhanced local sampling and re-introducing diversity to the compound population during the design cycle provide better results than conventional genetic algorithm protocols.  相似文献   

12.
13.
Increasingly, chemical libraries are being produced which are focused on a biological target or group of related targets, rather than simply being constructed in a combinatorial fashion. A screening collection compiled from such libraries will contain multiple analogues of a number of discrete series of compounds. The question arises as to how many analogues are necessary to represent each series in order to ensure that an active series will be identified. Based on a simple probabilistic argument and supported by in-house screening data, guidelines are given for the number of compounds necessary to achieve a "hit", or series of hits, at various levels of certainty. Obtaining more than one hit from the same series is useful since this gives early acquisition of SAR (structure-activity relationship) and confirms a hit is not a singleton. We show that screening collections composed of only small numbers of analogues of each series are sub-optimal for SAR acquisition. Based on these studies, we recommend a minimum series size of about 200 compounds. This gives a high probability of confirmatory SAR (i.e. at least two hits from the same series). More substantial early SAR (at least 5 hits from the same series) can be gained by using series of about 650 compounds each. With this level of information being generated, more accurate assessment of the likely success of the series in hit-to-lead and later stage development becomes possible.  相似文献   

14.
The SARS coronavirus 3C-like proteinase is considered as a potential drug design target for the treatment of severe acute respiratory syndrome (SARS). Owing to the lack of available drugs for the treatment of SARS, the discovery of inhibitors for SARS coronavirus 3C-like proteinase that can potentially be optimized as drugs appears to be highly desirable. We have built a "flexible" three-dimensional model for SARS 3C-like proteinase by homology modeling and multicanonical molecular dynamics method and used the model for virtual screening of chemical databases. After Dock procedures, strategies including pharmocophore model, consensus scoring, and "drug-like" filters were applied in order to accelerate the process and improve the success rate of virtual docking screening hit lists. Forty compounds were purchased and tested by HPLC and colorimetric assay against SARS 3C-like proteinase. Three of them including calmidazolium, a well-known antagonist of calmodulin, were found to inhibit the enzyme with an apparent K(i) from 61 to 178 microM. These active compounds and their binding modes provide useful information for understanding the binding sites and for further selective drug design against SARS and other coronavirus.  相似文献   

15.
A series of triarylamines based on dehydroabietic acid methyl ester moieties (6ah) were synthesized for possible application as hole transporting materials for organic electroluminescent devices. The target compounds were characterized by elemental analysis, FT-IR, NMR, and mass spectrometry. Their optical, electrochemical, and thermal properties were investigated using UV–vis, PL spectroscopy, cyclic voltammetry (CV), thermogravimetric analysis (TGA), and differential scanning calorimetry (DSC), respectively. CV measurements show that the compounds present suitable HOMO values (in a range of −4.63 to −5.11 eV) for hole injection, which is confirmed by theoretical calculations. All compounds were thermally stable. Organic light-emitting diode devices having 6f, 6g, and 6h as a hole transporting layer showed better performance of maximum brightness, turn-on voltage, and maximum luminous efficiency than a comparable device NPB. These compounds could be excellent candidates for applications in OLED devices.  相似文献   

16.
The field of molecular based magnetism is an active area of research directed toward the design of new magnetic materials. The idea is to introduce molecular strategies in magneto-chemistry. This can open completely new synthetic routes to materials with previously unknown physical properties. Spin carriers used within this approach range from purely organic radicals to metal complexes and organometallic compounds. The design of new magnetic materials with tailor-made properties requires a detailed knowledge about the interactions between possible spin carriers and the strategies necessary to achieve interactions in all three dimensions. The latter is closely related to the field of crystal engineering. Starting from introductory remarks to magnetochemistry the underlaying concepts for the design of magnetic materials on the basis of molecular compounds as well as new developments and possible applications are described.  相似文献   

17.
ABSTRACT

Existing data on structures and biological activities are limited and distributed unevenly across distinct molecular targets and chemical compounds. The question arises if these data represent an unbiased sample of the general population of chemical-biological interactions. To answer this question, we analyzed ChEMBL data for 87,583 molecules tested against 919 protein targets using supervised and unsupervised approaches. Hierarchical clustering of the Murcko frameworks generated using Chemistry Development Toolkit showed that the available data form a big diffuse cloud without apparent structure. In contrast hereto, PASS-based classifiers allowed prediction whether the compound had been tested against the particular molecular target, despite whether it was active or not. Thus, one may conclude that the selection of chemical compounds for testing against specific targets is biased, probably due to the influence of prior knowledge. We assessed the possibility to improve (Q)SAR predictions using this fact: PASS prediction of the interaction with the particular target for compounds predicted as tested against the target has significantly higher accuracy than for those predicted as untested (average ROC AUC are about 0.87 and 0.75, respectively). Thus, considering the existing bias in the data of the training set may increase the performance of virtual screening.  相似文献   

18.
The appropriate selection of the mobile phase facilitated the use cellulose tris(3,5-dichlorophenylcarbamate (CDCPC) as a chiral stationary phase (CSP) during high-performance liquid chromatography (HPLC). A preliminary evaluations of this material indicated its very high chiral resolving ability toward many analytes of different chemical and pharmacological groups. Some chemicals and drugs containing two centers of chirality were also successfully resolved into all possible stereoisomers. The applicability of CDCPC for enantioseparations in capillary liquid chromatography was also shown giving promising prospects for the screening of novel biologically active compounds (which may be also synthesized based on combinatorial strategies) for their enantiomeric composition.  相似文献   

19.
Polypharmacology has emerged as a new theme in drug discovery. In this paper, we studied polypharmacology using a ligand-based target fishing (LBTF) protocol. To implement the protocol, we first generated a chemogenomic database that links individual protein targets with a specified set of drugs or target representatives. Target profiles were then generated for a given query molecule by computing maximal shape/chemistry overlap between the query molecule and the drug sets assigned to each protein target. The overlap was computed using the program ROCS (Rapid Overlay of Chemical Structures). We validated this approach using the Directory of Useful Decoys (DUD). DUD contains 2950 active compounds, each with 36 property-matched decoys, against 40 protein targets. We chose a set of known drugs to represent each DUD target, and we carried out ligand-based virtual screens using data sets of DUD actives seeded into DUD decoys for each target. We computed Receiver Operator Characteristic (ROC) curves and associated area under the curve (AUC) values. For the majority of targets studied, the AUC values were significantly better than for the case of a random selection of compounds. In a second test, the method successfully identified off-targets for drugs such as rimantadine, propranolol, and domperidone that were consistent with those identified by recent experiments. The results from our ROCS-based target fishing approach are promising and have potential application in drug repurposing for single and multiple targets, identifying targets for orphan compounds, and adverse effect prediction.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号