期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Development of a spectral clustering method for the analysis of molecular data sets

Brewer ML 《Journal of chemical information and modeling》2007,47(5):1727-1733

A spectral clustering method is presented and applied to two-dimensional molecular structures, where it has been found particularly useful in the analysis of screening data. The method provides a means to quantify (1) the degree of intermolecular similarity within a cluster and (2) the contribution that the features of a molecule make to a cluster. In an application of the spectral clustering method to an example data set of 125 COX-2 inhibitors, these two criteria were used to place the molecules into clusters of chemically related two-dimensional structures. 相似文献

2.

A. C. Good E. E. Hodgkin W. G. Richards 《Journal of computer-aided molecular design》1992,6(5):513-520

Summary Three-dimensional (3D)-database searches are now being widely applied to determine potential new active molecules. Many structural data sets obtained as a result of these searches are still large in size. In this paper we apply molecular similarity calculations as a rapid method to screen two such data sets. In the first investigation, synthetic candidates, produced as a result of a tendamistat -turn mimic search, were tested for their ability to imitate the -turn backbone. In the second study, structures extracted through a histamine pharmacophore query search were examined on the basis of their electronic similarity to histamine. Molecular similarity is shown to provide a rapid means of gaining insight into the composition of molecular data sets, with possible implications for future full 3D-database searches. 相似文献

3.

Nonlinear mapping of massive data sets by fuzzy clustering and neural networks

Dmitrii N. Rassokhin Victor S. Lobanov Dimitris K. Agrafiotis 《Journal of computational chemistry》2001,22(4):373-386

Producing good low‐dimensional representations of high‐dimensional data is a common and important task in many data mining applications. Two methods that have been particularly useful in this regard are multidimensional scaling and nonlinear mapping. These methods attempt to visualize a set of objects described by means of a dissimilarity or distance matrix on a low‐dimensional display plane in a way that preserves the proximities of the objects to whatever extent is possible. Unfortunately, most known algorithms are of quadratic order, and their use has been limited to relatively small data sets. We recently demonstrated that nonlinear maps derived from a small random sample of a large data set exhibit the same structure and characteristics as that of the entire collection, and that this structure can be easily extracted by a neural network, making possible the scaling of data set orders of magnitude larger than those accessible with conventional methodologies. Here, we present a variant of this algorithm based on local learning. The method employs a fuzzy clustering methodology to partition the data space into a set of Voronoi polyhedra, and uses a separate neural network to perform the nonlinear mapping within each cell. We find that this local approach offers a number of advantages, and produces maps that are virtually indistinguishable from those derived with conventional algorithms. These advantages are discussed using examples from the fields of combinatorial chemistry and optical character recognition. © 2001 John Wiley & Sons, Inc. J Comput Chem 22: 373–386, 2001 相似文献

4.

Quasi-orthogonal basis sets of molecular graph descriptors as a chemical diversity measure

Ivanciuc O Taraviras SL Cabrol-Bass D 《Journal of chemical information and computer sciences》2000,40(1):126-134

相似文献

5.

In silico prediction and screening of γ‐secretase inhibitors by molecular descriptors and machine learning methods

Xue‐Gang Yang Wei Lv Yu‐Zong Chen Ying Xue 《Journal of computational chemistry》2010,31(6):1249-1258

γ‐Secretase inhibitors have been explored for the prevention and treatment of Alzheimer's disease (AD). Methods for prediction and screening of γ‐secretase inhibitors are highly desired for facilitating the design of novel therapeutic agents against AD, especially when incomplete knowledge about the mechanism and three‐dimensional structure of γ‐secretase. We explored two machine learning methods, support vector machine (SVM) and random forest (RF), to develop models for predicting γ‐secretase inhibitors of diverse structures. Quantitative analysis of the receiver operating characteristic (ROC) curve was performed to further examine and optimize the models. Especially, the Youden index (YI) was initially introduced into the ROC curve of RF so as to obtain an optimal threshold of probability for prediction. The developed models were validated by an external testing set with the prediction accuracies of SVM and RF 96.48 and 98.83% for γ‐secretase inhibitors and 98.18 and 99.27% for noninhibitors, respectively. The different feature selection methods were used to extract the physicochemical features most relevant to γ‐secretase inhibition. To the best of our knowledge, the RF model developed in this work is the first model with a broad applicability domain, based on which the virtual screening of γ‐secretase inhibitors against the ZINC database was performed, resulting in 368 potential hit candidates. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010 相似文献

6.

Development of three-dimensional descriptors represented by tensors: free energy of hydration density tensor.

S H Son C K Han S K Ahn J H Yoon K T No 《Journal of chemical information and computer sciences》1999,39(3):601-609

相似文献

7.

Fast,fuzzy c-means clustering of data sets with many features

Bjrn K. Alsberg 《Journal of computational chemistry》1995,16(4):414-421

A fuzzy c-means clustering algorithm is presented which is much faster than the traditional algorithm for data sets in which the number of features is significantly larger than the number of feature vectors. The algorithm is constructed by utilizing the covariance structure of feature vectors and cluster centers. By using results from a previous clustering, modified versions of the new algorithm achieve additional reductions in floating point operations. © 1995 by John Wiley & Sons, Inc. 相似文献

8.

Using molecular docking, 3D-QSAR, and cluster analysis for screening structurally diverse data sets of pharmacological interest

Santos-Filho OA Cherkasov A 《Journal of chemical information and modeling》2008,48(10):2054-2065

相似文献

9.

Pharmacological classification of drugs by principal component analysis applying molecular modeling descriptors and HPLC retention data

Bober L Koba M Judycka-Proma U Baczek T 《Journal of chromatographic science》2011,49(10):758-763

相似文献

10.

Translations of fields represented by spherical-harmonic expansions for molecular calculations

E. Otto Steinborn Eckhard Filter 《Theoretical chemistry accounts》1975,38(4):247-260

In quantum chemistry one needs expansions of Orbitals and operators, defined with respect to one origin, about another origin. Because there is no straightforward method of obtaining such expansions, it is helpful to interpret them as translations of fields. The connection between translations and rotations of fields with the transformations of functions is considered. Of special physical interest are expansions in spherical harmonics, which have the form of an addition theorem. General properties of such expansions and possible methods to derive them are discussed. 相似文献

11.

Effect of selection of molecular descriptors on the prediction of blood-brain barrier penetrating and nonpenetrating agents by statistical learning methods

Li H Yap CW Ung CY Xue Y Cao ZW Chen YZ 《Journal of chemical information and modeling》2005,45(5):1376-1384

相似文献

12.

Comparison of methods for sequential screening of large compound sets

Blower PE Cross KP Eichler GS Myatt GJ Weinstein JN Yang C 《Combinatorial chemistry & high throughput screening》2006,9(2):115-122

相似文献

13.

Quantitative prediction of logk of peptides in high-performance liquid chromatography based on molecular descriptors by using the heuristic method and support vector machine

Liu HX Xue CX Zhang RS Yao XJ Liu MC Hu ZD Fan BT 《Journal of chemical information and computer sciences》2004,44(6):1979-1986

相似文献

14.

Correlations between gas chromatographic retention data of polycyclic aromatic hydrocarbons and several molecular descriptors

R. Corbella M. A. Rodríguez Ma J. Sánchez F. García Montelongo 《Chromatographia》1995,41(9-10):532-538

相似文献

15.

1D 13C-NMR data as molecular descriptors in spectra--structure relationship analysis of oligosaccharides

Pereira F 《Molecules (Basel, Switzerland)》2012,17(4):3818-3833

Spectra-structure relationships were investigated for estimating the anomeric configuration, residues and type of linkages of linear and branched trisaccharides using 13C-NMR chemical shifts. For this study, 119 pyranosyl trisaccharides were used that are trimers of the α or β anomers of D-glucose, D-galactose, D-mannose, L-fucose or L-rhamnose residues bonded through a or b glycosidic linkages of types 1→2, 1→3, 1→4, or 1→6, as well as methoxylated and/or N-acetylated amino trisaccharides. Machine learning experiments were performed for: (1) classification of the anomeric configuration of the first unit, second unit and reducing end; (2) classification of the type of first and second linkages; (3) classification of the three residues: reducing end, middle and first residue; and (4) classification of the chain type. Our previously model for predicting the structure of disaccharides was incorporated in this new model with an improvement of the predictive power. The best results were achieved using Random Forests with 204 di- and trisaccharides for the training set-it could correctly classify 83%, 90%, 88%, 85%, 85%, 75%, 79%, 68% and 94% of the test set (69 compounds) for the nine tasks, respectively, on the basis of unassigned chemical shifts. 相似文献

16.

Approximating the properties of some chemical solvents by two-dimensional molecular descriptors

Abid Mahboob Muhammad Waheed Rasheed Iqra Hanif Imran Siddique 《International journal of quantum chemistry》2024,124(1):e27305

相似文献

17.

Comparison of free energy methods for molecular systems

Ytreberg FM Swendsen RH Zuckerman DM 《The Journal of chemical physics》2006,125(18):184114

We present a detailed comparison of computational efficiency and precision for several free energy difference (DeltaF) methods. The analysis includes both equilibrium and nonequilibrium approaches, and distinguishes between unidirectional and bidirectional methodologies. We are primarily interested in comparing two recently proposed approaches, adaptive integration, and single-ensemble path sampling to more established methodologies. As test cases, we study relative solvation free energies of large changes to the size or charge of a Lennard-Jones particle in explicit water. The results show that, for the systems used in this study, both adaptive integration and path sampling offer unique advantages over the more traditional approaches. Specifically, adaptive integration is found to provide very precise long-simulation DeltaF estimates as compared to other methods used in this report, while also offering rapid estimation of DeltaF. The results demonstrate that the adaptive integration approach is the best overall method for the systems studied here. The single-ensemble path sampling approach is found to be superior to ordinary Jarzynski averaging for the unidirectional, "fast-growth" nonequilibrium case. Closer examination of the path sampling approach on a two-dimensional system suggests it may be the overall method of choice when conformational sampling barriers are high. However, it appears that the free energy landscapes for the systems used in this study have rather modest configurational sampling barriers. 相似文献

18.

Toward an improved clustering of large data sets using maximum common substructures and topological fingerprints

Böcker A 《Journal of chemical information and modeling》2008,48(11):2097-2107

相似文献

19.

Calculation of Hildebrand solubility parameters of some polymers using QSPR methods based on LS-SVM technique and theoretical molecular descriptors 总被引：1，自引：0，他引：1

Nasser Goudarzi M.Arab Chamjangali A.H.Amin 《高分子科学》2014,32(5):587-594

相似文献

20.

Chemometric exploration of the dependencies between molecular modeling descriptors and analytical chemistry data of antihistaminic drugs

Konieczna L Bober L Belka M Ciesielski T Baczeki T 《Journal of AOAC International》2012,95(3):713-723

相似文献