首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.
4.
The external factor variable connectivity index (EFVCI) is interpreted by mining out the structural features hidden in the space spanned by the EFVCI indices through projection pursuit combining with number-theory net (NT-net) on the unit sphere U(Us). Projection pursuit is concerned with "interesting" projections of high-dimensional data sets to machine-pick "interesting" low-dimensional projections of a high-dimensional point cloud by numerically maximizing a certain objective function or projection index. At first, the optimal EFVCI index reaches to -0.80 in the correlation with a retention index of 207 hydrocarbons produced by insects. The EFVCI indices, with regression results of R = 0.99998, s = 3.49, RMSECV = 3.90, and F = 7.9560e+005, obtain high regression quality. The model is proven valid by leave-one-out cross validation. Second, the EFVCI index is interpreted by the structure information, that is, size, branch number, graph center, and branching position of topological structures, which is searched out on the unit sphere U(Us) by projection pursuit. Finally, the interpretation information is used to discover some chemical knowledge concerning the variation of the retention index with the change in chemical structures.  相似文献   

5.
Earlier attempts to assess the complexity of molecules are analyzed and summarized in a number of definitions of general and topological complexity. A concept which specifies topological complexity as overall connectivity, and generalizes the idea of molecular connectivities of Randic, Kier, and Hall, is presented. Two overall connectivity indices, TC and TC1, are defined as the connectivity (the sum of the vertex degrees) of all connected subgraphs in the molecular graph. The contributions to TC and TC1, which originate from all subgraphs having the same number of edges e, form two sets of eth-order overall connectivities, eTC and eTC1. The total number of subgraphs K is also analyzed as a complexity measure, and the vector of its eth-order components, eK, is examined as well. The TC, TC1, and K indices match very well the increase in molecular complexity with the increase in the number of atoms and, at a constant number of atoms, with the increased degree of branching and cyclicity of the molecular skeleton, as well as with the multiplicity of bonds and the presence of heteroatoms. The potential of the three sets of eth-order complexities for applications to QSPR was tested by the modeling of 10 alkane properties (boiling point, critical temperature, critical pressure, critical volume, molar volume, molecular refraction, heat of formation, heat of vaporization, heat of atomization, and surface tension), in parallel with Kier and Hall's molecular connectivity indices (k)chi. The topological complexity indices were shown to outperform molecular connectivity indices in 44 out of the 50 pairs of models compared, including all models with four and five parameters.  相似文献   

6.
on a basic element, vertex degree, of topological graphs, insights are obtained on the structural features hidden in the degree distributions (DD) of saturated hydrocarbons. The investigation shows that the cyclicity and branching features are mainly coded by the different mathematical characteristics of the degree distributions. Surprisingly, the center (or mathematical expectation) of a degree distribution corresponds to the cyclicity of a saturated hydrocarbon, and the dispersion (mean absolute deviation or MAD) around its center of a distribution is a measure of branching. The structural feature such as number of quaternary atoms is also mined out as a special case of branching. The cyclicity and branching information in the present work is with least human intervention, and an interesting thing is that the two features can be unified into the mathematical characteristics of a degree distribution. By the strict mathematical characteristics of a distribution, the structure features within the degree distributions (DD somer domains) are studied. The space spanned by the size (number of carbons), mathematical expectation, and MAD shows some enlightening results. The results also give a new idea on how to model the properties of diverse structures.  相似文献   

7.
8.
Abstract

The development of molecular complexity measures is reviewed. Two novel sets of indices termed topological complexities are introduced proceeding from the idea that topological complexity increases with the overall connectivity of the molecular graph. The latter is assessed as the connectivity of all connected subgraphs in the molecular graph, including the graph itself. First-order, second-order, third-order, etc., topological complexities i TC are defined as the sum of the vertex degrees in the connected subgraphs with one, two, three, etc., edges, respectively. Zero-order complexity is also specified for the simplest subgraphs–the graph vertices. The overall topological complexity TC is then defined as the sum of the complexities of all orders. These new indices mirror the increase in complexity with the increase in the number of atoms and, at a constant number of atoms, with the increase in molecular branching and cyclicity. Topological complexities compare favorably to molecular connectivities of Kier and Hall, as demonstrated in detail for the classical QSPR test-the boiling points of alkanes. Related to the wide application of molecular connectivities to QSAR studies, a similar importance of the new indices is anticipated.  相似文献   

9.
An accurate and generally applicable method for estimating aqueous solubilities for a diverse set of 1297 organic compounds based on multilinear regression and artificial neural network modeling was developed. Molecular connectivity, shape, and atom-type electrotopological state (E-state) indices were used as structural parameters. The data set was divided into a training set of 884 compounds and a randomly chosen test set of 413 compounds. The structural parameters in a 30-12-1 artificial neural network included 24 atom-type E-state indices and six other topological indices, and for the test set, a predictive r2 = 0.92 and s = 0.60 were achieved. With the same parameters the statistics in the multilinear regression were r2 = 0.88 and s = 0.71, respectively.  相似文献   

10.
11.
12.
13.
14.
倪才华  冯志云  黄鹤 《有机化学》2004,24(8):966-969
为了探讨有机化合物的分子结构与理化性质的关系,作者在以前的工作基础上,基于一阶分子连通性拓扑指数构建了一个信息拓扑指数.将拓扑指数与取代脂环烃系列分子的气态标准生成热、气态标准熵、气态标准生成自由能、沸点、临界温度、临界压力、临界体积、汽化热、密度、热容及表面张力等十一种热力学性质及物理化学性质相关联,用一个通用公式对各类性质进行概括,然后用已有的实验数据与拓扑指数进行回归分析,得到一系列计算各性质的经验公式.回归结果发现,分子的理化性质与拓扑指数有较好的相关性,有5类性质的复相关系数大于0.99.用经验公式对各类性质进行重新计算,其结果与实验值比较符合,用实验值对计算值作图,发现各数据点紧靠对角线,说明误差较小.残差分布呈正态分布.  相似文献   

15.
A new extension of the generalized topological indices (GTI) approach is carried out to represent “simple” and “composite” topological indices (TIs) in an unified way. This approach defines a GTI-space from which both simple and composite TIs represent particular subspaces. Accordingly, simple TIs such as Wiener, Balaban, Zagreb, Harary and Randić connectivity indices are expressed by means of the same GTI representation introduced for composite TIs such as hyper-Wiener, molecular topological index (MTI), Gutman index and reverse MTI. Using GTI-space approach we easily identify mathematical relations between some composite and simple indices, such as the relationship between hyper-Wiener and Wiener index and the relation between MTI and first Zagreb index. The relation of the GTI-space with the sub-structural cluster expansion of property/activity is also analysed and some routes for the applications of this approach to QSPR/QSAR are also given.  相似文献   

16.
17.
18.
A comparative study was undertaken to test the ability of several different topological indices to predict the retention indices of chlorinated benzenes on polar and non-polar stationary phases using both correlation coefficients and correctly predicted elution sequences as criteria of fit. The test was performed on three topological indices: connectivity indices, Wiener numbers, and Balaban indices. The regression analyses showed that the molecular connectivity model predicted the retention indices of chlorinated benzenes more successfully than either Wiener numbers or Balaban indices. The results also demonstrated that the major structural property controlling chromatographic behavior was the size of the chlorinated benzene. In addition, the use of the new non-empirical heteroatom parameterization scheme in the calculation of Wiener numbers and Balaban indices was successfully tested for the first time.  相似文献   

19.
20.
The sum of the topological distances in the molecular graph (the Wiener number) is used for a topological characterization of the condensed polycyclic molecular systems. This topological index discriminates well the isomeric cyclic molecules. In addition, it also properly reflects their structural features. On this basis the principal points of molecular cyclicity are formulated in 15 rules.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号