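Quantitative structure-activity/property relationship (QSAR/QSPR) studies rest on the premise that a compound's properties are correlated with its structure. Given any method of describing the structure of the compounds (yielding X), a mathematical model can be built against their properties (taken as Y) and then used to predict unknown compounds. Many variables can be derived from (that is, used to describe) a compound's structure; from a statistical standpoint, one wants as few variables as possible to represent as much information as possible (as in multiple regression analysis). Too many variables not only raise the computational cost and can make the resulting model unstable, giving poorer predictions [1], but different combinations of variables can also give very different results, so the variables need to be compressed and selected. Although variable selection is time-consuming and complicated, its quality has a critical influence on the stability and accuracy of the model; in a sense, it can decide the success or failure of a QSAR/QSPR study. The simplest variable selection method is exhaustive enumeration of combinations, but its computational cost is very high, and it is infeasible in practice when the number of variables is large. Although methods for variable selection have been reported, the problem still awaits further study. This paper focuses on comparing the orthogonal transformation method with best-subset regression and obtains instructive results.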
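The trade-off described in the abstract can be made concrete with a small sketch. The Python code below is illustrative only: it assumes NumPy, uses hypothetical helper names (`n_subsets`, `best_subset`, `pc_scores`), and takes principal-component scores as one possible orthogonal transformation, which may differ from the specific procedure compared in the paper. It shows how fast the number of candidate subsets grows under exhaustive enumeration, a brute-force best-subset search for a fixed subset size, and the compression of descriptors into mutually orthogonal scores.

```python
from itertools import combinations

import numpy as np


def n_subsets(p: int) -> int:
    """Number of non-empty variable subsets an exhaustive search must fit."""
    return 2 ** p - 1


def rss(X: np.ndarray, y: np.ndarray) -> float:
    """Residual sum of squares of a least-squares fit with an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return float(resid @ resid)


def best_subset(X: np.ndarray, y: np.ndarray, k: int):
    """Try every k-variable subset and return (column indices, RSS) of the best."""
    best = min(
        combinations(range(X.shape[1]), k),
        key=lambda cols: rss(X[:, list(cols)], y),
    )
    return best, rss(X[:, list(best)], y)


def pc_scores(X: np.ndarray, k: int) -> np.ndarray:
    """Project centred X onto its first k principal axes (assumed transformation)."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T


if __name__ == "__main__":
    # Synthetic data, not taken from the paper: y depends on columns 1 and 4 only.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(30, 6))
    y = 2.0 * X[:, 1] - 3.0 * X[:, 4] + 0.1 * rng.normal(size=30)

    for p in (6, 20, 50):
        print(p, "descriptors ->", n_subsets(p), "candidate subsets")
    print("best 2-variable subset:", best_subset(X, y, 2))
    print("RSS using 2 principal-component scores:", rss(pc_scores(X, 2), y))
```

The exhaustive search is only practical here because the synthetic problem has six descriptors; the subset count doubles with every added descriptor, which is the infeasibility the abstract refers to.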

6.

Legitimate utilization of large descriptor pools for QSPR/QSAR models

Katritzky AR, Dobchev DA, Slavov S, Karelson M 《Journal of chemical information and modeling》2008,48(11):2207-2213

7.

Application of modified particle swarm optimization as an efficient variable selection strategy in QSAR/QSPR studies

Aboozar Khajeh, Hamid Modarress, Hamed Zeinoddini‐Meymand 《Journal of Chemometrics》2012,26(11-12):598-603

8.

Review: Quantitative structure–activity/property relationships as related to organotin chemistry

《Applied Organometallic Chemistry》2017,31(10)

9.

The great descriptor melting pot: mixing descriptors for the common good of QSAR models

Tseng YJ, Hopfinger AJ, Esposito EX 《Journal of computer-aided molecular design》2012,26(1):39-43

10.

Scores of extended connectivity fingerprint as descriptors in QSPR study of melting point and aqueous solubility

Zhou D, Alelyunas Y, Liu R 《Journal of chemical information and modeling》2008,48(5):981-987

11.

Genetic Algorithm guided Selection: variable selection and subset selection (total citations: 3; self-citations: 0; citations by others: 3)

Cho SJ, Hermsmeier MA 《Journal of chemical information and computer sciences》2002,42(4):927-936

12.

Building a chemical space based on fragment descriptors

Baskin I, Varnek A 《Combinatorial chemistry & high throughput screening》2008,11(8):661-668

13.

On the use of 1H and 13C 1D NMR spectra as QSPR descriptors

Willighagen EL, Denissen HM, Wehrens R, Buydens LM 《Journal of chemical information and modeling》2006,46(2):487-494

14.

Quantitative Structure-Property Relationships Generated with Optimizable Even/Odd Wiener Polynomial Descriptors

O. Ivanciuc, T. Ivanciuc, D. J. Klein 《SAR and QSAR in environmental research》2013,24(1-2):1-16

15.

Evaluation of a novel electronic eigenvalue (EEVA) molecular descriptor for QSAR/QSPR studies: validation using a benchmark steroid data set (total citations: 2; self-citations: 0; citations by others: 2)

Tuppurainen K, Viisas M, Laatikainen R, Peräkylä M 《Journal of chemical information and computer sciences》2002,42(3):607-613

16.

QSAR comparative study of Wiener descriptors for weighted molecular graphs

Ivanciuc O 《Journal of chemical information and computer sciences》2000,40(6):1412-1422

17.

Quantitative structure–property relationship prediction of liquid thermal conductivity for some alcohols

Aboozar Khajeh, Hamid Modarress 《Structural chemistry》2011,22(6):1315-1323

18.

Generalized topological indices. Modeling gas-phase rate coefficients of atmospheric relevance

Estrada E, Matamala AR 《Journal of chemical information and modeling》2007,47(3):794-804

19.

Modified particle swarm optimization method for variable selection in QSAR/QSPR studies

Aboozar Khajeh, Hamid Modarress, Hamed Zeinoddini-Meymand 《Structural chemistry》2013,24(5):1401-1409

20.

QSAR and QSPR studies of a highly structured physicochemical domain

Nicolotti O, Carotti A 《Journal of chemical information and modeling》2006,46(1):264-276

The relevance of terms other than linear when deriving quantitative structure-activity relationship/quantitative structure-property relationship (QSAR/QSPR) models has been rarely considered so far. In this study, the impact of quadratic and interacting terms has been taken into account. The first effect of including such highly structured terms is a significant extension of the parametric domain that moves from the initial N to N(N + 3)/2 parameters. This substantial enlargement over the conventional linear boundaries involves a higher computational cost due to the increased combinatorial number of resulting theoretical QSAR/QSPR models. To face this issue, novel genetic-algorithm-based software, MGZ (multigenetic zooming), was developed and used for both variable selection and model building. To speed up the entire process of domain searching, MGZ was supported with multiple independent evolving populations and genetic storms to further QSAR/QSPR analyses. In addition, a novel fitness function was developed to score models on the basis of their inner predictive capability, assessed on the training set, structure complexity, and presence of nonlinear terms. The models were further validated by monitoring model redundancy and performing intensive randomization runs. The Selwood data set was used as a reference set to derive QSAR models. Furthermore, a QSPR study was conducted on the solubility data set of a large array of organic compounds. The results reported in the present paper demonstrate that our approach is successful in finding linear models, which are at least as good as the models previously derived using standard statistical approaches, and in deriving new nonlinear models with good statistical figures.
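The parameter count N(N + 3)/2 quoted in this abstract is consistent with one natural reading: N linear terms, N squared terms, and N(N − 1)/2 pairwise products, since N + N + N(N − 1)/2 = N(N + 3)/2. The Python sketch below builds such an expanded descriptor matrix under that assumed reading; it requires NumPy, the function names `n_terms` and `expand_quadratic` are hypothetical, and it is not the MGZ software described in the abstract.

```python
from itertools import combinations

import numpy as np


def n_terms(n: int) -> int:
    """Size of the expanded domain: n linear + n squared + n(n-1)/2 cross terms."""
    return n * (n + 3) // 2


def expand_quadratic(X: np.ndarray) -> np.ndarray:
    """Append squared and pairwise-product columns to a descriptor matrix."""
    n = X.shape[1]
    squares = X ** 2
    # One product column for every unordered pair of distinct descriptors.
    cross = [X[:, i] * X[:, j] for i, j in combinations(range(n), 2)]
    return np.column_stack([X, squares] + cross)


if __name__ == "__main__":
    # Arbitrary toy matrix: 3 compounds described by 4 descriptors.
    X = np.arange(12.0).reshape(3, 4)
    Z = expand_quadratic(X)
    print("columns before:", X.shape[1], "after:", Z.shape[1])
    print("formula gives:", n_terms(X.shape[1]))
```

Because the expanded matrix grows quadratically in N, the number of candidate models over it grows far faster than over the linear terms alone, which is the computational cost the abstract cites as the motivation for a genetic-algorithm search.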