期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Variable selection and interpretation in structure-affinity correlation modeling of estrogen receptor binders

Marini F Roncaglioni A Novic M 《Journal of chemical information and modeling》2005,45(6):1507-1519

相似文献

2.

Classification of GC‐MS measurements of wines by combining data dimension reduction and variable selection techniques

Davide Ballabio Thomas Skov Riccardo Leardi Rasmus Bro 《Journal of Chemometrics》2008,22(8):457-463

Different classification methods (Partial Least Squares Discriminant Analysis, Extended Canonical Variates Analysis and Linear Discriminant Analysis), in combination with variable selection approaches (Forward Selection and Genetic Algorithms), were compared, evaluating their capabilities in the geographical discrimination of wine samples. Sixty‐two samples were analysed by means of dynamic headspace gas chromatography mass spectrometry (HS‐GC‐MS) and the entire chromatographic profile was considered to build the dataset. Since variable selection techniques pose a risk of overfitting when a large number of variables is used, a method for coupling data dimension reduction and variable selection was proposed. This approach compresses windows of the original data by retaining only significant components of local Principal Component Analysis models. The subsequent variable selection is then performed on these locally derived score variables. The results confirmed that the classification models achieved on the reduced data were better than those obtained on the entire chromatographic profile, with the exception of Extended Canonical Variates Analysis, which gave acceptable models in both cases. Copyright © 2008 John Wiley & Sons, Ltd. 相似文献

3.

Toward an optimal procedure for variable selection and QSAR model building

Yasri A Hartsough D 《Journal of chemical information and computer sciences》2001,41(5):1218-1227

相似文献

4.

A simple,fast, and accurate thermodynamic‐based approach for transfer and prediction of gas chromatography retention times between columns and instruments Part III: Retention time prediction on target column

《Journal of separation science》2018,41(12):2559-2564

This is the third part of a three‐part series of papers. In Part I, we presented a method for determining the actual effective geometry of a reference column as well as the thermodynamic‐based parameters of a set of probe compounds in an in‐house mixture. Part II introduced an approach for estimating the actual effective geometry of a target column by collecting retention data of the same mixture of probe compounds on the target column and using their thermodynamic parameters, acquired on the reference column, as a bridge between both systems. Part III, presented here, demonstrates the retention time transfer and prediction from the reference column to the target column using experimental data for a separate mixture of compounds. To predict the retention time of a new compound, we first estimate its thermodynamic‐based parameters on the reference column (using geometric parameters determined previously). The compound's retention time on a second column (of previously determined geometry) is then predicted. The models and the associated optimization algorithms were tested using simulated and experimental data. The accuracy of predicted retention times shows that the proposed approach is simple, fast, and accurate for retention time transfer and prediction between gas chromatography columns. 相似文献

5.

Infrared spectroscopy and multivariate calibration to monitor stability quality parameters of biodiesel

Liliana Fátima Bezerra de Lira Jose Geraldo Andrade Pacheco Eduardo Homem de Siqueira Cavalcanti Maria Fernanda Pimentel 《Microchemical Journal》2010,96(1):126-34

This paper evaluates analytical methods based on near infrared (NIR) and middle infrared (MIR) spectroscopy and multivariate calibration to monitor the stability of biodiesel. There was a focus on three parameters: oxidative stability index, acid number and water content. Ethylic and methylic biodiesel from different feedstocks were used in experiments of accelerated aging, in order to take into account the wide variety of oilseeds and feedstocks available in Brazil. Partial least squares (PLS) and multiple linear regression (MLR) models were developed. Different pre-processing techniques and spectral variable/regions selection algorithms were evaluated. For MLR models, the successive projection algorithm (SPA) was employed. Interval PLS (iPLS) and selection of variables taking into account the significant regression coefficients were used for PLS models. Results showed that both near and middle infrared regions, and all variable selection methods tested were efficient for predicting these three important quality parameters of B100, the root mean squares error of prediction (RMSEP) values being comparable to the reproducibility of the corresponding standard method for each property investigated. 相似文献

6.

Retention prediction using quantitative structure‐retention relationships combined with the hydrophobic subtraction model in reversed‐phase liquid chromatography

Yabin Wen Ruth I. J. Amos Mohammad Talebi Roman Szucs John W. Dolan Christopher A. Pohl Paul R. Haddad 《Electrophoresis》2019,40(18-19):2415-2419

相似文献

7.

Prediction of tumoricidal activity and accumulation of photosensitizers in photodynamic therapy using multiple linear regression and artificial neural networks

Vanyrúr R Héberger K Kövesdi I Jakus J 《Photochemistry and photobiology》2002,75(5):471-478

相似文献

8.

A new and efficient variable selection algorithm based on ant colony optimization. Applications to near infrared spectroscopy/partial least-squares analysis 总被引：3，自引：0，他引：3

Allegrini F Olivieri AC 《Analytica chimica acta》2011,(1):18-25

A new variable selection algorithm is described, based on ant colony optimization (ACO). The algorithm aim is to choose, from a large number of available spectral wavelengths, those relevant to the estimation of analyte concentrations or sample properties when spectroscopic analysis is combined with multivariate calibration techniques such as partial least-squares (PLS) regression. The new algorithm employs the concept of cooperative pheromone accumulation, which is typical of ACO selection methods, and optimizes PLS models using a pre-defined number of variables, employing a Monte Carlo approach to discard irrelevant sensors. The performance has been tested on a simulated system, where it shows a significant superiority over other commonly employed selection methods, such as genetic algorithms. Several near infrared spectroscopic experimental data sets have been subjected to the present ACO algorithm, with PLS leading to improved analytical figures of merit upon wavelength selection. The method could be helpful in other chemometric activities such as classification or quantitative structure-activity relationship (QSAR) problems. 相似文献

9.

Evaluation of stochastic global optimization methods for modeling vapor–liquid equilibrium data

Adrián Bonilla-Petriciolet Gade Pandu Rangaiah Juan Gabriel Segovia-Hernández 《Fluid Phase Equilibria》2010

Parameter estimation for vapor–liquid equilibrium (VLE) data modeling plays an important role in design, optimization and control of separation units. This optimization problem is very challenging due to the high non-linearity of thermodynamic models. Recently, several stochastic optimization methods such as Differential Evolution with Tabu List (DETL) and Particle Swarm Optimization (PSO) have evolved as alternative and reliable strategies for solving global optimization problems including parameter estimation in thermodynamic models. However, these methods have not been applied and compared with respect to other stochastic strategies such as Simulated Annealing (SA), Differential Evolution (DE) and Genetic Algorithm (GA) in the context of parameter estimation for VLE data modeling. Therefore, in this study several stochastic optimization methods are applied to solve parameter estimation problems for VLE modeling using both the classical least squares and maximum likelihood approaches. Specifically, we have tested and compared the reliability and efficiency of SA, GA, DE, DETL and PSO for modeling several binary VLE data using local composition models. These methods were also tested on benchmark problems for global optimization. Our results show that the effectiveness of these stochastic methods varies significantly between the different tested problems and also depends on the stopping criterion especially for SA, GA and PSO. Overall, DE and DETL have better performance for solving the parameter estimation problems in VLE data modeling. 相似文献

10.

A QSPR study on the GC retention times of a series of fatty, dicarboxylic and amino acids by MLR and ANN

Rouhollahi A Shafieyan H Ghasemi JB 《Annali di chimica》2007,97(9):925-933

相似文献

11.

A self‐adaptive genetic algorithm‐artificial neural network algorithm with leave‐one‐out cross validation for descriptor selection in QSAR study

Jingheng Wu Juan Mei Sixiang Wen Siyan Liao Jincan Chen Yong Shen 《Journal of computational chemistry》2010,31(10):1956-1968

相似文献

12.

A Comparative QSRR Study on Enantioseparation of Ethanol Ester Enantiomers in HPLC Using Multivariate Image Analysis,Quantum Mechanical and Structural Descriptors

《中国化学会会志》2017,64(2):176-187

相似文献

13.

Incorporating PLS model information into particle swarm optimization for descriptor selection in QSAR/QSPR

Yong Wang Jing‐Jing Huang Neng Zhou Dong‐Sheng Cao Jie Dong Han‐Xiong Li 《Journal of Chemometrics》2015,29(12):627-636

相似文献

14.

Nano‐scaled intrachain ordering and dynamics in two‐dimensional orientational polymer domains1)

A.V. Maximov S.S. Shevchenko O.G. Maximova Yu.Ya. Gotlib 《Macromolecular Symposia》2001,175(1):309-320

Statistical and local relaxation properties of two‐dimensional finite polymer systems (domains) are considered. The domains consist of a large number of semirigid chains with the finite contour length at free, half‐free and fixed boundary conditions for chain ends. The intermolecular orientational order at short distances between chains in the thick domains is similar to the order in infinite two‐dimensional systems. The correlations of orientation between sufficiently distant elements of different chains decay by the exponential law, but the effective constant of interchain interactions in the domain is proportional to the molecular weight of the chain. At the given intra‐and interchain interactions an elongtation of the chains leads to a local ordering of chains in the domain (at free boundary conditions) or, on the contrary, to the decreasing of the parameter of short‐range orientational order (at fixed and half‐free boundary conditions). Independently of type of boundary conditions the parameter of large‐range orientational order tends to zero with increasing of the chain contour length. Dynamical equations and relaxation spectrums for times of local motions are obtained. From time correlation functions of local relaxation the times of nano‐scaled mobility of chains were calculated in depending on the bending rigidity of chains, the parameter of interchain interactions, and the contour length of chains. At the given intra‐and interchain interactions an elongtation of chains forming the domain leads to to the slowing‐down of local mobility of chains in the domain. The comparison with experimental date obtained by dielectric relaxation and polarized luminescence methods on investigation of nano‐scaled mobility in the dilute melts of comb‐shaped polymers has been carried out. 相似文献

15.

Comparison of ridge regression, partial least-squares, pairwise correlation, forward- and best subset selection methods for prediction of retention indices for aliphatic alcohols

Farkas O Héberger K 《Journal of chemical information and modeling》2005,45(2):339-346

相似文献

16.

Comparison of commercially available genetic algorithms: GAs as variable selection tool

Schefzick S Bradley M 《Journal of computer-aided molecular design》2004,18(7-9):511-521

Many commercially available software programs claim similar efficiency and accuracy as variable selection tools. Genetic algorithms are commonly used variable selection methods where most relevant variables can be differentiated from ‘less important’ variables using evolutionary computing techniques. However, different vendors offer several algorithms, and the puzzling question is: which one is the appropriate method of choice? In this study, several genetic algorithm tools (e.g. GFA from Cerius2, QuaSAR-Evolution from MOE and Partek’s genetic algorithm) were compared. Stepwise multiple linear regression models were generated using the most relevant variables identified by the above genetic algorithms. This procedure led to the successful generation of Quantitative Structure–activity Relationship (QSAR) models for (a) proprietary datasets and (b) the Selwood dataset. 相似文献

17.

A simple,fast, and accurate thermodynamic‐based approach for transfer and prediction of gas chromatography retention times between columns and instruments Part I: Estimation of reference column geometry and thermodynamic parameters

《Journal of separation science》2018,41(12):2544-2552

The transfer of retention times based on thermodynamic models between columns can aid in separation optimization and compound identification in gas chromatography. Although earlier investigations have been reported, this problem remains unsuccessfully addressed. One barrier is poor predictive accuracy when moving from a reference column or system to a new target column or system. This is attributed to challenges associated with the accurate determination of the effective geometric parameters of the columns. To overcome this, we designed least squares‐based models that account for geometric parameters of the columns and thermodynamic parameters of compounds as they partition between mobile and stationary phases. Quasi‐Newton‐based algorithms were then used to perform the numerical optimization. In this first of three parts, the model used to determine the geometric parameters of the reference column and the thermodynamic parameters of compounds subjected to separation is introduced. As will be shown, the overall approach significantly improves the predictive accuracy and transferability of thermodynamic data (and retention times) between columns of the same stationary phase chemistry. The data required for the determination of the thermodynamic parameters and retention time prediction are obtained from fast and simple experiments. The proposed model and optimization algorithms were tested and validated using simulated and experimental data. 相似文献

18.

Genetic Algorithm guided Selection: variable selection and subset selection 总被引：3，自引：0，他引：3

Cho SJ Hermsmeier MA 《Journal of chemical information and computer sciences》2002,42(4):927-936

相似文献

19.

X‐tended target projection (XTP)—comparison with orthogonal partial least squares (OPLS) and PLS post‐processing by similarity transformation (PLS + ST)

Olav M. Kvalheim Tarja Rajalahti Reidar Arneberg 《Journal of Chemometrics》2009,23(1):49-55

Target projection (TP) also called target rotation (TR) was introduced to facilitate interpretation of latent‐variable regression models. Orthogonal partial least squares (OPLS) regression and PLS post‐processing by similarity transform (PLS + ST) represent two alternative algorithms for the same purpose. In addition, OPLS and PLS + ST provide components to explain systematic variation in X orthogonal to the response. We show, that for the same number of components, OPLS and PLS + ST provide score and loading vectors for the predictive latent variable that are the same as for TP except for a scaling factor. Furthermore, we show how the TP approach can be extended to become a hybrid of latent‐variable (LV) regression and exploratory LV analysis and thus embrace systematic variation in X unrelated to the response. Principal component analysis (PCA) of the residual variation after removal of the target component is here used to extract the orthogonal components, but X‐tended TP (XTP) permits other criteria for decomposition of the residual variation. If PCA is used for decomposing the orthogonal variation in XTP, the variance of the major orthogonal components obtained for OPLS and XTP is observed to be almost the same, showing the close relationship between the methods. The XTP approach is tested and compared with OPLS for a three‐component mixture analyzed by infrared spectroscopy and a multicomponent mixture measured by near infrared spectroscopy in a reactor. Copyright © 2008 John Wiley & Sons, Ltd. 相似文献

20.

Hybridized particle swarm algorithm for adaptive structure training of multilayer feed-forward neural network: QSAR studies of bioactivity of organic compounds

Shen Q Jiang JH Jiao CX Lin WQ Shen GL Yu RQ 《Journal of computational chemistry》2004,25(14):1726-1735

The multilayer feed-forward ANN is an important modeling technique used in QSAR studying. The training of ANN is usually carried out only to optimize the weights of the neural network and without paying attention to the network topology. Some other strategies used to train ANN are, first, to discover an optimum structure of the network, and then to find weights for an already defined structure. These methods tend to converge to local optima, and may also lead to overfitting. In this article, a hybridized particle swarm optimization (PSO) approach was applied to the neural network structure training (HPSONN). The continuous version of PSO was used for the weight training of ANN, and the modified discrete PSO was applied to find appropriate the network architecture. The network structure and connectivity are trained simultaneously. The two versions of PSO can jointly search the global optimal ANN architecture and weights. A new objective function is formulated to determine the appropriate network architecture and optimum value of the weights. The proposed HPSONN algorithm was used to predict carcinogenic potency of aromatic amines and biological activity of a series of distamycin and distamycin-like derivatives. The results were compared to those obtained by PSO and GA training in which the network architecture was kept fixed. The comparison demonstrated that the HPSONN is a useful tool for training ANN, which converges quickly towards the optimal position, and can avoid overfitting in some extent. 相似文献