首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 984 毫秒
1.
2.
Different classification methods (Partial Least Squares Discriminant Analysis, Extended Canonical Variates Analysis and Linear Discriminant Analysis), in combination with variable selection approaches (Forward Selection and Genetic Algorithms), were compared, evaluating their capabilities in the geographical discrimination of wine samples. Sixty‐two samples were analysed by means of dynamic headspace gas chromatography mass spectrometry (HS‐GC‐MS) and the entire chromatographic profile was considered to build the dataset. Since variable selection techniques pose a risk of overfitting when a large number of variables is used, a method for coupling data dimension reduction and variable selection was proposed. This approach compresses windows of the original data by retaining only significant components of local Principal Component Analysis models. The subsequent variable selection is then performed on these locally derived score variables. The results confirmed that the classification models achieved on the reduced data were better than those obtained on the entire chromatographic profile, with the exception of Extended Canonical Variates Analysis, which gave acceptable models in both cases. Copyright © 2008 John Wiley & Sons, Ltd.  相似文献   

3.
4.
This is the third part of a three‐part series of papers. In Part I, we presented a method for determining the actual effective geometry of a reference column as well as the thermodynamic‐based parameters of a set of probe compounds in an in‐house mixture. Part II introduced an approach for estimating the actual effective geometry of a target column by collecting retention data of the same mixture of probe compounds on the target column and using their thermodynamic parameters, acquired on the reference column, as a bridge between both systems. Part III, presented here, demonstrates the retention time transfer and prediction from the reference column to the target column using experimental data for a separate mixture of compounds. To predict the retention time of a new compound, we first estimate its thermodynamic‐based parameters on the reference column (using geometric parameters determined previously). The compound's retention time on a second column (of previously determined geometry) is then predicted. The models and the associated optimization algorithms were tested using simulated and experimental data. The accuracy of predicted retention times shows that the proposed approach is simple, fast, and accurate for retention time transfer and prediction between gas chromatography columns.  相似文献   

5.
This paper evaluates analytical methods based on near infrared (NIR) and middle infrared (MIR) spectroscopy and multivariate calibration to monitor the stability of biodiesel. There was a focus on three parameters: oxidative stability index, acid number and water content. Ethylic and methylic biodiesel from different feedstocks were used in experiments of accelerated aging, in order to take into account the wide variety of oilseeds and feedstocks available in Brazil. Partial least squares (PLS) and multiple linear regression (MLR) models were developed. Different pre-processing techniques and spectral variable/regions selection algorithms were evaluated. For MLR models, the successive projection algorithm (SPA) was employed. Interval PLS (iPLS) and selection of variables taking into account the significant regression coefficients were used for PLS models. Results showed that both near and middle infrared regions, and all variable selection methods tested were efficient for predicting these three important quality parameters of B100, the root mean squares error of prediction (RMSEP) values being comparable to the reproducibility of the corresponding standard method for each property investigated.  相似文献   

6.
7.
8.
A new variable selection algorithm is described, based on ant colony optimization (ACO). The algorithm aim is to choose, from a large number of available spectral wavelengths, those relevant to the estimation of analyte concentrations or sample properties when spectroscopic analysis is combined with multivariate calibration techniques such as partial least-squares (PLS) regression. The new algorithm employs the concept of cooperative pheromone accumulation, which is typical of ACO selection methods, and optimizes PLS models using a pre-defined number of variables, employing a Monte Carlo approach to discard irrelevant sensors. The performance has been tested on a simulated system, where it shows a significant superiority over other commonly employed selection methods, such as genetic algorithms. Several near infrared spectroscopic experimental data sets have been subjected to the present ACO algorithm, with PLS leading to improved analytical figures of merit upon wavelength selection. The method could be helpful in other chemometric activities such as classification or quantitative structure-activity relationship (QSAR) problems.  相似文献   

9.
Parameter estimation for vapor–liquid equilibrium (VLE) data modeling plays an important role in design, optimization and control of separation units. This optimization problem is very challenging due to the high non-linearity of thermodynamic models. Recently, several stochastic optimization methods such as Differential Evolution with Tabu List (DETL) and Particle Swarm Optimization (PSO) have evolved as alternative and reliable strategies for solving global optimization problems including parameter estimation in thermodynamic models. However, these methods have not been applied and compared with respect to other stochastic strategies such as Simulated Annealing (SA), Differential Evolution (DE) and Genetic Algorithm (GA) in the context of parameter estimation for VLE data modeling. Therefore, in this study several stochastic optimization methods are applied to solve parameter estimation problems for VLE modeling using both the classical least squares and maximum likelihood approaches. Specifically, we have tested and compared the reliability and efficiency of SA, GA, DE, DETL and PSO for modeling several binary VLE data using local composition models. These methods were also tested on benchmark problems for global optimization. Our results show that the effectiveness of these stochastic methods varies significantly between the different tested problems and also depends on the stopping criterion especially for SA, GA and PSO. Overall, DE and DETL have better performance for solving the parameter estimation problems in VLE data modeling.  相似文献   

10.
11.
12.
13.
14.
Statistical and local relaxation properties of two‐dimensional finite polymer systems (domains) are considered. The domains consist of a large number of semirigid chains with the finite contour length at free, half‐free and fixed boundary conditions for chain ends. The intermolecular orientational order at short distances between chains in the thick domains is similar to the order in infinite two‐dimensional systems. The correlations of orientation between sufficiently distant elements of different chains decay by the exponential law, but the effective constant of interchain interactions in the domain is proportional to the molecular weight of the chain. At the given intra‐and interchain interactions an elongtation of the chains leads to a local ordering of chains in the domain (at free boundary conditions) or, on the contrary, to the decreasing of the parameter of short‐range orientational order (at fixed and half‐free boundary conditions). Independently of type of boundary conditions the parameter of large‐range orientational order tends to zero with increasing of the chain contour length. Dynamical equations and relaxation spectrums for times of local motions are obtained. From time correlation functions of local relaxation the times of nano‐scaled mobility of chains were calculated in depending on the bending rigidity of chains, the parameter of interchain interactions, and the contour length of chains. At the given intra‐and interchain interactions an elongtation of chains forming the domain leads to to the slowing‐down of local mobility of chains in the domain. The comparison with experimental date obtained by dielectric relaxation and polarized luminescence methods on investigation of nano‐scaled mobility in the dilute melts of comb‐shaped polymers has been carried out.  相似文献   

15.
16.
Many commercially available software programs claim similar efficiency and accuracy as variable selection tools. Genetic algorithms are commonly used variable selection methods where most relevant variables can be differentiated from ‘less important’ variables using evolutionary computing techniques. However, different vendors offer several algorithms, and the puzzling question is: which one is the appropriate method of choice? In this study, several genetic algorithm tools (e.g. GFA from Cerius2, QuaSAR-Evolution from MOE and Partek’s genetic algorithm) were compared. Stepwise multiple linear regression models were generated using the most relevant variables identified by the above genetic algorithms. This procedure led to the successful generation of Quantitative Structure–activity Relationship (QSAR) models for (a) proprietary datasets and (b) the Selwood dataset.  相似文献   

17.
The transfer of retention times based on thermodynamic models between columns can aid in separation optimization and compound identification in gas chromatography. Although earlier investigations have been reported, this problem remains unsuccessfully addressed. One barrier is poor predictive accuracy when moving from a reference column or system to a new target column or system. This is attributed to challenges associated with the accurate determination of the effective geometric parameters of the columns. To overcome this, we designed least squares‐based models that account for geometric parameters of the columns and thermodynamic parameters of compounds as they partition between mobile and stationary phases. Quasi‐Newton‐based algorithms were then used to perform the numerical optimization. In this first of three parts, the model used to determine the geometric parameters of the reference column and the thermodynamic parameters of compounds subjected to separation is introduced. As will be shown, the overall approach significantly improves the predictive accuracy and transferability of thermodynamic data (and retention times) between columns of the same stationary phase chemistry. The data required for the determination of the thermodynamic parameters and retention time prediction are obtained from fast and simple experiments. The proposed model and optimization algorithms were tested and validated using simulated and experimental data.  相似文献   

18.
19.
Target projection (TP) also called target rotation (TR) was introduced to facilitate interpretation of latent‐variable regression models. Orthogonal partial least squares (OPLS) regression and PLS post‐processing by similarity transform (PLS + ST) represent two alternative algorithms for the same purpose. In addition, OPLS and PLS + ST provide components to explain systematic variation in X orthogonal to the response. We show, that for the same number of components, OPLS and PLS + ST provide score and loading vectors for the predictive latent variable that are the same as for TP except for a scaling factor. Furthermore, we show how the TP approach can be extended to become a hybrid of latent‐variable (LV) regression and exploratory LV analysis and thus embrace systematic variation in X unrelated to the response. Principal component analysis (PCA) of the residual variation after removal of the target component is here used to extract the orthogonal components, but X‐tended TP (XTP) permits other criteria for decomposition of the residual variation. If PCA is used for decomposing the orthogonal variation in XTP, the variance of the major orthogonal components obtained for OPLS and XTP is observed to be almost the same, showing the close relationship between the methods. The XTP approach is tested and compared with OPLS for a three‐component mixture analyzed by infrared spectroscopy and a multicomponent mixture measured by near infrared spectroscopy in a reactor. Copyright © 2008 John Wiley & Sons, Ltd.  相似文献   

20.
The multilayer feed-forward ANN is an important modeling technique used in QSAR studying. The training of ANN is usually carried out only to optimize the weights of the neural network and without paying attention to the network topology. Some other strategies used to train ANN are, first, to discover an optimum structure of the network, and then to find weights for an already defined structure. These methods tend to converge to local optima, and may also lead to overfitting. In this article, a hybridized particle swarm optimization (PSO) approach was applied to the neural network structure training (HPSONN). The continuous version of PSO was used for the weight training of ANN, and the modified discrete PSO was applied to find appropriate the network architecture. The network structure and connectivity are trained simultaneously. The two versions of PSO can jointly search the global optimal ANN architecture and weights. A new objective function is formulated to determine the appropriate network architecture and optimum value of the weights. The proposed HPSONN algorithm was used to predict carcinogenic potency of aromatic amines and biological activity of a series of distamycin and distamycin-like derivatives. The results were compared to those obtained by PSO and GA training in which the network architecture was kept fixed. The comparison demonstrated that the HPSONN is a useful tool for training ANN, which converges quickly towards the optimal position, and can avoid overfitting in some extent.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号