Similar articles
 20 similar articles found (search time: 78 ms)
1.
Fourier-transform mid-infrared (FT-MIR) spectroscopy, combined with partial least-squares (PLS) regression and iterative predictor weighting (IPW) as the feature selection method, was used to develop reduced-spectrum calibration models based on a few IR bands. The models provide near-real-time predictions of two key parameters for the characterization of finished red wines, both essential from a quality assurance standpoint: total and volatile acidity. Separate PLS calibration models, correlating IR data (considering only those regions showing a high signal-to-noise ratio) with each response studied, were developed. Wavenumber selection was also performed by applying IPW-PLS so that only significant predictors were taken into account, in an attempt to improve the quality of the final models. Using both PLS and IPW-PLS regression, the two modelled responses were predicted with very high reliability, with RMSECV and RMSEP values on the order of 1% (comparable in accuracy to the results provided by the respective reference analysis methods). An important advantage of the IPW-PLS method was the low number of original variables needed for modelling both total acidity (22 significant wavenumbers) and volatile acidity (only 11 selected predictor variables), so that variable selection enhanced the stability and parsimony of the final calibration models. The high quality of the proposed calibration models supports implementing them as a fast and reliable tool in routine analysis for the determination of critical parameters for wine quality.

2.
A method for sulfur determination in diesel fuel employing near infrared spectroscopy, variable selection and multivariate calibration is described. The performances of the principal component regression (PCR) and partial least squares (PLS) chemometric methods were compared with those of multiple linear regression (MLR), performed after variable selection based on the genetic algorithm (GA) or the successive projections algorithm (SPA). Ninety-seven diesel samples were divided into three sets (41 for calibration, 30 for internal validation and 26 for external validation), each of them covering the full range of sulfur concentrations (from 0.07 to 0.33% w/w). Transflectance measurements were performed from 850 to 1800 nm. Although principal component analysis identified the presence of three groups, PLS, PCR and MLR provided models whose predictive capabilities were independent of the diesel type. Calibration with PLS and PCR employing all 454 wavelengths provided root mean square errors of prediction (RMSEP) of 0.036% and 0.043% for the validation set, respectively. The use of GA and SPA for variable selection provided calibration models based on 19 and 9 wavelengths, with an RMSEP of 0.031% (PLS-GA), 0.022% (MLR-SPA) and 0.034% (MLR-GA). As the ASTM 4294 method allows a reproducibility of 0.05%, it can be concluded that a method based on NIR spectroscopy and multivariate calibration can be employed for the determination of sulfur in diesel fuels. Furthermore, the selection of variables can provide more robust calibration models, and SPA provided more parsimonious models than GA.
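The comparison against the 0.05% reproducibility limit quoted above can be sketched as follows. This is a minimal illustration of how an RMSEP is usually computed, not the authors' code; the sample values are hypothetical and the function name `rmsep` is an assumption.

```python
import math


def rmsep(y_true, y_pred):
    """Root mean square error of prediction over a validation set."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical reference and predicted sulfur contents (% w/w), for illustration only.
reference = [0.07, 0.15, 0.22, 0.33]
predicted = [0.09, 0.13, 0.25, 0.31]

# Reproducibility limit as quoted in the abstract (% w/w).
ASTM_REPRODUCIBILITY = 0.05

error = rmsep(reference, predicted)
within = error <= ASTM_REPRODUCIBILITY
print(f"RMSEP = {error:.4f}% w/w, within reproducibility: {within}")
```

A model whose RMSEP falls below the reproducibility of the reference method is, by this criterion, no less precise than the reference method itself.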

3.
Using principal component regression (PCR) as a multivariate calibration tool always raises the question of which subset of factors, i.e. principal components (PCs), gives the best calibration model. Normally, factor selection is based on deterministic methods such as top–down procedures, forward–backward stepwise variable selection or correlated principal component regression (CPCR). In contrast, we applied a stochastic method, a genetic algorithm (GA), for factor selection in this paper. A new kind of fitness function was applied, which combined the prediction errors of the calibration set and an independent validation set. The performance of eigenvalue and correlation ranking was compared. A general statistical criterion for judging the significance of differences between individual calibration models is introduced. In this context, it is shown that a very simple approximation formula, which includes only the number of standards, holds for the uncertainties of the standard deviations representing the prediction errors. For the current applications, the GA is shown to give results very close to the CPCR solutions.

4.
This paper evaluates analytical methods based on near infrared (NIR) and middle infrared (MIR) spectroscopy and multivariate calibration to monitor the stability of biodiesel. The focus was on three parameters: oxidative stability index, acid number and water content. Ethylic and methylic biodiesel from different feedstocks were used in accelerated aging experiments, in order to take into account the wide variety of oilseeds and feedstocks available in Brazil. Partial least squares (PLS) and multiple linear regression (MLR) models were developed. Different pre-processing techniques and spectral variable/region selection algorithms were evaluated. For MLR models, the successive projections algorithm (SPA) was employed. Interval PLS (iPLS) and selection of variables based on the significant regression coefficients were used for PLS models. Results showed that both the near and middle infrared regions, and all variable selection methods tested, were efficient for predicting these three important quality parameters of B100, with root mean square error of prediction (RMSEP) values comparable to the reproducibility of the corresponding standard method for each property investigated.

5.
This paper reports the results of a rapid method to determine sucrose in chocolate mass using near infrared spectroscopy (NIRS). We applied a broad-based calibration approach, which consists of combining samples of various types of chocolate mass in a single calibration. This approach increases the concentration range for one or more compositional parameters, improves the model performance and requires just one calibration model for several recipes. The data were modelled using partial least squares (PLS) and multiple linear regression (MLR). The MLR models were developed using variable selection based on the regression coefficients of PLS and on a genetic algorithm (GA). High correlation coefficients (0.998, 0.997, 0.998 for PLS, MLR and GA-MLR, respectively) and low prediction errors confirm the good predictability of the models. The results show that NIR can be used as a rapid method to determine sucrose in chocolate mass in chocolate factories.

6.
Carolina C. Acebal 《Talanta》2010,82(1):222-226
A novel variable selection strategy for multiple linear regression (MLR), the successive projections algorithm (SPA), was applied to spectrophotometric data (190-320 nm) for the simultaneous determination of monosodium glutamate (MSG), guanosine-5′-monophosphate (GMP) and inosine-5′-monophosphate (IMP) in dehydrated broth samples. This selection method uses simple operations in a vector space to minimize variable collinearity and has become an interesting variable selection strategy for multivariate calibration. In this work, nine, six and four wavelengths for MSG, GMP and IMP, respectively, were selected to construct calibration models that successfully resolve the serious spectral overlap in samples containing these analytes. The relative errors of prediction (REP) for the validation set were 2.3%, 0.9% and 1.8% for MSG, GMP and IMP, respectively. Commercial samples were analysed and a recovery study was carried out to verify the accuracy of the proposed method, with satisfactory results. A continuous flow system was used to develop a simple, cheap and rapid method (sample throughput: 200 h⁻¹), without any previous extraction step.
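The "simple operations in a vector space" mentioned above can be illustrated with a minimal sketch of the projection phase of SPA. It assumes the commonly described form of the algorithm (project the remaining columns onto the subspace orthogonal to the last selected column, then pick the column with the largest remaining norm). The function name `spa_select`, the starting column and the toy data are assumptions, and the validation step that chooses the starting column and the number of variables is omitted.

```python
def spa_select(X, start, n_select):
    """Pick n_select column indices of X with low mutual collinearity.

    X is a list of rows (samples x variables); `start` is the first column.
    """
    n_cols = len(X[0])
    if not 0 <= start < n_cols or not 1 <= n_select <= n_cols:
        raise ValueError("start or n_select out of range")
    # Work on a column-wise copy so the caller's data is left untouched.
    cols = [[row[j] for row in X] for j in range(n_cols)]
    selected = [start]
    while len(selected) < n_select:
        ref = cols[selected[-1]]
        ref_norm2 = sum(v * v for v in ref)
        best, best_norm2 = None, -1.0
        for j in range(n_cols):
            if j in selected:
                continue
            if ref_norm2 > 0.0:
                # Remove the component of column j that lies along the reference.
                coef = sum(a * b for a, b in zip(cols[j], ref)) / ref_norm2
                cols[j] = [a - coef * b for a, b in zip(cols[j], ref)]
            norm2 = sum(v * v for v in cols[j])
            if norm2 > best_norm2:
                best, best_norm2 = j, norm2
        selected.append(best)
    return selected


# Toy data: column 1 is almost collinear with column 0, column 2 is orthogonal to it.
X = [
    [1.0, 2.0, 0.0],
    [0.0, 0.0, 1.0],
    [0.0, 0.1, 0.0],
]
chosen = spa_select(X, start=0, n_select=2)
print(chosen)
```

In this toy case the nearly collinear column is passed over in favour of the orthogonal one, which is the behaviour the abstract describes as minimizing variable collinearity.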

7.
The non-linear regression technique known as the alternating conditional expectations (ACE) method is only applicable when the number of objects available for calibration is considerably greater than the number of predictors considered. Alternating conditional expectations regression with selection of significant predictors by genetic algorithms (GA-ACE), the non-linear regression technique presented here, is based on the ACE algorithm but introduces several modifications to resolve the applicability limitations of the original ACE method, thus facilitating the practical implementation of a very interesting calibration tool. To overcome the lack of reliability displayed by the original ACE algorithm on data sets with too large a number of variables, GA-ACE applies genetic algorithms as a variable selection technique before the non-linear regression model is developed, selecting a reduced subset of significant predictors able to accurately model and predict the response variable considered. Furthermore, GA-ACE provides two alternative application approaches: it allows either prior data compression, computing a number of principal components that are subsequently subjected to GA selection, or working directly on the original variables. In this study, GA-ACE was applied to two real calibration problems with a very low observation/variable ratio (NIR data), and the results were compared with those obtained by several commonly used linear regression techniques. With the GA-ACE non-linear method, notably improved regression models were developed for the two response variables modelled, with root mean square errors of the residuals in external prediction (RMSEP) equal to 11.51 and 6.03% for the moisture and lipid contents of roasted coffee samples, respectively.
The improvement achieved by the new non-linear method is even more remarkable in view of the results obtained with the best-performing linear method (IPW-PLS) applied to predict the same responses (14.61 and 7.74% RMSEP, respectively).

8.
Near-infrared (NIR) spectroscopy and characteristic variable selection methods were used to develop a quick method for the determination of cellulose, hemicellulose, and lignin contents in Sargassum horneri. Calibration models for cellulose, hemicellulose, and lignin in Sargassum horneri were first established using partial least squares regression with all variables (full-PLSR). PLSR calibration models were then established with four characteristic variable selection methods: interval partial least squares (iPLS), competitive adaptive reweighted sampling (CARS), correlation coefficient (CC), and genetic algorithm (GA). The results showed that the four resulting calibration models, namely iPLS-PLSR, CARS-PLSR, CC-PLSR, and GA-PLSR, performed better than the full-PLSR calibration model. The iPLS method gave the best-performing models. For iPLS-PLSR, the determination coefficient (R²), root mean square error (RMSE), and residual predictive deviation (RPD) of the prediction set were as follows: 0.8955, 0.8232%, and 3.0934 for cellulose; 0.8669, 0.4697%, and 2.7406 for hemicellulose; and 0.7307, 0.7533%, and 1.9272 for lignin, respectively. These findings indicate that the NIR calibration models can be used to predict cellulose, hemicellulose, and lignin contents in Sargassum horneri quickly and accurately.
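The three figures of merit reported above (R², RMSE and RPD) can be computed as in the sketch below. It assumes the usual definitions, with RPD taken as the sample standard deviation of the reference values divided by the RMSE; the abstract does not state which variant the authors used, and the data are hypothetical.

```python
import math
import statistics


def prediction_metrics(y_true, y_pred):
    """Return (R2, RMSE, RPD) for a prediction set."""
    if len(y_true) != len(y_pred) or len(y_true) < 2:
        raise ValueError("need at least two paired values")
    mean_true = statistics.fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / len(y_true))
    # Assumed definition: sample standard deviation of the reference values over RMSE.
    rpd = statistics.stdev(y_true) / rmse
    return r2, rmse, rpd


# Hypothetical reference and predicted contents (%), for illustration only.
reference = [10.0, 12.0, 14.0, 16.0, 18.0]
predicted = [10.5, 11.5, 14.5, 15.5, 18.5]

r2, rmse, rpd = prediction_metrics(reference, predicted)
print(f"R2 = {r2:.4f}, RMSE = {rmse:.4f}, RPD = {rpd:.4f}")
```

Because RPD scales the error by the spread of the reference values, it allows models for properties with different concentration ranges, such as the three constituents above, to be compared on one scale.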

9.
The application of a hand scanner to the multivariate quantification of povidone-iodine (PVI), a popular antiseptic agent, in some pharmaceutical products is presented. Brightness, contrast, and mixed gamma were the adjustable scanner parameters. To select the optimum values of the scanner parameters, partial least squares (PLS) and multiple linear regression (MLR), coupled with a genetic algorithm (GA), were performed. For the selected variables, the MLR and PLS performances were similar and appropriate. From the results obtained, it was concluded that the simpler MLR method could be successfully applied instead of PLS, which requires more statistical experience. The concentration range considered for PVI in the calibration and prediction samples was 0.0-10.0% (w/v). For the analysis of pharmaceutical samples, the generalized standard addition method (GSAM) was applied (on the variables selected by GA) and desirable results were obtained. A relative standard error (RSE) of less than 8% was obtained for the majority of samples analyzed.

10.
A partial least squares (PLS) regression model based on attenuated total reflectance–Fourier transform infrared spectra of heated olive oil samples has been developed for the determination of polymerized triacylglycerides (PTGs) generated during thermal treatment of oil. Three different approaches for selecting the spectral regions used to build the PLS model were tested and compared: (1) variable selection based on expert knowledge, (2) uninformative variable elimination PLS, and (3) interval PLS. Each of the three variable selection methods provided PLS models from heated olive oil samples with excellent performance for the prediction of PTGs in fried olive oils, with comparable model statistics. However, besides a high coefficient of determination (R² of 0.991) and low calibration, validation, and prediction errors of 1.14%, 1.21%, and 1.40% w/w, respectively, variable selection based on expert knowledge additionally gave almost identical, low calibration (−0.0017% w/w) and prediction (−0.0023% w/w) biases. Furthermore, it was verified that the determination of PTGs was not influenced by the type of foodstuff fried in the olive oil.

11.
Five sample selection methods for local chemometrics and three variable selection algorithms were compared for the development and transfer of near infrared prediction models for protein and oil in whole soybeans. Two new methods based on a similarity index considering the Euclidean distance among Fourier coefficients were introduced and tested against more common approaches (locally weighted regression, LOCAL). Genetic algorithms were also challenged with the development of models based on particle swarm optimization (PSO). A modification to the original PSO model was introduced. Sample and variable selection methods, as well as their combinations, were tested in the transfer of models in intra‐ and inter‐brand situations using two Foss Infratecs and two Bruins OmegAnalyzerGs. For each brand, a master was designated and its models were transferred onto the second unit of its network and the two units of the second brand. Calibration models were proven transferable from brand to brand with similar or better precision than when all instruments were calibrated on their own calibration sets (relative predictive determinant (RPD) improving from 10.42 to 12.76 and 12.39 in intra‐brand standardization for the Infratec network with local and variable selection methods, respectively). These methods provided contrasting results depending on the instrument, the parameter, and the variability of interest. Copyright © 2010 John Wiley & Sons, Ltd.

12.
In this paper, we propose a genetic algorithm‐based wavelength selection (GAWLS) method for visible and near‐infrared (Vis/NIR) spectral calibration. The objective of GAWLS is to construct robust and predictive regression models by selecting informative wavelength regions. To demonstrate the ability of the proposed method, regression models for soil properties and sugar content of apples are constructed by using GAWLS and other variable selection methods. Copyright © 2010 John Wiley & Sons, Ltd.

13.
In this work, the development of a robust spectroscopic procedure for determining, simultaneously and non-destructively, relevant quality parameters of processed tomato products (total and soluble solids, total acidity, total sugars, glucose and fructose) is described. Samples of tomato concentrate products with total solids content ranging from 6.9 to 35.9% were collected from Latin America, the US and Europe, and NIR spectra were acquired in the 4000-10,000 cm⁻¹ region. The original spectra were pre-processed by mean-smoothing or by Fourier filter, followed by multiplicative signal correction (MSC) or derivatives. Partial least squares (PLS2 and PLS1) models were built and their predictive abilities were compared through the RMSEP of external validation. The PLS2 regression had better predictive abilities for four of the six properties under study, namely total solids, total sugars, glucose and fructose. In addition, the PLS2 model was less complex than the PLS1 models, in the sense that it required only four factors whereas the PLS1 models required from 4 to 11 factors. The standard error of prediction (SEP%) of the PLS2 model for each property was: total solids, 2.67; soluble solids, 1.14; total acidity, 9.60; total sugar, 18.69; glucose, 11.60; and fructose, 13.45.
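The multiplicative signal correction (MSC) step named above can be sketched as follows. This is the textbook form of MSC with the mean spectrum as reference (each spectrum is regressed on the reference and then corrected with the fitted intercept and slope); the abstract does not describe the authors' exact settings, and the function name `msc` and the toy spectra are assumptions.

```python
def msc(spectra):
    """Apply multiplicative signal correction using the mean spectrum as reference."""
    n, p = len(spectra), len(spectra[0])
    reference = [sum(s[j] for s in spectra) / n for j in range(p)]
    ref_mean = sum(reference) / p
    sxx = sum((r - ref_mean) ** 2 for r in reference)
    if sxx == 0.0:
        raise ValueError("reference spectrum is flat; MSC is undefined")
    corrected = []
    for s in spectra:
        s_mean = sum(s) / p
        # Least-squares fit of s = intercept + slope * reference.
        sxy = sum((r - ref_mean) * (v - s_mean) for r, v in zip(reference, s))
        slope = sxy / sxx
        if slope == 0.0:
            raise ValueError("spectrum is uncorrelated with the reference")
        intercept = s_mean - slope * ref_mean
        corrected.append([(v - intercept) / slope for v in s])
    return corrected


# Toy example: the second spectrum is the first with an additive offset (0.5)
# and a multiplicative scatter factor (2.0).
base = [1.0, 2.0, 4.0, 3.0]
spectra = [base, [0.5 + 2.0 * v for v in base]]
corrected = msc(spectra)
print(corrected)
```

In this toy case both corrected spectra collapse onto the same curve, which is the intended effect: offset and scaling differences caused by scatter are removed before the PLS model is fitted.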

14.
Ni Xin  Qinghua Meng  Yizhen Li  Yuzhu Hu 《Chinese Journal of Chemistry》2011,29(11):2533-2540
This paper demonstrates the possibility of using near infrared (NIR) spectral similarity as a rapid method to estimate the quality of Flos Lonicerae. Variable selection together with modelling techniques is used to select representative variables, which are then used to calculate the similarity. NIR is used to build calibration models to predict the bacteriostatic activity of Flos Lonicerae. The bacteriostatic activity is determined by an in vitro experiment. Models are built for Gram‐positive bacteria and for Gram‐negative bacteria. A genetic algorithm combined with partial least squares regression (GA‐PLS) is used to perform the calibration. The results of the GA‐PLS models are compared with those of interval partial least squares (iPLS) models, full‐spectrum PLS models and full‐spectrum principal component regression (PCR) models. The variables in the two GA‐PLS models are then combined and used to calculate the NIR spectral similarity of samples. The similarity based on the characteristic variables and the similarity based on the full spectrum are each used to evaluate the fingerprints of Flos Lonicerae. The results show that the combination of variable selection, modelling techniques and similarity analysis might be a powerful tool for quality control of traditional Chinese medicine (TCM).
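The idea of computing a similarity on selected variables rather than on the full spectrum can be sketched as below. The abstract does not state which similarity measure was used, so cosine similarity is an assumption here, as are the function name `cosine_similarity`, the toy spectra and the selected indices.

```python
import math


def cosine_similarity(a, b, selected=None):
    """Cosine similarity of two spectra, optionally restricted to selected indices."""
    if selected is not None:
        a = [a[i] for i in selected]
        b = [b[i] for i in selected]
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("similarity is undefined for an all-zero spectrum")
    return dot / (norm_a * norm_b)


# Hypothetical spectra: identical in shape on the first three variables,
# different on the last one.
sample = [1.0, 2.0, 3.0, 4.0]
standard = [2.0, 4.0, 6.0, 9.0]

# Hypothetical indices standing in for the variables retained by a selection step.
selected_variables = [0, 1, 2]

sim_full = cosine_similarity(sample, standard)
sim_selected = cosine_similarity(sample, standard, selected_variables)
print(f"full spectrum: {sim_full:.4f}, selected variables: {sim_selected:.4f}")
```

Restricting the calculation to the selected variables changes which spectral differences count, so the two similarity values can rank samples differently.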

15.
16.
Multivariate calibration problems often involve identifying a meaningful subset of variables from a vast number of candidates in order to better predict the output variables. A new graph-theoretic method based on partial correlations (variable interaction network, VIN) is proposed. Many well-studied, representative calibration datasets spanning different application domains are selected for investigating the performance. Partial least squares (PLS) regression models combined with variable selection techniques are employed for benchmarking the performance. Subsets containing different numbers of variables are retained for the final analysis after VIN selection, and progressive prediction accuracies are used for comparison. VIN-PLS results show significant improvement in prediction efficiency and variable subset optimization. An improvement of up to 45% over existing methods, with significantly fewer variables, is achieved using the new method. Advantages of VIN-based variable selection are highlighted.
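The partial correlations on which the method above is based can be illustrated with the standard first-order formula, which measures the correlation between two variables after the linear effect of a third is removed. This sketch shows only that building block, not the VIN method itself; the function names and the toy data are assumptions.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    sab = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    saa = sum((x - mean_a) ** 2 for x in a)
    sbb = sum((y - mean_b) ** 2 for y in b)
    return sab / math.sqrt(saa * sbb)


def partial_correlation(x, y, z):
    """First-order partial correlation of x and y, controlling for z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Toy data: x and y both follow z plus small independent perturbations,
# so their raw correlation is driven almost entirely by z.
z = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x = [1.5, 1.5, 3.5, 3.5, 5.5, 5.5]
y = [1.5, 2.5, 2.5, 3.5, 5.5, 6.5]

raw = pearson(x, y)
partial = partial_correlation(x, y, z)
print(f"raw correlation: {raw:.4f}, partial correlation given z: {partial:.4f}")
```

Here the raw correlation between x and y is high while the partial correlation is close to zero, which is the kind of distinction that lets a partial-correlation network separate direct from indirect relationships among variables.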

17.
18.
Rodrigues LO  Cardoso JP  Menezes JC 《Talanta》2008,75(5):1203-1207
The use of near infrared spectroscopy (NIRS) in downstream solvent-based processing steps of an active pharmaceutical ingredient (API) is reported. A single quantitative method was developed for API content assessment in the organic phase of a liquid–liquid extraction process and in multiple process streams of subsequent concentration and depuration steps. A new methodology based on spectra combinations and variable selection by genetic algorithm was used, with an effective improvement in the predictive ability of the calibration model. A root mean square error of prediction (RMSEP) of 0.05 in the range of 0.20–3.00% (w/w) was achieved. With this method, it is possible to balance the calibration data set with spectra of desired concentrations whenever acquisition of new spectra is no longer possible or improvements in the model's accuracy for a specific selected range are necessary. The inclusion of artificial spectra prior to the use of genetic algorithms improved RMSEP by 10%. This method gave a relative RMSEP improvement of 46% compared with a standard PLS model using the full spectral length.

19.
Near-infrared reflectance spectroscopy (NIRS) is often applied when rapid quantification of major components in feed is required. This technique is preferred over other analytical techniques because of its relatively few sample preparation requirements, high efficiency and low cost of analysis. In this study, NIRS was used to control the content of crude protein, fat and fibre in extracted rapeseed meal produced in a local industrial crushing plant. For modelling the NIR data, the partial least squares (PLS) approach was used. The prediction errors were satisfactory, equal to 1.12, 0.13 and 0.45 (expressed in percentages referring to dry mass) for crude protein, fat and fibre content, respectively. To identify the key spectral regions that are important for modelling, uninformative variable elimination PLS, PLS with jackknife-based variable elimination, PLS with bootstrap-based variable elimination and the orthogonal partial least squares approach were compared for the data studied. They enabled an easier interpretation of the calibration models in terms of absorption bands and led to predictions for test samples similar to those of the initial models.

20.