首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 312 毫秒
1.
In the present study, boosting has been combined with partial least‐squares discriminant analysis (PLS‐DA) to develop a new pattern recognition method called boosting partial least‐squares discriminant analysis (BPLS‐DA). BPLS‐DA is implemented by firstly constructing a series of PLS‐DA models on the various weighted versions of the original calibration set and then combining the predictions from the constructed PLS‐DA models to obtain the integrative results by weighted majority vote. Coupled with near infrared (NIR) spectroscopy, BPLS‐DA has been applied to discriminate different kinds of tea varieties. As comparisons to BPLS‐DA, the conventional principal component analysis, linear discriminant analysis (LDA), and PLS‐DA have also been investigated. Experimental results have shown that the inter‐variety difference can be accurately and rapidly distinguished via NIR spectroscopy coupled with BPLS‐DA. Moreover, the introduction of boosting drastically enhances the performance of an individual PLS‐DA, and BPLS‐DA is a well‐performed pattern recognition technique superior to LDA. Copyright © 2012 John Wiley & Sons, Ltd.  相似文献   

2.
A new regression method based on independent component analysis   总被引:1,自引:0,他引:1  
Shao X  Wang W  Hou Z  Cai W 《Talanta》2006,69(3):676-680
Based on independent component analysis (ICA), a new regression method, independent component regression (ICR), was developed to build the model of NIR spectra and the routine components of plant samples. It is found that ICR and principal component regression (PCR) are completely equivalent when they are applied in quantitative prediction. However, independent components (ICs) can give more chemical explanation than principal components (PCs) because independence is a high-order statistic that is a much stronger condition than orthogonality. Three ICs are obtained by ICA from the NIR spectra of plant samples; it is found that they are strongly correlated to the NIR spectra of water, hydrocarbons and organonitrogen compounds, respectively. Therefore, ICA may be a promising tool to retrieve both quantitative and qualitative information from complex chemical data sets.  相似文献   

3.
Infrared emissions (IREs) of samples of pentaerythritol tetranitrate (PETN) deposited as contamination residues on various substrates were measured to generate models for the detection and discrimination of the important nitrate ester from the emissions of the substrates. Mid‐infrared emissions were generated by heating the samples remotely using laser‐induced thermal emission (LITE). Chemometrics multivariate analysis techniques such as principal component analysis (PCA), soft independent modeling by class analogy (SIMCA), partial least squares‐discriminant analysis (PLS‐DA), support vector machines (SVMs), and neural network (NN) were employed to generate the models for the classification and discrimination of PETN IREs from substrate thermal emissions. PCA exhibited less variability for the LITE spectra of PETN/substrates. SIMCA was able to predict only 44.7% of all samples, while SVM proved to be the most effective statistical analysis routine, with a discrimination performance of 95%. PLS‐DA and NN achieved prediction accuracies of 94% and 88%, respectively. High sensitivity and specificity values were achieved for five of the seven substrates investigated. Copyright © 2015 John Wiley & Sons, Ltd.  相似文献   

4.
In this work, a strategy was proposed to discriminate Polygoni Multiflori Radix (PMR) and its adulterant (Cynanchi Auriculati Radix, CAR). Ultra‐high performance liquid chromatography (UHPLC) fingerprints were established to analyze samples containing PMR, CAR and mixtures simultaneously. Multivariate classification methods were applied to analyze the obtained UHPLC fingerprints, including principal component analysis (PCA), partial least square discriminant analysis (PLS‐DA), soft independent modeling of class analogy (SIMCA), support vector machine discriminant analysis (SVMDA) and counter‐propagation artificial neural network (CP‐ANN). A plot of PCA score showed that PMR and CAR samples belonged to separate clusters (PMR class and CAR class), and samples of mixtures were located near PMR or CAR classes. Analysis by PLS‐DA, SVMDA and CP‐ANN performed well for recognition and prediction in terms of PMR and CAR samples. Moreover, the PLS‐DA method performed best in the detection of adulterated samples, even if the adulterant was about 25%.  相似文献   

5.
In this paper we describe the characteristics and the applications of the multivariate methods for spectroscopic and chromatographic techniques independent component analysis (ICA) and two-dimensional correlation spectroscopy (2DCOS) focused to their use in environmental studies. In our opinion, these methods are important because they allow to characterize environmental samples with different aims and scopes from those generally obtained by means of more common multivariate methods such as principal component analysis (PCA) and partial least squares (PLS). The new insights of these methods in recent environmental studies are reviewed and debated.  相似文献   

6.
The major challenge facing NMR spectroscopic mixture analysis is the overlapping of signals and the arising impossibility to easily recover the structures for identification of the individual components and to integrate separated signals for quantification. In this paper, various independent component analysis (ICA) algorithms [mutual information least dependent component analysis (MILCA); stochastic non‐negative ICA (SNICA); joint approximate diagonalization of eigenmatrices (JADE); and robust, accurate, direct ICA algorithm (RADICAL)] as well as deconvolution methods [simple‐to‐use‐interactive self‐modeling mixture analysis (SIMPLISMA) and multivariate curve resolution‐alternating least squares (MCR‐ALS)] are applied for simultaneous 1H NMR spectroscopic determination of organic substances in complex mixtures. Among others, we studied constituents of the following matrices: honey, soft drinks, and liquids used in electronic cigarettes. Good quality spectral resolution of up to eight‐component mixtures was achieved (correlation coefficients between resolved and experimental spectra were not less than 0.90). In general, the relative errors in the recovered concentrations were below 12%. SIMPLISMA and MILCA algorithms were found to be preferable for NMR spectra deconvolution and showed similar performance. The proposed method was used for analysis of authentic samples. The resolved ICA concentrations match well with the results of reference gas chromatography–mass spectrometry as well as the MCR‐ALS algorithm used for comparison. ICA deconvolution considerably improves the application range of direct NMR spectroscopy for analysis of complex mixtures. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

7.
From the fundamental parts of PLS‐DA, Fisher's canonical discriminant analysis (FCDA) and Powered PLS (PPLS), we develop the concept of powered PLS for classification problems (PPLS‐DA). By taking advantage of a sequence of data reducing linear transformations (consistent with the computation of ordinary PLS‐DA components), PPLS‐DA computes each component from the transformed data by maximization of a parameterized Rayleigh quotient associated with FCDA. Models found by the powered PLS methodology can contribute to reveal the relevance of particular predictors and often requires fewer and simpler components than their ordinary PLS counterparts. From the possibility of imposing restrictions on the powers available for optimization we obtain an explorative approach to predictive modeling not available to the traditional PLS methods. Copyright © 2008 John Wiley & Sons, Ltd.  相似文献   

8.
An NMR and chemometric analytical approach to classify beers according to their brand identity was developed within the European TRACE project (FP6‐2003‐FOOD‐2‐A, contract number: 0060942). Rochefort 8 Trappist beers (47 samples), other Trappist beers (76 samples) and non‐Trappist beers (110 samples) were analyzed by 1H NMR spectroscopy. Selected NMR signals were measured and used to build classification models. Three different classification problems were identified, namely Trappist versus non‐Trappist, Rochefort versus Non‐Rochefort, and Rochefort 8 versus non‐Rochefort 8. In all the three cases, both a discriminant and a modeling approaches were followed, using partial least squares discriminant analysis (PLS‐DA) and soft independent modeling of class analogies (SIMCA), respectively, leading to very high classification accuracy as evaluated by external validation. Information regarding chemical composition was also obtained: Trappist beers contain a higher amount of formic and pyruvic acids and a lower amount of acetic acid and alanine with respect to non‐Trappist ones. Rochefort beers turned out to have also a higher content of propanol and isopentanol with respect to non‐Rochefort samples. Finally, Rochefort 8, shows the highest content of pyruvic acid and the lowest content of gallic, fumaric, acetic acids, adenosine, uridine, 2‐phenylethanol, GABA, and alanine.  相似文献   

9.
A simple and reliable method for simultaneous spectrophotometric determination of iron(II) and cobalt(II) has been established. The method is based on complex formation with 1‐(2‐pyridylazo)‐2‐naphtol (PAN) in a micellar medium. Despite a spectral overlap, Fe2+ and Co2+ have been simultaneously determined with chemometric approaches involving principal component artificial neural network (PC‐ANN), principal component regression (PCR) and partial least squares (PLS). Various synthetic mixtures of iron and cobalt were assessed and the results obtained by the applications of these chemometric approaches were evaluated and compared. It was found that the PC‐ANN method afforded relatively better precision than that of PCR or PLS. The proposed method permits detection limits of 0.05 and 0.07 ng mL?1 for Co and Fe, respectively. The influences of pH, ligand amount, solvent percentage and time on the absorbance were also investigated. The proposed method was also applied satisfactorily for the determination of Fe(II) and Co(II) in real and synthetic samples.  相似文献   

10.
A rapid method was developed and validated by ultra‐performance liquid chromatography–triple quadrupole mass spectroscopy with ultraviolet detection (UPLC‐UV‐MS) for simultaneous determination of paris saponin I, paris saponin II, paris saponin VI and paris saponin VII. Partial least squares discriminant analysis (PLS‐DA) based on UPLC and Fourier transform infrared (FT‐IR) spectroscopy was employed to evaluate Paris polyphylla var. yunnanensis (PPY) at different harvesting times. Quantitative determination implied that the various contents of bioactive compounds with different harvesting times may lead to different pharmacological effects; the average content of total saponins for PPY harvested at 8 years was higher than that from other samples. The PLS‐DA of FT‐IR spectra had a better performance than that of UPLC for discrimination of PPY from different harvesting times.  相似文献   

11.
Water quality data set from the alluvial region in the Gangetic plain in northern India, which is known for high fluoride levels in soil and groundwater, has been analysed by chemometric techniques, such as principal component analysis (PCA), discriminant analysis (DA) and partial least squares (PLS) in order to investigate the compositional differences between surface and groundwater samples, spatial variations in groundwater composition and influence of natural and anthropogenic factors. Trilinear plots of major ions showed that the groundwater in this region is mainly of Na/K-bicarbonate type. PCA performed on complete data matrix yielded six significant PCs explaining 65% of the data variance. Although, PCA rendered considerable data reduction, it could not clearly group and distinguish the sample types (dug well, hand-pump and surface water). However, a visible differentiation between the water samples pertaining to two watersheds (Khar and Loni) was obtained. DA identified six discriminating variables between surface and groundwater and also between different types of samples (dug well, hand pump and surface water). Distinct grouping of the surface and groundwater samples was achieved using the PLS technique. It further showed that the groundwater samples are dominated by variables having origin both in natural and anthropogenic sources in the region, whereas, variables of industrial origin dominate the surface water samples. It also suggested that the groundwater sources are contaminated with various industrial contaminants in the region.  相似文献   

12.
We describe three types of automatic software for the chemometric processing of spectrometric data. The software was developed in the MATLAB working environment and includes data import, mathematical preprocessing, chemometric analysis, and generation of a report file. The software is designed to solve problems regarding identification of some components of multicomponent mixtures, determination of compounds with overlapping signals, and differentiation of samples by their spectral responses. To test the software, we present examples of spectrometric analyses of coffee, fruit juices, and alcoholic beverages using chemometric methods of independent component analysis (ICA) and partial least squares–discriminant analysis (PLS–DA). In particular, we simulated electronic absorption spectra for the identification of three artificial colors (E110, E102, and E122) in alcoholic beverages, NMR spectra for the simultaneous determination of five components (acetic acid, γ-aminobutyric acid, arginine, acetaldehyde, and proline) in orange juice without using reference standards, and NMR spectra of coffee samples to determine its varietal authenticity (Arabica or Robusta). The duration of automatic chemometric processing did not exceed 1 min per sample. The developed software can be optimized for other matrices and/or brands of spectrometers.  相似文献   

13.
A data analysis tool, known as independent component analysis (ICA), is the main focus of this paper. The theory of ICA is briefly reviewed, and the underlying statistical assumptions and a practical algorithm are described. This paper introduces cross validation/jack-knifing and significance tests to ICA. Jack-knifing is applied to estimate uncertainties for the ICA loadings, which also serve as a basis for significance tests. These tests are shown to improve ICA performance, indicating how many components are mixed in the observed data, and also which parts of the extracted sources that contain significant information. We address the issue of stability for the ICA model through uncertainty plots. The ICA performance is compared to principal component analysis (PCA) for two selected applications, a simulated experiment and a real world application.  相似文献   

14.
The aim of this paper is to characterize metabolism disorders in Kunming mice induced by S180 and H22 tumor cells. Metabolic fingerprint based on high performance liquid chromatography‐diode array detector (HPLC‐DAD) was developed to map the disturbed metabolic responses. In vivo testing of the antitumor activity of paclitaxel (Taxol) was carried out by inhibiting the growth of S180 and H22 tumor cells. Based on 27 common peaks, principal component analysis (PCA) and partial least squares‐discriminant analysis (PLS‐DA) were used to distinguish the abnormal from control and to find significant endogenous compounds (SECs) which have significant contributions to classification. The tumor growth inhibition ratios (TIRs) of Taxol groups were used to validate the predictive accuracies of the PLS‐DA models. The predictive accuracies of PLS‐DA models for S180 and H22 tumor model groups were 97.6 and 100%, respectively. Nine (S180) and seven (H22) SECs were discovered, including uric acid and cytidine. In addition, the correlations between relative tumor weights (RTWs) and chromatographic data for the SECs were significant (p < 0.05). Investigations on the stability and precision of the established metabolic fingerprints demonstrate that the experiment is well controlled and reliable. This work shows that the platform of HPLC‐DAD coupled with chemometric methods provides a promising method for the study of metabolism disorders induced by tumor cells. Copyright © 2011 John Wiley & Sons, Ltd.  相似文献   

15.
Some vegetable oils such as canola (CaO), corn (CO), soybean (SO), and walnut (WO) oils have similar color with cod liver oil (CLO), therefore, the presence of these oils was difficult to detect using naked eye. For this reason, Fourier transform infrared (FTIR) spectroscopy using horizontal attenuated total reflectance (HATR) as sampling accessory and in the combination with chemometrics was developed for detection and quantification of these vegetable oils as adulterants in CLO. The quantification of vegetable oils was carried out by using multivariate calibrations of partial least squares (PLS) and principle component regression (PCR), while the classification between pure CLO and CLOs adulterated with CaO, CO, SO, and WO was performed using discriminant analysis (DA). PLS with FTIR normal spectra was more suitable compared with PCR for quantification purposes with coefficient of determination (R2) higher than 0.99 and root mean square error of calibration (RMSEC) in the range of 0.04-0.82% (v/v). The PLS model was further used to predict the levels of these vegetable oils in independent samples for validation/prediction purpose. The root mean square error of prediction (RMSEP) values obtained were of 1.75% (v/v) (CaO), 1.39% (v/v) (CO), 1.35% (v/v) (SO), and 1.37% (v/v) (WO), respectively. The classification using DA revealed that the developed method can classify CLO and that mixed with these vegetable oils using 9 principal components.  相似文献   

16.
独立分量分析预处理法提高苹果糖度模型预测精度研究   总被引:1,自引:0,他引:1  
邹小波  赵杰文 《分析化学》2006,34(9):1291-1294
为了提高苹果近红外光谱糖度预测模型精度,利用独立分量分析方法(ICA)对苹果近红外光谱进行了预处理,并且建立了糖度的偏最小二乘(PLS)预测模型。结果表明,独立分量分析不但能分离出噪声信号,而且所分离出来的光谱信号也比原始光谱信号光滑。在预处理后的最佳PLS糖度模型校正时的相关系数rc和标准偏差SEC分别为0.9549和0.3361,用于预测时的相关系数rp和标准偏差SEP分别为0.9071和0.4355。与普通的平均处理法的PLS模型相比,其精度有所提高,且模型更加简洁。  相似文献   

17.
姜红  陈壮  郝小辉  章欣 《化学通报》2024,87(1):118-121
食品类塑料瓶物证携带许多潜在证据信息,目前针对此类物证的检验研究尚处于探索阶段。利用差分拉曼光谱对46个食品类塑料瓶样品进行检验,依据样品材质及光谱特征峰可将样品分为三类。利用主成分分析(Principal component analysis, PCA)-Fisher判别分析,绘制主成分得分图,构建判别函数,建立分类模型。结果表明,食品类塑料瓶样品具有明显的聚类关系,原始分类与交叉验证分类准确率达到100 %。差分拉曼光谱结合PCA-Fisher判别分析,检验鉴别食品类塑料瓶物证具有一定的科学性。  相似文献   

18.
The performances of three multivariate analysis methods—partial least squares (PLS) regression, secured principal component regression (sPCR) and modified secured principal component regression (msPCR)—are compared and tested for the determination of human serum albumin (HSA), γ-globulin, and glucose in phosphate buffer solutions and blood glucose quantification by near-infrared (NIR) spectroscopy. Results from the application of PLS, sPCR and msPCR are presented, showing that the three methods can determine the concentrations of HSA, γ-globulin and glucose in phosphate buffer solutions almost equally well provided that the prediction samples contain the same spectral information as the calibration samples. On the other hand, when some potential spectral features appear in new measurements, sPCR and msPCR outperform PLS significantly. The reason for this is that such spectral features are not included during calibration, which leads to a degradation in PLS prediction performance, while sPCR and msPCR can improve their predictions for the concentrations of the analytes by removing the uncalibrated features from the original spectra. This point is demonstrated by successfully applying sPCR and msPCR to in vivo blood glucose measurements. This work therefore shows that sPCR and msPCR may provide possible alternatives to PLS in cases where some uncalibrated spectral features are present in measurements used for concentration prediction.  相似文献   

19.
Augustin C?t?lin Mo? 《Talanta》2010,81(3):1010-1002
The present study described reflectance spectroscopy as a suitable analytical tool to discriminate the floral origin of 39 Romanian propolis samples. Relevant differences between the UV-vis reflectance spectra of the investigated propolis samples within the 220-850 nm spectral range were found. The results obtained applying cluster analysis, principal component analysis and linear discriminant analysis to the digitized data of zero order, zero order normalized and first order derivative spectra support the reliability of this technique. In addition, the application of the linear discriminant analysis to the score matrices corresponding to the first principal components appeared to be an illuminating solution. Generally, the samples have been assigned to two large groups in a good agreement with their vegetal sampling location, samples originating from predominant forest area and samples originating from meadows. Within the first group, two subgroups were identified according to the dominant type of the forest, deciduous or resinous, while within the last group three subgroups were found according to the extend and variety of the meadow.  相似文献   

20.
A rapid Raman spectroscopy protocol is reported to classify gasoline according to its distributor and to identify and quantify common adulterants. Gasoline from three distributors was collected from 19 stations in São Paulo, Brazil. Principal component analysis (PCA) showed specific clusters for each distributor, and partial least squares discriminant analysis (PLS-DA) correctly identified the origin of the samples. To evaluate the technique for the identification and quantification of the adulterants, authentic samples from each distributor were fortified at levels from 2.5 up to 25.0% (v/v) using ethanol, methanol, toluene, and turpentine to obtain 120 altered samples. PCA showed clear separation among the samples with the adulterants and PLS-DA precisely identified the adulterants (478 in 480 predictions by cross-validation), irrespective of the distributor and the concentration. One classification model was used to characterize all distributors. To quantify the adulterants, 36 multivariate calibration models were constructed using partial least squares (PLS), interval PLS, and PLS genetic algorithm for each distributor and for each adulterant. Cross-validation errors of less than 5.0% were obtained for all adulterants regardless of the distributor. Raman spectroscopy and multivariate analysis were shown to be powerful for rapid and inexpensive for the characterization of gasoline origin and the identification and quantification of common adulterants.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号