首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 296 毫秒
1.
基于主成分分析和小波神经网络的近红外多组分建模研究   总被引:5,自引:0,他引:5  
将小麦叶片原始光谱经过预处理后,采用主成分分析(PCA)对数据进行降维,取前3个主成分输入小波神经网络,建立了基于主成分分析和小波神经网络的近红外多组分预测模型(WNN);进一步研究了小波基函数个数的选取(WNN隐层节点数)对小波神经网络模型性能的影响,并将WNN模型与偏最小二乘法(PLS)和传统的反向传播神经网络(BPNN)模型进行了比较.结果表明,所建立的WNN模型能用于同时预测小麦叶片全氮和可溶性总糖两种组分含量,其预测均方根误差(RMSEP)分别为0.101%和0.089%,预测相关系数(R)分别为0.980和0.967.另外,在收敛速度和预测精度上,WNN模型明显优于BPNN和PLS模型,从而为将小波神经网络用于近红外光谱的多组分定量分析奠定了基础.  相似文献   

2.
利用近红外光谱(NIRS)技术对柴胡提取过程中的药效成分进行快速定量分析。共收集126个柴胡提取液样品,采用紫外-可见分光光度法测定总黄酮和多糖的含量,高效液相色谱法(HPLC)测定柴胡皂苷A及柴胡皂苷D的含量,以透射模式采集提取液的近红外光谱,运用偏最小二乘法(PLS)分别建立了近红外光谱与4种药效指标参考值之间的定量校正模型,并采用不同的预处理方法、光谱波段和主因子数对模型进行优化。结果表明,总黄酮、多糖、柴胡皂苷A和柴胡皂苷D 4种定量模型的近红外预测值与参考值之间的拟合性良好,模型预测精度较高,其中预测集相关系数(RP)均大于0.9;预测集误差均方根(RMSEP)分别为3.46 μg/mL、0.743 mg/mL、1.53 μg/mL、0.406 μg/mL;预测集相对偏差(RSEP)分别为1.65%、8.28%、5.74%、7.52%。该研究证实了NIRS结合PLS可成功应用于监测柴胡提取液中药效成分的含量变化,且方法具有快速、准确、无损和环保的特点。  相似文献   

3.
短波近红外光谱技术对葡萄酒中总糖含量快速测定的研究   总被引:2,自引:0,他引:2  
采用短波近红外光谱技术结合偏最小二乘法(PLS),建立了葡萄酒中总糖含量的定量分析数学模型,讨论了光谱预处理方法和主成分数对PLS模型预报精度的影响.应用所建模型对预测集样本中总糖含量进行预报,结果令人满意.该方法方便快捷,并且具有较高的预报精度,可以用于葡萄酒中总糖含量的快速测定.  相似文献   

4.
偏最小二乘(partial least squares,PLS)与广义回归神经网络(generalized regression neural networks,GRNN)联用对土豆样品建立起粗纤维、淀粉、蛋白质含量的预测校正模型,用PLS法将原始数据压缩为主成份,取前3个主成份的12个特征吸收峰输入GRNN网络,网络光滑因子σi为0.1.PLS-GRNN模型对样品3个组分含量的预测决定系数(R2)分别为: 0.945、 0.992、 0.938.结果表明,近红外光谱技术可以快速、准确地同时测定土豆中的粗纤维、淀粉、蛋白质,该方法可应用于果蔬产业的品质管理与控制.  相似文献   

5.
短波近红外光谱法对蛇床子SFE萃取产物的定量分析   总被引:1,自引:0,他引:1  
郭晔  曲楠  王彬  任玉林 《分析试验室》2007,26(11):49-52
利用中药蛇床子CO2超临界萃取(SFE)的萃取物的短波近红外漫反射光谱(800~1100 nm),以HPLC分析值作参比值,采用化学计量学中的偏最小二乘法(PLS)建立短波近红外漫反射光谱与蛇床子SFE萃取物中主要成分蛇床子素和欧前胡素间定量分析数学模型.实现了快速、无损的测定双组分中药的有效成分.讨论了光谱的预处理方法和主成分数对PLS定量预测蛇床子萃取物中蛇床子素和欧前胡素含量能力的影响,并对预测集样品进行预测.  相似文献   

6.
将中红外光谱筛选出的598个纯涤、纯棉及涤/棉混纺样本采用GB/T 2910.11-2009法测定其涤、棉准确含量,其中校正集样本252个,验证集样本346个。使用便携式近红外光谱仪获取样本的原始近红外光谱(NIRS)。校正集样本依据回归系数的分布趋势和范围选取最佳建模谱区,并采用差分一阶导、S-G平滑和均值中心化相结合的方法对原始光谱进行预处理,利用偏最小二乘法(PLS)建立涤/棉混纺织物中涤含量的近红外(NIR)定量分析模型。同时分析了样本颜色对NIRS的影响,探讨了斜线光谱样本、奇异样本和不同组织结构织物对模型预测效果的影响。结果表明:利用PLS法建立的涤/棉混纺织物定量分析模型最优组合包含1个光谱区间和9个主成分因子,校正集相关系数(RC)为0.998,标准偏差(SEC)为0.908。为验证所建模型的有效性和实用性,对346个未参与建模的涤棉样本进行了预测,并将预测结果与国标法测定值进行方差分析,两种方法结果无显著差异,预测正确率达97%以上。模型的建立为废旧涤/棉混纺织物快速、无损分拣提供了基础数据库。  相似文献   

7.
偏最小二乘法测定复方乙酰水杨酸片中的有效成分   总被引:3,自引:0,他引:3  
将偏最小二乘法(PLS)与近红外漫反射光谱法相结合,对复方乙酰水杨酸片进行无损非破坏定量分析.建立了最佳的数学校正模型,比较了样品中3种有效成分(乙酰水杨酸、非那西丁和咖啡因)同时测定和单独测定时的主成分数对PLS定量预测能力的影响,预测了未知样品。3种有效成分同时测定和单独测定建立的PLS模型具有相同的主成分数,PLS预报浓度与参考浓度具有相近的标准偏差,说明用PLS法同时测定3种组分的含量是可行的。  相似文献   

8.
通过偏最小二乘法(partial least squares,PLS)与人工神经网络(artificial neural networks,ANN)联用对鲜乳和掺有植物奶油的牛乳建立识别模型.用PLS法对原始数据进行主成分压缩,采用自组织竞争神经网络建模.取前3个主成分的21个吸收峰值输入网络,学习参数为0.05,网络训练迭代次数为200,模型鉴别准确率达100%.其次建立了植物奶油掺假量的定量检测PLS模型,并采用交互校验和外部检验考察模型的可靠性,模型的校正相关系数为0.996 3,均方估计残差(RMSEC)为0.110;交互校验均方残差(RMSECV)为0.142;应用所建PLS模型对样品中植物奶油添加量进行预测,并对预测值与真值进行配对t检验,结果表明两者差异均不显著.  相似文献   

9.
取原油样品120个,分别按照GB/T 11133-2015和GB/T 17040-2008中所述方法测定了上述原油样品中的水分和硫的含量。通过优化的近红外光谱(NIRS)条件采集了上述原油样品的NIR光谱图。采用杠杆值算法剔除4个异常样品。在建立水分含量分析模型时,采用的条件为:用Savitzky-Golay法对光谱进行滤波预处理,建模光谱区间为6 200~8 200cm-1,主成分数为6,用偏最小二乘回归法(PLS)交叉验证建立分析模型。硫含量分析模型的建立条件为:采用二阶导数-Norris Derivative对光谱进行预处理,建模光谱区间为4 400~4 700cm-1,主成分数为6,用PLS交叉验证建立分析模型。水分和硫含量模型的预测值与测定值的相关性较好。水分模型的决定系数(R2c)为0.989 9,校正标准偏差(RMSEC)为0.084 2,说明其预测效果较好,可用于原油中水分含量的预测。硫含量模型的R2c为0.996 3,RESEC为0.069 6,说明此模型的预测效果也较好,可用原油中硫含量的预测。应用所建立的两个模型对10个未知原油样品中水分和硫含量进行了预测,并与其测定值比较,结果表明两者之间的相对偏差均小于10%。  相似文献   

10.
偏最小二乘与人工神经网络联用对70个饲料样品建立起天门冬氨酸(Asp)、谷氨酸(Glu)、丝氨酸(Ser)和组氨酸(His)4种氨基酸含量的预测校正模型,以样品平行扫描光谱验证校正模型预测的准确性和重现性。用偏最小二乘法将原始数据压缩为主成分,采用单隐层的反向传播网络建模。取前3个主成分的12个数据输入网络,以Kolmogorov定理为依据,经过实验确定中间层的神经元个数为25,初始训练迭代次数为1000。偏最小二乘-反向传播网络模型对样品4个组分含量的预测决定系数(R2)分别为:0.981、0.997、0.979、0.946;样品平行扫描光谱预测值的标准偏差分别为:0.020、0.029、0.017、0.023。本研究为近红外快速检测在组分含量较低的样品实现多组分同时测定提供了思路。  相似文献   

11.
This study compares the performance of partial least squares (PLS) regression analysis and artificial neural networks (ANN) for the prediction of total anthocyanin concentration in red-grape homogenates from their visible-near-infrared (Vis-NIR) spectra. The PLS prediction of anthocyanin concentrations for new-season samples from Vis-NIR spectra was characterised by regression non-linearity and prediction bias. In practice, this usually requires the inclusion of some samples from the new vintage to improve the prediction. The use of WinISI LOCAL partly alleviated these problems but still resulted in increased error at high and low extremes of the anthocyanin concentration range. Artificial neural networks regression was investigated as an alternative method to PLS, due to the inherent advantages of ANN for modelling non-linear systems. The method proposed here combines the advantages of the data reduction capabilities of PLS regression with the non-linear modelling capabilities of ANN. With the use of PLS scores as inputs for ANN regression, the model was shown to be quicker and easier to train than using raw full-spectrum data. The ANN calibration for prediction of new vintage grape data, using PLS scores as inputs, was more linear and accurate than global and LOCAL PLS models and appears to reduce the need for refreshing the calibration with new-season samples. ANN with PLS scores required fewer inputs and was less prone to overfitting than using PCA scores. A variation of the ANN method, using carefully selected spectral frequencies as inputs, resulted in prediction accuracy comparable to those using PLS scores but, as for PCA inputs, was also prone to overfitting with redundant wavelengths.  相似文献   

12.
《Analytical letters》2012,45(9):2073-2083
Abstract

A consensus regression approach based on partial least square (PLS) regression, named as cPLS, for calibrating the NIR data was investigated. In this approach, multiple independent PLS models were developed and integrated into a single consensus model. The utility and merits of the cPLS method were demonstrated by comparing its results with those from a regular PLS method in predicting moisture, oil, protein, and starch contents of corn samples using the NIR spectral data. It was found that cPLS was superior to regular PLS with respect to prediction accuracy and robustness.  相似文献   

13.
The performance of Partial Least Squares regression (PLS) in predicting the output with multivariate cross‐ and autocorrelated data is studied. With many correlated predictors of varying importance PLS does not always predict well and we propose a modified algorithm, Partitioned Partial Least Squares (PPLS). In PPLS the predictors are partitioned into smaller subgroups and the important subgroups with high prediction power are identified. Finally, regular PLS analysis using only those subgroups is performed. The proposed Partitioned PLS (PPLS) algorithm is used in the analysis of data from a real pharmaceutical batch fermentation process for which the process variables follow certain profiles during a specific fermentation period. We observed that PPLS leads to a more accurate prediction of the yield of the fermentation process and an easier interpretation, since fewer predictors are used in the final PLS prediction. In the application important issues such as alignment of the profiles from one batch to another and standardization of the predictors are also addressed. For instance, in PPLS noise magnification due to standardization does not seem to create problems as it might in regular PLS. Finally, PPLS is compared to several recently proposed functional PLS and PCR methods and a genetic algorithm for variable selection. More specifically for a couple of publicly available data sets with near infrared spectra it is shown that overall PPLS has lower cross‐validated error than PLS, PCR and the functional modifications hereof, and is similar in performance to a more complex genetic algorithm. Copyright © 2011 John Wiley & Sons, Ltd.  相似文献   

14.
Regression from high dimensional observation vectors is particularly difficult when training data is limited. Partial least squares (PLS) partly solves the high dimensional regression problem by projecting the data to latent variables space. The key issue in PLS is the computation of weight vector which describes the covariance between the responses and observations. For small-sample-size and high-dimensional regression problem, the covariance estimation is usually inaccurate and the correlated components in the predictors will distort the PLS weight. In this paper, we propose a sparse matrix transform (SMT) based PLS (SMT-PLS) method for high-dimensional spectroscopy regression. In SMT-PLS, the observation data is first decorrelated by SMT. Then, in the decorrelated data space, the PLS loading weight is computed by least squares regression. SMT technique provides an accurate data covariance estimation, which can overcome the effect of small-sample-size and benefit both the PLS weight computation and subsequent regression prediction. The proposed SMT-PLS method is compared, in terms of root mean square errors of prediction, to PLS, Power PLS and PLS with orthogonal scatter correction on four real spectroscopic data sets. Experimental results demonstrate the efficacy and effectiveness of our proposed method.  相似文献   

15.
Two alternative partial least squares (PLS) methods, averaged PLS and weighted average PLS, are proposed and compared with the classical PLS in terms of root mean square error of prediction (RMSEP) for three real data sets. These methods compute the (weighted) average of PLS models with different complexity. The prediction abilities of the alternative methods are comparable to that of the classical PLS but they do not require to determine how many components should be included in the model. They are also more robust in the sense that the quality of prediction depends less on a good choice of the number of components to be included. In addition, weighted average PLS is also compared with the weighted average part of LOCAL, a published method that also applies weighted average PLS, with however an entirely different weighting scheme.  相似文献   

16.
以普通玉米籽粒为试验材料,在应用遗传算法结合偏最小二乘回归法对近红外光谱数据进行特征波长选择的基础上,应用偏最小二乘回归法建立了特征波长测定玉米籽粒中淀粉含量的校正模型.试验结果表明,基于11个特征波长所建立的校正模型,其校正误差(RMSEC)、交叉检验误差(RMSECV)和预测误差(RMSEP)分别为0.30%、0.35%和0.27%,校正数据集和独立的检验数据集的预测值与实际测定值之间的相关系数分别达到0.9279和0.9390,与全光谱数据所建立的预测模型相比,在预测精度上均有所改善,表明应用遗传算法和PLS进行光谱特征选择,能获得更简单和更好的模型,为玉米籽粒中淀粉含量的近红外测定和红外光谱数据的处理提供了新的方法与途径.  相似文献   

17.
18.
The nearest shrunken centroid (NSC) Classifier is successfully applied for class prediction in a wide range of studies based on microarray data. The contribution from seemingly irrelevant variables to the classifier is minimized by the so‐called soft‐thresholding property of the approach. In this paper, we first show that for the two‐class prediction problem, the NSC Classifier is similar to a one‐component discriminant partial least squares (PLS) model with soft‐shrinkage of the loading weights. Then we introduce the soft‐threshold‐PLS (ST‐PLS) as a general discriminant‐PLS model with soft‐thresholding of the loading weights of multiple latent components. This method is especially suited for classification and variable selection when the number of variables is large compared to the number of samples, which is typical for gene expression data. A characteristic feature of ST‐PLS is the ability to identify important variables in multiple directions in the variable space. Both the ST‐PLS and the NSC classifiers are applied to four real data sets. The results indicate that ST‐PLS performs better than the shrunken centroid approach if there are several directions in the variable space which are important for classification, and there are strong dependencies between subsets of variables. Copyright © 2007 John Wiley & Sons, Ltd.  相似文献   

19.
Metal ions such as Co(II), Ni(II), Cu(II), Fe(III) and Cr(III), which are commonly present in electroplating baths at high concentrations, were analysed simultaneously by a spectrophotometric method modified by the inclusion of the ethylenediaminetetraacetate (EDTA) solution as a chromogenic reagent. The prediction of the metal ion concentrations was facilitated by the use of an orthogonal array design to build a calibration data set consisting of absorption spectra collected in the 370-760 nm range from solution mixtures containing the five metal ions earlier. With the aid of this data set, calibration models were built based on 10 different chemometrics methods such as classical least squares (CLS), principal component regression (PCR), partial least squares (PLS), artificial neural networks (ANN) and others. These were tested with the use of a validation data set constructed from synthetic solutions of the five metal ions. The analytical performance of these chemometrics methods were characterized by relative prediction errors and recoveries (%). On the basis of these results, the computational methods were ranked according to their performances using the multi-criteria decision making procedures preference ranking organization method for enrichment evaluation (PROMETHEE) and geometrical analysis for interactive aid (GAIA). PLS and PCR models applied to the spectral data matrix that used the first derivative pre-treatment were the preferred methods. They together with ANN-radial basis function (RBF) and PLS were applied for analysis of results from some typical industrial samples analysed by the EDTA-spectrophotometric method described. DPLS, DPCR and the ANN-RBF chemometrics methods performed particularly well especially when compared with some target values provided by industry.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号