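Selection of the Modeling Calibration Set in Near-Infrared Spectral Analysis
Cite this article: QIN Chong, CHEN Wen-wen, HE Xiong-kui, ZHANG Lu-da, MA Xiang. Selection of the modeling calibration set in near-infrared spectral analysis[J]. Spectroscopy and Spectral Analysis, 2009, 29(10): 2661-2664. DOI: 10.3964/j.issn.1000-0593(2009)10-2661-04
Authors: QIN Chong  CHEN Wen-wen  HE Xiong-kui  ZHANG Lu-da  MA Xiang
Affiliations: College of Science, China Agricultural University, Beijing 100193; Technology Center, Hongta Group, Yuxi, Yunnan 653100
Funding: National Natural Science Foundation of China; National High Technology Research and Development Program of China (863 Program)
Abstract: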
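The concept and method of the maximal linearly independent set are introduced into near-infrared spectral analysis to address the selection of representative samples, that is, calibration-set samples, when building a quantitative analysis model. Using 2652 tobacco powder samples as the experimental material, 1001 samples were randomly chosen to form the prediction set, and the remaining 1651 samples served as the candidate set of representative samples. Matlab was used to find the maximal linearly independent set of the candidate set's spectral matrix, and the corresponding samples were taken as the representative samples that form the calibration set. A prediction model for the total sugar content of tobacco powder was built by PLS regression and applied to the 1001 samples in the prediction set. The results show that when the selected calibration set contains more than 32 samples, every model gives a mean relative error below 4% on the prediction set and a mean correlation coefficient above 0.96. The total sugar predictions obtained from the model built with 32 representative samples and from the model built with 146 representative samples showed no significant difference under a statistical test (α = 0.05), indicating that the maximal linearly independent set method achieves a "few but well-chosen" selection of calibration samples. In addition, the maximal linearly independent set method and random selection were compared by building models from calibration sets of the same sizes (28, 32, 41, 76, 146, and 163 samples); the maximal linearly independent set method gave better prediction performance than random selection.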
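
The selection step can be illustrated with a minimal sketch. It is not the authors' code: the article reports using the Matlab function rref(X, q), and the Python below only imitates the pivot-column search of a tolerance-based row reduction, using the standard library alone. The function name `select_independent_samples`, the layout of one sample spectrum per row, and the absolute tolerance `tol` (standing in for the precision q) are assumptions made for illustration.

```python
def select_independent_samples(spectra, tol):
    """Return indices of samples whose spectra form a maximal linearly
    independent set, judged against the absolute tolerance `tol`.

    `spectra` is a list of rows, one spectrum per sample (assumed layout).
    """
    n_samples = len(spectra)
    n_points = len(spectra[0])
    # Work on the transpose: one column per sample, one row per wavelength,
    # so that pivot columns of the row reduction correspond to samples.
    a = [[float(spectra[s][w]) for s in range(n_samples)]
         for w in range(n_points)]
    selected = []
    row = 0
    for col in range(n_samples):
        if row >= n_points:
            break  # rank cannot exceed the number of wavelengths
        # Partial pivoting: largest remaining entry in this column.
        pivot_row = max(range(row, n_points), key=lambda r: abs(a[r][col]))
        if abs(a[pivot_row][col]) <= tol:
            continue  # sample is (numerically) dependent on earlier picks
        selected.append(col)
        a[row], a[pivot_row] = a[pivot_row], a[row]
        pivot = a[row][col]
        a[row] = [v / pivot for v in a[row]]
        # Eliminate this column from every other row.
        for r in range(n_points):
            if r != row and a[r][col] != 0.0:
                factor = a[r][col]
                a[r] = [x - factor * y for x, y in zip(a[r], a[row])]
        row += 1
    return selected


# Toy data (made up): sample 1 is almost a copy of sample 0.
toy = [[1.0, 0.0, 0.0],
       [1.0, 0.001, 0.0],
       [0.0, 0.0, 1.0]]
tight = select_independent_samples(toy, 1e-6)   # keeps the near-duplicate
loose = select_independent_samples(toy, 0.01)   # drops the near-duplicate
```

In this toy case a looser tolerance treats the near-duplicate spectrum as dependent and returns fewer samples, which mirrors the reported behaviour that the number of representative samples changes with the chosen precision.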

Keywords: near-infrared spectroscopy; representative sample selection; maximal linearly independent set
Received: 2008-09-02

Study on a Method of Selecting Calibration Samples in NIR Spectral Analysis
QIN Chong, CHEN Wen-wen, HE Xiong-kui, ZHANG Lu-da, MA Xiang. Study on a Method of Selecting Calibration Samples in NIR Spectral Analysis[J]. Spectroscopy and Spectral Analysis, 2009, 29(10): 2661-2664. DOI: 10.3964/j.issn.1000-0593(2009)10-2661-04
Authors: QIN Chong  CHEN Wen-wen  HE Xiong-kui  ZHANG Lu-da  MA Xiang
Affiliation: 1. College of Science, China Agricultural University, Beijing 100193, China; 2. Hongta Group R&D Center, Yuxi 653100, China
Abstract:
In this paper, a simple but novel method based on the maximum linearly independent group is introduced into near-infrared (NIR) spectral analysis for selecting representative calibration samples. The experimental material comprised 2652 tobacco powder samples, of which 1001 were randomly selected as the prediction set; the remaining samples formed the candidate set from which the calibration set was selected. Representative samples were chosen by locating the maximum linearly independent vectors among the spectral vectors of the candidate set. The computation was carried out with the function rref(X, q) in Matlab, and the maximum linearly independent spectral vectors were taken as the calibration set. Different values of the calculation precision q yielded different numbers of representative samples. The selected calibration set was used to build a PLS regression model for predicting the total sugar content of tobacco powder samples, and the model was applied to the 1001 samples in the prediction set. With 32 representative samples, the model showed good predictive accuracy, with a mean relative prediction error of 3.6210% and a correlation coefficient of 0.9643. A paired-samples t-test showed no significant difference (α = 0.05) between the predictions of the model built from 32 samples and those of the model built from 146 samples. Random selection of calibration samples was also compared with maximum linearly independent selection in terms of the predictive performance of the resulting models. For this comparison, six calibration sets were selected by each method, containing 28, 32, 41, 76, 146, and 163 samples respectively. Maximum linearly independent selection clearly outperformed random selection.
The results indicate that the proposed method can not only improve the cost-effectiveness of NIR spectral analysis by reducing the number of samples that require laborious and expensive chemical measurement, but also improve analysis accuracy. In conclusion, this method can be applied to select representative samples in near-infrared spectral analysis.
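
The evaluation quantities named in the abstract can be sketched as follows. The abstract does not give its formulas, so this sketch assumes the mean relative error is the average of |predicted - reference| / |reference| expressed in percent, that the correlation coefficient is Pearson's r, and that the paired-samples t-test uses the usual statistic on per-sample differences. All function names and numbers below are hypothetical and use only the Python standard library.

```python
import math


def mean_relative_error(reference, predicted):
    """Average of |predicted - reference| / |reference|, in percent."""
    ratios = [abs(p - r) / abs(r) for r, p in zip(reference, predicted)]
    return 100.0 * sum(ratios) / len(ratios)


def correlation_coefficient(x, y):
    """Pearson correlation between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def paired_t_statistic(first, second):
    """t statistic of a paired-samples test on the differences first - second."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)  # sample variance
    return mean_d / math.sqrt(var_d / n)


# Made-up total sugar values, for illustration only.
reference = [10.0, 20.0, 40.0]
predicted = [11.0, 19.0, 40.0]
mre = mean_relative_error(reference, predicted)
r = correlation_coefficient([1.0, 2.0, 3.0], [3.0, 5.0, 7.0])
t = paired_t_statistic([1.0, 2.0, 3.0, 4.0], [0.0, 2.0, 2.0, 4.0])
```

To compare two models as in the abstract, `paired_t_statistic` would be given the two models' predictions for the same prediction-set samples; with a large prediction set, an absolute value below roughly 1.96 corresponds to no significant difference at the two-sided α = 0.05 level.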
Keywords:NIRS  Representative sample selection  Maximum linearly independent group
This article is indexed in Wanfang Data and other databases.