首页 | 本学科首页   官方微博 | 高级检索  
     

正交匹配追踪算法的近红外光谱定量分析
引用本文:李四海,刘东玲. 正交匹配追踪算法的近红外光谱定量分析[J]. 光谱学与光谱分析, 2021, 41(4): 1097-1101. DOI: 10.3964/j.issn.1000-0593(2021)04-1097-05
作者姓名:李四海  刘东玲
作者单位:甘肃中医药大学信息工程学院,甘肃 兰州 730000;甘肃中医药大学药学院,甘肃 兰州 730000
基金项目:国家自然科学基金项目(81603407);甘肃省自然科学基金项目(1506RJZA046);兰州市科技计划项目(2018-3-41);甘肃省高校中(藏)药化学与质量研究省级重点实验室开放基金项目(zzy-2018-05)资助。
摘    要:压缩感知(CS)是一种新兴的信号压缩和采样技术,正交匹配追踪(OMP)是一种贪婪追踪算法,广泛用于压缩感知领域中的稀疏信号重构.针对近红外光谱信号高维小样本以及信号稀疏先验的特点,为进一步提高小样本近红外光谱变量选择的灵活性和可靠性,基于压缩感知理论,提出了一种新颖的光谱变量选择方法正交匹配追踪变量选择(OM PBVS...

关 键 词:近红外光谱  变量选择  压缩感知  偏最小二乘  正交匹配追踪
收稿时间:2020-03-02

Quantitative Analysis of Near Infrared Spectroscopy Based on Orthogonal Matching Pursuit Algorithm
LI Si-hai,LIU Dong-ling. Quantitative Analysis of Near Infrared Spectroscopy Based on Orthogonal Matching Pursuit Algorithm[J]. Spectroscopy and Spectral Analysis, 2021, 41(4): 1097-1101. DOI: 10.3964/j.issn.1000-0593(2021)04-1097-05
Authors:LI Si-hai  LIU Dong-ling
Affiliation:1. College of Information Engineering, Gansu University of Chinese Medicine, Lanzhou 730000, China2. School of Pharmacy, Gansu University of Chinese Medicine, Lanzhou 730000, China
Abstract:Compressed sensing(CS)is a new technology of signal compression and sampling.Orthogonal Matching Pursuit(OMP),a greedy tracking algorithm,is widely used in sparse signal reconstruction in the compressed sensing field.In connection with the characteristics of high-dimensional small samples of near-infrared spectra signals and sparse prior signals,a novel near-infrared spectra variable selection method named Orthogonal Matching Pursuit Based Variable Selection(OMPBVS)is proposed,based on the compressed sensing theory,to further improve the flexibility and reliability of near-infrared spectra variable selection.By sparse reconstruction of the original spectral signal,OMPBVS can compress the regression coefficient of most variables to zero,and then indirectly realize the selection of spectral variables.In the specific process,the spectral matrix is adopted as the sensing matrix,the predictive variable as the observation variable and iteratively calculated residual and the inner product of the atom,and the inner product of the largest atom is chosen.During each iteration,the signal is projected onto the subspace spanned by all selected atoms,and then the coefficients are updated for all the selected atoms,enabling the residual error and all the selected atoms to be orthogonal.With the residual calculation to be the essence of Grammar-Schmidt Orthogonalization,the orthogonal projection can reduce the number of iterations and ensure the accuracy of signal reconstruction.OMPBVS can reduce the spectral dimension to the sample size scale,and its variable selection capability is comparable to LASSO.However,compared with LASSO,the optimization method of OMPBVS loss function is a forward selection algorithm,which reduces the number of iterations and can precisely control the number of selected variables.Variable selection experiments were performed on the beer dataset and Wheat kernels dataset to compare the performance of six variable selection methods:PLS,MCUVE,CARS,WMSCVS,LASSOLarsCV,and OMPBVS.There were 60 samples in the beer dataset,36 samples of the training set and 24 samples of the test set were divided by Kennard Stone(KS)method,and the prediction variable was Original extract concentration.The Wheat kernels data set consisted of 523 samples,415 training samples,and 108 test samples.The predicted value was protein content.The OMPBVS method selects the number of variables,RMSEC and RMSEP from the beer dataset as 2,0.2052 and 0.1598,respectively.When on the Wheat kernels data set,the number of selected variables,RMSEC and RMSEP were 9,0.4502,and 0.4125,respectively,and the variable selection ability and model performance was better than the other five methods,indicating that OMPBVS is an effective NIR spectral variable selection and quantitative analysis method.OMPBVS variable selection method has good generalization ability in the case of small samples,which can reduce the number of selected variables and improve the robustness of variable selection.Besides,spectral preprocessing methods based on SNV and MSC can reduce the number of selected variables to a certain extent and improve the interpretability of the model.
Keywords:Near infrared spectroscopy  Variable selection  Compressed sensing  Partial Least squares  Orthogonal matching pursuit
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《光谱学与光谱分析》浏览原始摘要信息
点击此处可从《光谱学与光谱分析》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号