首页 | 本学科首页   官方微博 | 高级检索  
     检索      

潜变量机器学习方法在咖啡NIR定量分析中的应用
引用本文:陈华舟,许丽莉,乔涵丽,洪绍勇.潜变量机器学习方法在咖啡NIR定量分析中的应用[J].光谱学与光谱分析,2021,41(5):1441-1445.
作者姓名:陈华舟  许丽莉  乔涵丽  洪绍勇
作者单位:1. 桂林理工大学理学院,广西 桂林 541004
2. 大数据处理与算法技术研究中心(桂林理工大学),广西 桂林 541004
3. 北部湾大学海洋学院,广西 钦州 535011
4. 广州华商学院数据科学学院,广东 广州 511300
基金项目:国家自然科学基金项目(61505037);广西自然科学基金项目(2018GXNSFAA050045);广东省普通高校青年创新人才类项目(2019KQNCX213);广东省普通高校创新团队项目(2020WCXTD008)资助。
摘    要:采用近红外(NIR)光谱快检技术实现对咖啡蛋白质的定量检测,研究支持向量机(SVM)和极限学习机(ELM)等机器学习方法在建模分析中的实用性。结合潜变量分析技术,建立潜变量SVM(LV-SVM)模型和潜变量ELM(LV-ELM)模型,通过调试潜变量个数和机器学习关键参数的联合优选,实现数据降维和机器学习关键参数的同过程优化。运用定标-验证-测试机制,利用定标集样本建立咖啡蛋白质的NIR分析模型,随参数变动形成三维随动优选结构的建模预测结果,结合验证集样本对模型进行联合优选,然后将优化模型应用于测试集样本进行模型评价。LV-SVM建模优选的验证集预测均方根误差为6.797,对应的测试集预测均方根误差为8.384。LV-ELM建模优选的验证集预测均方根误差为6.118,对应的测试集预测均方根误差为7.837。与常规偏最小二乘(PLS)方法相比较,LV-SVM和LV-ELM方法均取得更好的预测结果,验证了潜变量机器学习方法在近红外定量分析中的应用优势,该方法有望应用于不同类型的咖啡各成分含量检测。

关 键 词:NIR光谱  咖啡  蛋白质  SVM  ELM  潜变量技术  
收稿时间:2020-06-23

Latent Variable Machine Learning Methods Applied for NIR Quantitative Analysis of Coffee
CHEN Hua-zhou,XU Li-li,QIAO Han-li,HONG Shao-yong.Latent Variable Machine Learning Methods Applied for NIR Quantitative Analysis of Coffee[J].Spectroscopy and Spectral Analysis,2021,41(5):1441-1445.
Authors:CHEN Hua-zhou  XU Li-li  QIAO Han-li  HONG Shao-yong
Institution:1. College of Science, Guilin University of Technology, Guilin 541004, China 2. Center for Data Analysis and Algorithm Technology, Guilin University of Technology, Guilin 541004, China 3. College of Marine Sciences, Beibu Gulf University, Qinzhou 535011, China 4. School of Data Science, Guangzhou Huashang College, Guangzhou 511300, China
Abstract:Near-infrared(NIR)spectroscopy rapid detection technology was used to determine protein content in instant coffee.Support vector machine(SVM)and extreme learning machine(ELM)was applied for validating their practicality in modeling analysis.We proposed the latent variable SVM(LV-SVM)and latent variable ELM(LV-ELM)methods combined with latent variable analysis technique.Thetuning of latent variables and the optimization of the key parameters in machines were joint in-one so that the data dimension reduction and the selection of machine parameters can be both accomplished in one single modeling process.The calibrating-validating-testing mechanism was used for sample division.The NIR analytical models were trained based on the calibrating sample set.The model prediction results were generated and saved as a 3 Dbox as they were determined by the simultaneous tuning of the latent variable and the machine parameter.Then the joint optimization of model parameters was selected in the way of predicting the validating samples.Further,the optimal model was evaluated by the testing samples.The optimal LV-SVM model gave the validating root mean square error as 6.797;the corresponding testing root mean square error as 8.384.The optimal LV-ELM model obtained the validating root mean square error as 6.118.The corresponding testing root means square error as 7.837.Compared with the common partial least square method,the LV-SVM and LV-ELM methods have better prediction results,which verified the application advantages of the latent variable machine learning method in near-infrared quantitative analysis.This proposed method is expected for further application in content detection of other kinds of coffee.
Keywords:NIR spectroscopy  Coffee  Protein  SVM  ELM  Latent variable technique
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《光谱学与光谱分析》浏览原始摘要信息
点击此处可从《光谱学与光谱分析》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号