首页 | 本学科首页   官方微博 | 高级检索  
     

近红外光谱的油页岩总有机碳快速检测
引用本文:李泉伦,陈争光,孙先达. 近红外光谱的油页岩总有机碳快速检测[J]. 光谱学与光谱分析, 2022, 42(6): 1691-1697. DOI: 10.3964/j.issn.1000-0593(2022)06-1691-07
作者姓名:李泉伦  陈争光  孙先达
作者单位:1. 黑龙江八一农垦大学信息与电气工程学院,黑龙江 大庆 163319
2. 东北石油大学“陆相页岩油气成藏及高效开发”教育部重点实验室,黑龙江 大庆 163318
基金项目:国家重点研发计划项目(2016YFD0701300);;黑龙江省博士后科研启动金项目(LBH-Q18028)资助;
摘    要:为了快速检测油页岩总有机碳(TOC)含量,以松辽盆地某区块所取岩芯为研究对象,测量230个岩石样本的TOC含量和近红外光谱数据。利用蒙特卡洛法剔除异常样本14个,剩余的216个样本进行去趋势加基线校正方法预处理,采用连续投影算法(SPA)、无信息变量消除算法以及竞争自适应算法选取特征波长。使用SPXY方法对样本按照2∶1的比例划分为144个校正集和72个验证集,然后建立线性的偏最小二乘(PLS)模型以及非线性的支持向量机(SVM)模型和随机森林(RF)模型对油页岩TOC含量进行预测。采用测定系数(R2)和均方根误差(RMSE)作为模型的评价指标,探究不同特征波长选择方法对油页岩总有机碳建模的影响,比较不同建模方法对油页岩TOC含量预测的准确度。结果表明,特征波长提取能够起到优化模型的作用。SPA,UVE和CARS分别提取了16,253和65个波长,经过特征波长提取后模型测定系数均有提高,均方根误差均有下降,这说明进行特征波长优选对于简化模型、提高模型运算速度发挥着很重要的作用。此外,非线性的RF和SVM模型性能要优于线性模型PLS。这是因为油页岩中的碳存在于各类烃的中,不同类别含烃基团的吸收峰之间相互影响,使得油页岩总有机碳含量和近红外光谱数据之间存在着复杂的非线性关系,因此,非线性的SVM和RF模型能够表现出更好的效果。相比于其他模型,CARS-SVM模型验证集的测定系数(R2v)和均方根误差(RMSEV)表现出的结果较好,分别达到了0.906 6和0.222 0,该模型能够用于油页岩总有机碳含量的快速检测。研究结果说明,近红外光谱分析应用于油页岩TOC含量快速检测是可行的;建立的CARS-SVM模型能够表现出较好的预测效果,为我国油页岩TOC含量快速检测提供了一种新的方法和思路。

关 键 词:近红外  油页岩总有机碳  特征波长  支持向量机  随机森林  
收稿时间:2021-05-05

Rapid Detection of Total Organic Carbon in Oil Shale Based on Near Inf rared Spectroscopy
LI Quan-lun,CHEN Zheng-guang,SUN Xian-da. Rapid Detection of Total Organic Carbon in Oil Shale Based on Near Inf rared Spectroscopy[J]. Spectroscopy and Spectral Analysis, 2022, 42(6): 1691-1697. DOI: 10.3964/j.issn.1000-0593(2022)06-1691-07
Authors:LI Quan-lun  CHEN Zheng-guang  SUN Xian-da
Affiliation:1. College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, China2. Key Laboratory of Continental Shale Hydrocarbon Accumulation and Efficient Development, Ministry of Education, Northeast Petroleum University, Daqing 163318, China
Abstract:To quickly detect the total organic carbon (TOC) content of oil shale, the TOC content and near-infrared spectrum data of 230 rock samples were measured in a certain block of the Songliao basin. The Monte Carlo method eliminates 14 abnormal samples, and the remaining 216 samples are pretreated by the method of detrended and baseline correction. The feature wavelength is selected by successive projections algorithm (SPA), uninformative variable elimination(UVE) algorithm and competitive adaptive reweighted sampling(CARS) method. The SPXY method divides the sample set into calibration set(144 samples) and validation set(72 samples) according to the ratio of 2∶1. Then linear partial least squares (PLS) model, nonlinear support vector machine (SVM) model and random forest (RF) model are adopted to predict the TOC content of oil shale. The determination coefficient (R2) and root mean square error (RMSE) was used as the evaluation indexes of the model to explore the influence of different characteristic wavelength selection methods on TOC modeling of oil shale and to compare the accuracy of different modeling methods on TOC content prediction of oil shale. The results show that the feature wavelength extraction can optimize the model. SPA, UVE and CARS extract 16, 253 and 65 wavelength points respectively. After the feature wavelength extraction, the model determination coefficient is improved, and the root means square error is decreased. This shows that the feature wavelength extraction plays an important role in simplifying the model and improving model efficiency. In addition, The performance of the nonlinear RF and SVM model is better than that of the linear PLS model. The reason is that the carbon in oil shale exists in all kinds of hydrocarbons, and the absorption peaks of different hydrocarbon groups interact with each other, which makes the complex nonlinear relationship between the TOC content of oil shale and the near-infrared spectroscopy data. Therefore, the nonlinear SVM and RF model can show better performance. Compared with other models, the coefficient of determination (R2v) and root mean square error (RMSEV) of the CARS-SVM model invalidation set show better results, reaching 0.906 6 and 0.222 0 respectively. This model can be used to rapidly detect TOC content in oil shale. The results of this study show that the application of near-infrared spectroscopy in the rapid detection of TOC content in oil shale is feasible, and the CARS-SVM model can show good prediction performance, which provides a new method and idea for the rapid detection of TOC content in oil shale in China.
Keywords:Near-infrared  Total organic carbon in oil shale  Characteristic wavelength  Support vector machine  Random forest  
本文献已被 万方数据 等数据库收录!
点击此处可从《光谱学与光谱分析》浏览原始摘要信息
点击此处可从《光谱学与光谱分析》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号