首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
该文构建了玉米秸秆粗蛋白定量分析模型,并对光谱特征波段选取方法进行探讨及验证。首先对107个样本进行预处理,剔除两个异常样本后采用DB2小波缺省阈值4层分解方式进行光谱重构,预处理后粗蛋白模型交互验证决定系数R2CV从0.788 9提高至0.920 8,采用间隔偏最小二乘(IPLS)及其改进型方法后向区间间隔偏最小二乘(BIPLS)、组合间隔偏最小二乘(SIPLS)进行特征波段选取,并对比主成分分析、竞争性自适应重加权采样法、相关系数法、遗传算法、移动窗口最小二乘等结果,发现基于IPLS及其改进型BIPLS、SIPLS均可有效、准确定位特征波段区间,其中采用SIPLS 30 波段间隔在10 128~10 398 cm-1与11 196~11 462 cm-1时具有最优模型,验证集相关系数(rp)为0.978 4,验正集决定系数(R2P)为0.957 2,验正集均方误差根(RMSEP)为0.221 1,相比于其他波段选取方法表现出较好的实时准确性,该方法可为玉米秸秆氨碱化最优条件判定提供重要的数据支撑。  相似文献   

2.
研究了近红外光谱技术快速检测红曲菌固态发酵过程参数水分含量和pH值的可行性。针对传统基于间隔策略波长选择方法忽略非线性因素的缺点,采用一种基于最小二乘支持向量机(Least squares support vector machines,LS-SVM)非线性模型的波长筛选算法:联合区间最小二乘支持向量机(Synergy interval least squares support vector machines,siLS-SVM),并将新算法与相关系数法、iPLS算法、siPLS算法对比。实验结果显示,联合siLS-SVM算法和LS-SVM模型取得了最好的预测效果,水分含量、pH值的预测集相关系数(Rp)分别为0.962 1、0.976 1,预测均方根误差(RMSEP)分别为0.012 9、0.145 2,表明模型具有较好的拟合度和预测性能。应用近红外光谱法进行红曲菌固态发酵过程的水分含量和pH值的快速检测可行,该方法为进一步实现其过程参数的在线检测及发酵条件优化提供了技术基础。  相似文献   

3.
该文以咪唑型离子液体作为原料制备吸附剂富集稀溶液中的木犀草素,利用竞争性自适应权重(CARS)变量筛选的方法建立了一种快速测定木犀草素的近红外光谱分析方法。考察了吸附剂用量、pH值、振荡时间对吸附效果的影响,并探究了吸附剂的吸附能力;富集木犀草素的吸附剂经近红外漫反射光谱检测,采用CARS变量筛选的方法结合偏最小二乘回归(PLS)建立了木犀草素的定量校正模型。结果表明,吸附剂用量为0.15 g、pH值为7、振荡时间为20 min的最佳条件下,吸附率达90.9%,且该吸附符合Langmuir等温吸附模型,最大吸附量为7.1 mg/g。近红外光谱建模中,与未经CARS变量筛选处理作为对照,对比发现经CARS变量筛选的方法结果更优,并采用连续小波变换(CWT)的光谱预处理进行验证,结果表明经CWT处理后,预测残差(RPD)值增大,说明了模型的可靠性。该方法可有效富集稀溶液中的木犀草素,采用CARS变量筛选结合CWT光谱预处理的近红外光谱方法可实现对稀溶液中木犀草素的灵敏、快捷检测。  相似文献   

4.
Near-infrared (NIR) spectroscopy and characteristic variables selection methods were used to develop a quick method for the determination of cellulose, hemicellulose, and lignin contents in Sargassum horneri. Calibration models for cellulose, hemicellulose, and lignin in Sargassum horneri were established using partial least square regression methods with full variables (full-PLSR). The PLSR calibration models were established by four characteristic variables selection methods, including interval partial least square (iPLS), competitive adaptive reweighted sampling (CARS), correlation coefficient (CC), and genetic algorithm (GA). The results showed that the performance of the four calibration models, namely iPLS-PLSR, CARS-PLSR, CC-PLSR, and GA-PLSR, was better than the full-PLSR calibration model. The iPLS method was best in the performance of the models. For iPLS-PLSR, the determination coefficient (R2), root mean square error (RMSE), and residual predictive deviation (RPD) of the prediction set were as follows: 0.8955, 0.8232%, and 3.0934 for cellulose, 0.8669, 0.4697%, and 2.7406 for hemicellulose, and 0.7307, 0.7533%, and 1.9272 for lignin, respectively. These findings indicate that the NIR calibration models can be used to predict cellulose, hemicellulose, and lignin contents in Sargassum horneri quickly and accurately.  相似文献   

5.
A new variable selection algorithm is described, based on ant colony optimization (ACO). The algorithm aim is to choose, from a large number of available spectral wavelengths, those relevant to the estimation of analyte concentrations or sample properties when spectroscopic analysis is combined with multivariate calibration techniques such as partial least-squares (PLS) regression. The new algorithm employs the concept of cooperative pheromone accumulation, which is typical of ACO selection methods, and optimizes PLS models using a pre-defined number of variables, employing a Monte Carlo approach to discard irrelevant sensors. The performance has been tested on a simulated system, where it shows a significant superiority over other commonly employed selection methods, such as genetic algorithms. Several near infrared spectroscopic experimental data sets have been subjected to the present ACO algorithm, with PLS leading to improved analytical figures of merit upon wavelength selection. The method could be helpful in other chemometric activities such as classification or quantitative structure-activity relationship (QSAR) problems.  相似文献   

6.
By employing the simple but effective principle ‘survival of the fittest’ on which Darwin's Evolution Theory is based, a novel strategy for selecting an optimal combination of key wavelengths of multi-component spectral data, named competitive adaptive reweighted sampling (CARS), is developed. Key wavelengths are defined as the wavelengths with large absolute coefficients in a multivariate linear regression model, such as partial least squares (PLS). In the present work, the absolute values of regression coefficients of PLS model are used as an index for evaluating the importance of each wavelength. Then, based on the importance level of each wavelength, CARS sequentially selects N subsets of wavelengths from N Monte Carlo (MC) sampling runs in an iterative and competitive manner. In each sampling run, a fixed ratio (e.g. 80%) of samples is first randomly selected to establish a calibration model. Next, based on the regression coefficients, a two-step procedure including exponentially decreasing function (EDF) based enforced wavelength selection and adaptive reweighted sampling (ARS) based competitive wavelength selection is adopted to select the key wavelengths. Finally, cross validation (CV) is applied to choose the subset with the lowest root mean square error of CV (RMSECV). The performance of the proposed procedure is evaluated using one simulated dataset together with one near infrared dataset of two properties. The results reveal an outstanding characteristic of CARS that it can usually locate an optimal combination of some key wavelengths which are interpretable to the chemical property of interest. Additionally, our study shows that better prediction is obtained by CARS when compared to full spectrum PLS modeling, Monte Carlo uninformative variable elimination (MC-UVE) and moving window partial least squares regression (MWPLSR).  相似文献   

7.
为了能够快速准确地掌握整个昆明地区土壤水解性氮含量的情况,收集963个不同类型的土壤样品,采用竞争自适应重加权采样(Competitive adaptive reweighted sampling,CARS)变量选择方法筛选波长变量,并建立水解性氮的偏最小二乘法(Partial least squares,PLS)分析模型。结果表明,采用CARS方法优选波长变量后,模型参数有所改善,交互验证标准偏差(Root mean square error of cross validation,RMSECV)由31.63降至25.55,交互验证相关系数(Correlation coefficientof cross validation,R_(cv))由0.78提升至0.84,且模型外部验证结果与内部交叉验证结果基本一致。研究结果表明近红外光谱技术结合CARS分法,在大量代表性样品建模下,能够有效建立昆明地区不同土壤类型的水解性氮含量的近红外数学模型,方法可推广应用于土壤其他组分的近红外检测,具有重要的指导意义。  相似文献   

8.
In this paper, we propose a genetic algorithm‐based wavelength selection (GAWLS) method for visible and near‐infrared (Vis/NIR) spectral calibration. The objective of GAWLS is to construct robust and predictive regression models by selecting informative wavelength regions. To demonstrate the ability of the proposed method, regression models for soil properties and sugar content of apples are constructed by using GAWLS and other variable selection methods. Copyright © 2010 John Wiley & Sons, Ltd.  相似文献   

9.
Naturally inspired evolutionary algorithms prove effectiveness when used for solving feature selection and classification problems. Artificial Bee Colony (ABC) is a relatively new swarm intelligence method. In this paper, we propose a new hybrid gene selection method, namely Genetic Bee Colony (GBC) algorithm. The proposed algorithm combines the used of a Genetic Algorithm (GA) along with Artificial Bee Colony (ABC) algorithm. The goal is to integrate the advantages of both algorithms. The proposed algorithm is applied to a microarray gene expression profile in order to select the most predictive and informative genes for cancer classification. In order to test the accuracy performance of the proposed algorithm, extensive experiments were conducted. Three binary microarray datasets are use, which include: colon, leukemia, and lung. In addition, another three multi-class microarray datasets are used, which are: SRBCT, lymphoma, and leukemia. Results of the GBC algorithm are compared with our recently proposed technique: mRMR when combined with the Artificial Bee Colony algorithm (mRMR-ABC). We also compared the combination of mRMR with GA (mRMR-GA) and Particle Swarm Optimization (mRMR-PSO) algorithms. In addition, we compared the GBC algorithm with other related algorithms that have been recently published in the literature, using all benchmark datasets. The GBC algorithm shows superior performance as it achieved the highest classification accuracy along with the lowest average number of selected genes. This proves that the GBC algorithm is a promising approach for solving the gene selection problem in both binary and multi-class cancer classification.  相似文献   

10.
《Analytical letters》2012,45(7):1145-1154
This paper reports the chemometric predictive models developed for near infrared spectroscopy (NIRS) for the quantitative determination of the kinematic viscosity (37.1–93.1 cSt) of lubricant oils for gear motors. The gear motor is a complete motive force system that consists of an electric motor and a reduction gear train integrated into one easy-to-mount and configure package. The method used for measuring the viscosity of the lubricating oil was ASTM D445, the Standard Test Method for Kinematic Viscosity of Transparent and Opaque Liquids. A comparison was made among several multivariate calibration techniques and algorithms for pre-processing and variable selection of data, including partial least squares, interval partial least squares (iPLS), a genetic algorithm (GA), and a successive projections algorithm. Finally, the results obtained for the root mean square errors of prediction in cSt and relative average error were, respectively, 1.86 and 2.97% (GA) and 2.36 and 2.97% (iPLS). The method proposed in this study is a useful alternative for the determination of the kinematic viscosity in oils for gear motors.  相似文献   

11.
Evolutionary factor analysis (EFA) and rank annihilation factor analysis (RAFA) were applied to resolve the two-way equilibrium spectrophotometric data belonging to the complexes of Fe(III), Al(III) and V(V) with morin (3,5,7,20,40-penta hydroxy flavone) as chelating agent in triton X-100 micellar media. Then, partial least square regression combined with genetic algorithm for wavelength selection (GA-PLS) was used for simultaneous determination of the metal ions. The parameters controlling behavior of the system were investigated and optimum conditions were selected. The predictive abilities of partial least squares regression (PLS) and genetic algorithm-partial least squares regression (GA-PLS) were examined in simultaneous determination of ternary mixtures of metal ions over the concentration range of 17.0-170.0ngml(-1), 25.0-180.0ngml(-1) and 40.0-325.0ngml(-1) for Fe(III), Al(III) and V(V), respectively. The relative standard errors for prediction of the ions in synthetic mixtures were lower than 5% and the mean recoveries in the tap water spiked samples were 104.2 and 101.7% for PLS and GA-PLS, respectively.  相似文献   

12.
以普通玉米籽粒为试验材料,在应用遗传算法结合偏最小二乘回归法对近红外光谱数据进行特征波长选择的基础上,应用偏最小二乘回归法建立了特征波长测定玉米籽粒中淀粉含量的校正模型.试验结果表明,基于11个特征波长所建立的校正模型,其校正误差(RMSEC)、交叉检验误差(RMSECV)和预测误差(RMSEP)分别为0.30%、0.35%和0.27%,校正数据集和独立的检验数据集的预测值与实际测定值之间的相关系数分别达到0.9279和0.9390,与全光谱数据所建立的预测模型相比,在预测精度上均有所改善,表明应用遗传算法和PLS进行光谱特征选择,能获得更简单和更好的模型,为玉米籽粒中淀粉含量的近红外测定和红外光谱数据的处理提供了新的方法与途径.  相似文献   

13.
In this work, a comparative study of two novel algorithms to perform sample selection in local regression based on Partial Least Squares Regression (PLS) is presented. These methodologies were applied for Near Infrared Spectroscopy (NIRS) quantification of five major constituents in corn seeds and are compared and contrasted with global PLS calibrations. Validation results show a significant improvement in the prediction quality when local models implemented by the proposed algorithms are applied to large data bases.  相似文献   

14.
It is imperfect to evaluate a subsampling variable selection method using only its prediction performance. To further assess the reliability of subsampling variable selection methods, dummy noise variables of different amplitudes were augmented to the original spectral data, and the false variable selection number was recorded. The reliabilities of three subsampling variable selection methods including Monte Carlo uninformative variable elimination (MC‐UVE), competitive adaptive reweighted sampling (CARS), and stability CARS (SCARS) were evaluated using this dummy noise strategy. The evaluation results indicated that both CARS and SCARS produced more parsimonious variable sets, but the reliabilities of their final variable sets were weaker than those of MC‐UVE. On the contrary, only marginal improvement on the prediction performance was obtained using MC‐UVE. Further experiments showed that removing white noise‐like variables beforehand would improve the reliability of variables extracted by CARS and SCARS. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

15.
利用高光谱技术对培养基上细菌(大肠杆菌、李斯特菌和金黄色葡萄球菌)菌落进行快速识别和分类。采集琼脂培养基上细菌菌落的高光谱反射图像(390~1040 nm),在对波段差图像进行大津阈值分割的基础上自动提取细菌菌落光谱,并建立细菌分类检测的全波长和简化偏最小二乘判别( PLS-DA)模型。全波长模型对预测集样本的分类准确率和置信预测分类准确率分别为100%和95.9%。此外,利用竞争性自适应重加权算法( CARS)、遗传算法( GA)和最小角回归算法( LARS-Lasso)进行波长优选并建立对应简化模型。其中,CARS简化模型在精度、稳定性及分类准确率方面均优于GA和LARS-Lasso简化模型,其对预测集样本的分类准确率和置信预测分类准确率分别达到了100%和98.0%。研究表明,高光谱是一种细菌菌落高精度、快速、无损识别检测的有效方法。简化模型中优选的波长可以为开发低成本检测仪器提供理论依据。  相似文献   

16.
17.
In this work we evaluated the use of different variable selection techniques combined with partial least‐squares regression (PLS) – genetic algorithm PLS (GA‐PLS), interval PLS (iPLS), and synergy interval PLS (siPLS) – in the simultaneous determination of Cd(II), Cu(II), Pb(II) and Zn(II) by anodic stripping voltammetry at a bismuth film. Generally, variable selection provided an improvement in prediction results when compared to full‐voltammogram PLS. The use of interval selection based algorithms have shown to be most adequate than the selection of discrete variables by GA. Excellent analytical performances were obtained despite the inherent complexity of the simultaneous determination.  相似文献   

18.
A new heuristic and parallel simulated annealing algorithm was proposed for variable selection in near‐infrared spectroscopy analysis. The algorithm employs a parallel mechanism to enhance the search efficiency, a heuristic mechanism to generate high‐quality candidate solutions, and the concept of Metropolis criterion to estimate accuracy of the candidate solutions. Several near‐infrared datasets have been evaluated under the proposed new algorithm, with partial least squares leading to improved analytical figures of merit upon wavelength selection. Improved robust and predictive regression models were obtained by the new algorithm. The method could also be helpful in other chemometric activities such as classification or quantitative structure‐activity relationship problems.  相似文献   

19.
基于多光谱特征融合技术的面粉掺杂定量分析方法   总被引:1,自引:0,他引:1  
提出了一种基于拉曼光谱技术(Raman)和激光诱导击穿光谱技术(LIBS)的多光谱特征融合技术(MFFT),利用拉曼光谱中分子组分信息和激光诱导击穿光谱中原子组分信息之间的互补特性,采用自适应小波变换(AWT)-竞争性自适应加权(CARS)-偏最小二乘回归(PLS)建模技术,获取了面粉体系更为全面的特征信息。在多光谱特征融合技术中,首先采用AWT-CARS方法分别提取拉曼光谱和激光诱导击穿光谱中的特征变量,然后将两者的特征变量融合为一个向量,采用PLS方法构建MFFT模型,实现了面粉掺杂物的定量分析。通过对二氧化钛、硫酸铝钾等面粉掺杂体系建模分析,考察MFFT模型的有效性。结果表明,与单一拉曼光谱技术或激光诱导击穿光谱技术建立的预测模型相比,MFFT模型显著提升了模型的预测性能,二氧化钛和硫酸铝钾预测模型的线性相关系数分别从相对较差的Raman模型的0.884、0.877提升到0.981、0.980,其预测均方根误差分别从相对较差的Raman模型的0.151、0.154降低到0.069、0.068。表明多光谱特征融合技术可以准确提取Raman光谱中的分子信息和LIBS光谱中的元素信息,使其互为补充、互为校正,进而有效克服面粉基质对掺杂组分定量分析的干扰,显著提高模型的预测精度。  相似文献   

20.
The potential of near-infrared spectroscopy (NIRS) for screening the inorganic arsenic (i-As) content in commercial rice was assessed. Forty samples of rice were freeze-dried and scanned by NIRS. The i-As contents of the samples were obtained by acid digestion-solvent extraction followed by hydride generation atomic absorption spectrometry, and were regressed against different spectral transformations by modified partial least square (MPLS) regression. The second derivative transformation equation of the raw optical data, previously standardized by applying standard normal variate (SNV) and De-trending (DT) algorithms, resulted in a coefficient of determination in the cross-validation (1-VR) of 0.65, indicative of equations useful for correct separation of the samples in low, medium and high groups. The standard deviation (SD) to standard error of cross-validation (SECV) ratio, expressed in the second derivative equation, was similar to those obtained for other trace metal calibrations reported in NIRS reflectance. Spectral information relating to starch, lipids and fiber in the rice grain, and also pigments in the caryopsis, were the main components used by MPLS for modeling the selected prediction equation. This pioneering use of NIRS to predict the i-As content in rice represents an important reduction in labor input and cost of analysis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号