首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 24 毫秒
1.
In recent 10 years, like other disciplines influenced by the fast development of PC technique, chemometrics has been used in many analytical methods, especially in instrumental analysis. This article describes applications and comparison of multivariate linear regression (MLR), principal component analysis (PCA), principal component regression (PCR), partial least square (PLS), neural network (ANN), fuzzy and model recognition. A better calibration method can be a great help to improve the efficiency of the routine analytical work.  相似文献   

2.
倪永年  黄春芳 《分析化学》2002,30(8):994-999
评述了化学计量学方法在生产过程分析中各个方面 ,如过程优化、过程模拟、仪器及仪器校正、过程监测等方面的应用 ,并展望了化学计量学在过程分析中的应用前景  相似文献   

3.
支持向量机分类和回归用于肽的QSAR研究   总被引:4,自引:0,他引:4  
周鹏  曾晖  李波  周原  李志良 《化学通报》2006,69(5):342-346
使用支持向量机技术对两类肽化合物体系进行了分类和回归研究,并将其系统地与K最邻近法、多元线性回归、偏最小二乘、人工神经网络进行了比较。结果表明,对于小样本、非线性问题,支持向量机具有较强的稳定性能及泛化能力,在大多数情况下能够得到优于传统方法的建模效果。对于分类问题,支持向量机对训练集和测试集都达到了100%的分类正确率;对于回归问题,支持向量机虽对训练集样本拟合效果略低于人工神经网络,但对外部测试集却表现出较强的预测能力。  相似文献   

4.
This paper reports the results of a rapid method to determine sucrose in chocolate mass using near infrared spectroscopy (NIRS). We applied a broad-based calibration approach, which consists in putting together in one single calibration samples of various types of chocolate mass. This approach increases the concentration range for one or more compositional parameters, improves the model performance and requires just one calibration model for several recipes. The data were modelled using partial least squares (PLS) and multiple linear regression (MLR). The MLR models were developed using a variable selection based on the coefficient regression of PLS and genetic algorithm (GA). High correlation coefficients (0.998, 0.997, 0.998 for PLS, MLR and GA-MLR, respectively) and low prediction errors confirms the good predictability of the models. The results show that NIR can be used as rapid method to determine sucrose in chocolate mass in chocolate factories.  相似文献   

5.
This study compares the performance of partial least squares (PLS) regression analysis and artificial neural networks (ANN) for the prediction of total anthocyanin concentration in red-grape homogenates from their visible-near-infrared (Vis-NIR) spectra. The PLS prediction of anthocyanin concentrations for new-season samples from Vis-NIR spectra was characterised by regression non-linearity and prediction bias. In practice, this usually requires the inclusion of some samples from the new vintage to improve the prediction. The use of WinISI LOCAL partly alleviated these problems but still resulted in increased error at high and low extremes of the anthocyanin concentration range. Artificial neural networks regression was investigated as an alternative method to PLS, due to the inherent advantages of ANN for modelling non-linear systems. The method proposed here combines the advantages of the data reduction capabilities of PLS regression with the non-linear modelling capabilities of ANN. With the use of PLS scores as inputs for ANN regression, the model was shown to be quicker and easier to train than using raw full-spectrum data. The ANN calibration for prediction of new vintage grape data, using PLS scores as inputs, was more linear and accurate than global and LOCAL PLS models and appears to reduce the need for refreshing the calibration with new-season samples. ANN with PLS scores required fewer inputs and was less prone to overfitting than using PCA scores. A variation of the ANN method, using carefully selected spectral frequencies as inputs, resulted in prediction accuracy comparable to those using PLS scores but, as for PCA inputs, was also prone to overfitting with redundant wavelengths.  相似文献   

6.
The pharmaceutical industry faces increasing regulatory pressure to optimize quality control. Content uniformity is a basic release test for solid dosage forms. To accelerate test throughput and comply with the Food and Drug Administration's process analytical technology initiative, attention is increasingly turning to nondestructive spectroscopic techniques, notably near-infrared (NIR) spectroscopy (NIRS). However, validation of NIRS using requisite linearity and standard error of prediction (SEP) criteria remains a challenge. This study applied wavelet transformation of the NIR spectra of a commercial tablet to build a model using conventional partial least squares (PLS) regression and an artificial neural network (ANN). Wavelet coefficients in the PLS and ANN models reduced SEP by up to 60% compared to PLS models using mathematical spectra pretreatment. ANN modeling yielded high-linearity calibration and a correlation coefficient exceeding 0.996.  相似文献   

7.
通过对部分含氧化合物(醇、酯、醛、酮)在不同固定相不同柱温下的849个样本的气相色谱保留指数值(RI)与其部分参数:拓扑指数(mQ)、定位基参数(Sox)、固定液极性值(CP)及柱温(T)建立定量结构-色谱保留相关(QSRR)模型。分别利用多元线性回归(MLR)、偏最小二乘回归(PLSR)、人工神经网络(ANN)建模,同时采用内部及外部双重验证的办法对所得模型稳定性能进行深入分析和检验,建模计算值、留一法(LOO)交互检验(CV)预测值和外部样本预测值的复相关系数Rcum、QLOO和Rext分别为0.9832、0.9829和0.9836(MLR);0.9832、0.9830和0.9836(PLSR);0.9910、0.9909和0.9900(ANN)。结果表明:所建定量结构保留关系(QSRR)模型具有良好的稳定性和预测能力,较好地揭示了含氧化合物(醇、酯、醛、酮)在不同色谱条件下气相色谱保留指数的变化规律。  相似文献   

8.
通过对184个烯烃类化合物在不同固定相不同柱温下的617个样本的气相色谱保留指数值(RI)与其部分参数:拓扑指数(mQ)、偶极矩(DPL)、固定液极性值(CP)及柱温(T)建立定量-色谱保留相关(QSRR)模型.分别利用多元线性回归(MLR)、偏最小二乘回归(PLSR)、人工神经网络(ANN)建模,同时采用内部及外部双重验证的办法对所得模型稳定性能进行深入分析和检验,建模计算值、留一法(LOO)交互检验(CV)预测值和外部样本的复相关系数Rcum,QLOO和Rext分别为0.999 2,0.998 4和0.999 2(MLR);0.999 0,0.998 0和0.999 1(PLSR);0.999 4,0.998 7和0.999 2(ANN).结果表明:所建定量结构保留关系(QSRR)模型具有良好的稳定性和预测能力,较好地揭示了烯烃类化合物在不同固定相不同柱温上气相色谱保留指数的变化规律.  相似文献   

9.
A novel method named a wavelet packet transform based Elman recurrent neural network (WPTERNN) was proposed for the simultaneous UV–visible spectrometric determination of Cu(II), Cd(II) and Zn(II). This method combined wavelet packet denoising with an Elman recurrent neural network. A wavelet packet transform was applied to perform data compression, to extract relevant information, and to eliminate noise and collinearity. An Elman recurrent network was applied for nonlinear multivariate calibration. In this case, using trials, the kind of wavelet function, the decomposition level, and the number of hidden nodes for the WPTERNN method were selected as Daubechies 14, 3, and 8, respectively. A program (PWPTERNN) was designed that could perform the simultaneous determination of Cu(II), Cd(II) and Zn(II). The relative standard errors of prediction (RSEP) obtained for all components using WPTERNN, a Elman recurrent neural network (ERNN), partial least squares (PLS), principal component regression (PCR), Fourier transform based PCR (FTPCR), and multivariate linear regression (MLR) were compared. Experimental results demonstrated that the WPTERRN method was successful even where there was severe overlap of spectra. The results obtained from an additional test case also demonstrated that the WPTERNN method performed very well. Figure The part of WP coefficients obtained by wavelet packet transforms  相似文献   

10.
11.
In this work it has been shown that the routine ASTM methods (ASTM 4052, ASTM D 445, ASTM D 4737, ASTM D 93, and ASTM D 86) recommended by the ANP (the Brazilian National Agency for Petroleum, Natural Gas and Biofuels) to determine the quality of diesel/biodiesel blends are not suitable to prevent the adulteration of B2 or B5 blends with vegetable oils. Considering the previous and actual problems with fuel adulterations in Brazil, we have investigated the application of vibrational spectroscopy (Fourier transform (FT) near infrared spectrometry and FT-Raman) to identify adulterations of B2 and B5 blends with vegetable oils. Partial least square regression (PLS), principal component regression (PCR), and artificial neural network (ANN) calibration models were designed and their relative performances were evaluated by external validation using the F-test. The PCR, PLS, and ANN calibration models based on the Fourier transform (FT) near infrared spectrometry and FT-Raman spectroscopy were designed using 120 samples. Other 62 samples were used in the validation and external validation, for a total of 182 samples. The results have shown that among the designed calibration models, the ANN/FT-Raman presented the best accuracy (0.028%, w/w) for samples used in the external validation.  相似文献   

12.
Different calibration techniques are available for spectroscopic applications that show nonlinear behavior. This comprehensive comparative study presents a comparison of different nonlinear calibration techniques: kernel PLS (KPLS), support vector machines (SVM), least-squares SVM (LS-SVM), relevance vector machines (RVM), Gaussian process regression (GPR), artificial neural network (ANN), and Bayesian ANN (BANN). In this comparison, partial least squares (PLS) regression is used as a linear benchmark, while the relationship of the methods is considered in terms of traditional calibration by ridge regression (RR). The performance of the different methods is demonstrated by their practical applications using three real-life near infrared (NIR) data sets. Different aspects of the various approaches including computational time, model interpretability, potential over-fitting using the non-linear models on linear problems, robustness to small or medium sample sets, and robustness to pre-processing, are discussed. The results suggest that GPR and BANN are powerful and promising methods for handling linear as well as nonlinear systems, even when the data sets are moderately small. The LS-SVM is also attractive due to its good predictive performance for both linear and nonlinear calibrations.  相似文献   

13.
In the present study, different multivariate regression techniques have been applied to two large near-infrared data sets of feed and feed ingredients in order to fulfil the regulations and laws that exist about the chemical composition of these products. The aim of this paper was to compare the performances of different linear and nonlinear multivariate calibration techniques: PLS, ANN and LS-SVM. The results obtained show that ANN and LS-SVM are very powerful methods for non-linearity but LS-SVM can also perform quite well in the case of linear models. Using LS-SVM an improvement of the RMS for independent test sets of 10% is obtained in average compared to ANN and of 24% compared to PLS.  相似文献   

14.
15.
Quantitative structure-activity relationship (QSAR) studies based on chemometric techniques are reviewed. Partial least squares (PLS) is introduced as a novel robust method to replace classical methods such as multiple linear regression (MLR). Advantages of PLS compared to MLR are illustrated with typical applications. Genetic algorithm (GA) is a novel optimization technique which can be used as a search engine in variable selection. A novel hybrid approach comprising GA and PLS for variable selection developed in our group (GAPLS) is described. The more advanced method for comparative molecular field analysis (CoMFA) modeling called GA-based region selection (GARGS) is described as well. Applications of GAPLS and GARGS to QSAR and 3D-QSAR problems are shown with some representative examples. GA can be hybridized with nonlinear modeling methods such as artificial neural networks (ANN) for providing useful tools in chemometric and QSAR.  相似文献   

16.
《中国化学会会志》2018,65(5):567-577
Calpeptin analogs show anticancer properties with inhibition of calpain. In this work, we applied a quantitative structure–activity relationship (QSAR) model on 34 calpeptin derivatives to select the most appropriate compound. QSAR was employed to generate the models and predict the more significant compounds through a series of calpeptin derivatives. The HyperChem, Gaussian 09, and Dragon software programs were used for geometry optimization of the molecules. The 2D and 3D molecular structures were drawn by ChemDraw (Ultra 16.0) and Chem3D (Pro16.0) software. The Unscrambler program was used for the analysis of data. Multiple linear regression (MLR‐MLR), partial least‐squares (MLR‐PLS1), principal component regression (MLR‐PCR), a genetic algorithm‐artificial neural networks (GA‐ANN), and a novel similarity analysis‐artificial neural network (SA‐ANN) method were used to create QSAR models. Among the three MLR models, MLR‐MLR provided better statistical parameters. The R2 and RMSE of the prediction were estimated as 0.8248 and 0.26, respectively. Nevertheless, the constructed model using GA‐ANN revealed the best statistical parameters among the studied methods (R2 test = 0.9643, RMSE test = 0.0155, R2 train = 0.9644, RMSE train = 0.0139). The GA‐ANN model is found to be the most favorable method among the statistical methods and can be employed for designing new calpeptin analogs as potent calpain inhibitors in cancer treatment.  相似文献   

17.
In this paper, two spectral data sets have been used to illustrate the importance of maintaining chemical information whilst generating predictive multivariate calibration models. The first data set is based on 26 duplicate UV/VIS spectra for four meal ions (Fe, Ni, Co, Cu) present at varying concentrations in aqueous solution. Spectra were collected across the range 180–800 nm at a resolution of 3.5 nm generating 211 data points for each sample. Calibration was carried out using multiple linear regression (MLR) and a K-matrix approach to demonstrate the advantages the latter method has in describing real spectral features. In addition, the limitation of MLR in accommodating noise and spectral overlap in the data is also illustrated. The second data set based on NIR spectroscopy, was generated using a four-level 2 factor Factorial design strategy and consisted of two additives present at a range of concentrations in an aqueous caustic system, with the spectra being collected over the range 10,000–3000 cm−1. Whilst a conventional partial least squares (PLS) model was applied to the data, it was through the use of variable selection (VS) prior to PLS and the application of weighted ridge regression (WRR) techniques that the need to develop chemometric methodology which intuitively reflected chemical information has been demonstrated. The results will also illustrate how a poorly designed experimental design protocol and missing data can limit the performance of the calibration models generated. The aims of this paper are not to prescribe ideal calibration methodology but rather to demonstrate the relevance of selecting multivariate calibration methodology that relates more to the chem rather than just the metrics in chemometrics.  相似文献   

18.
傅里叶变换红外光声光谱法测定土壤中有效磷   总被引:3,自引:0,他引:3  
杜昌文  周健民 《分析化学》2007,35(1):119-122
以中国科学院封丘生态实验站长期定位实验区的土样为材料(68样),利用傅里叶转换红外光声光谱测定土壤有效磷:以Olsen-P为因变量,通过傅里转换红外光声光谱构建偏最小二乘法和人工神经网络模型,利用模型进行预测。结果表明,偏最小二乘法模型的相关系数(R2)为0.96,校正标准偏差为1.79mg/kg,验证标准偏差为5.25mg/kg;人工神经网络模型的校正系数为0.84,校正标准偏差为2.40mg/kg,验证标准偏差为5.43mg/kg。两种模型均可以用于土壤有效磷的预测,且偏最小二乘模型优于人工神经网络模型。该方法的特点是无需样品前处理,且测定对样品无破坏,为土壤有效磷的快速测定提供新的手段。  相似文献   

19.
A novel method named OSC-WPT-PLS approach based on partial least squares (PLS) regression with orthogonal signal correction (OSC) and wavelet packet transform (WPT) as pre-processed tools was proposed for the simultaneous spectrophotometric determination of Al(III), Mn(II) and Co(II). This method combines the ideas of OSC and WPT with PLS regression for enhancing the ability of extracting characteristic information and the quality of regression. OSC is used to remove information in the response matrix D by subtracting the structured noise that is orthogonal to the concentration matrix C. Wavelet packet transform was applied to perform data compression, to extract relevant information, and to eliminate noise and collinearity. PLS was applied for multivariate calibration and noise reduction by eliminating the less important latent variables. In this case, using trials, the kind of wavelet function, the decomposition level, the number of OSC components and the number of PLS factors for the OSC-WPT-PLS method were selected as Daubechies 4, 3, 2 and 3, respectively. A program (POSCWPTPLS) was designed to perform the simultaneous spectrophotometric determination of Al(III), Mn(II) and Co(II). The relative standard errors of prediction (RSEP) obtained for total elements using OSC-WPT-PLS, WPT-PLS and PLS were compared. Experimental results demonstrated that the OSC-WPT-PLS method had the best performance among the three methods and was successful even when there was severe overlap of spectra.  相似文献   

20.
偏最小二乘法—流动注射pH梯度技术用于同时测定铜和钴   总被引:1,自引:0,他引:1  
以PAR作显色剂,用流动注射pH梯度技术测定多个不同pH下的吸光度,以偏最小二乘法建立校正模型并预测,对Cu~(2+)、Co~(2+)二元素进行了同时测定,其计算结果优于主成分回归及多元线性回归法。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号