首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The present paper elaborates on the design of classifiers based on cross‐correlation‐based principal component analysis (PCA) and Sammon's nonlinear mapping (NLM) using current signals obtained from electronic tongue (e‐tongue) with commercial mineral water samples available in the Indian market. The pulse‐voltammetric method is used to capture the electroanalytical/electrochemical characteristics of the sampled mineral waters by considering a real model for the liquid–electrode interface in a given e‐tongue apparatus. Then the cross‐correlation coefficients between the output and input signals are determined. Both PCA and Sammon's NLM create a subspace from high‐dimensional mineral water data by considering the principal eigenvectors and minimising the stress function, respectively. The proposed cross‐correlation‐based PCA and Sammon's classifiers establish the highest separation distance among the investigated water brands and carries out the authentication of more than one unknown sample of the same brand with a certain degree of variability with respect to their sources. Copyright © 2013 John Wiley & Sons, Ltd.  相似文献   

2.
3.
The main objective of this paper is to introduce principal component analysis and two robust fuzzy principal component algorithms as useful tools in characterizing and comparing rime samples collected in different locations in Poland (2004–2007). The efficiency of the applied procedures was illustrated on a data set containing 108 rime samples and concentration of anions, cations, HCHO, as well as pH and conductivity. The fuzzy principal component algorithms achieved better results mainly because they are more compressible than classical PCA and very robust to outliers. For example, a three component model, fuzzy principal component analysis-first component (FPCA-1) accounts for 62.37% of the total variance and fuzzy principal component analysis-orthogonal (FPCA-o) 90.11%; PCA accounts only for 58.30%. The first two principal components explain 51.41% of the total variance in the case of FPCA-1 and 79.59% in the case of FPCA-o as compared to only 47.55% for PCA. As a direct consequence, PCA showed only a partial differentiation of rime samples onto the plane or in the space described by different combination of two or three principal components, whereas a much sharper differentiation of the samples, regarding their origin and location, is observed when FPCAs are applied.   相似文献   

4.
5.
A strategy for monitoring and analyzing the chemical stability of Xuebijing injection (XBJ) by multiwavelength chromatographic fingerprints and multivariate classification techniques is presented in this paper. Multiwavelength chromatographic fingerprints were constructed using chromatographic data obtained at four wavelengths (260, 280, 320, and 400?nm). The raw chromatography data were preprocessed by noise reduction, baseline correction, data normalization, and interval correlation optimized shifting (icoshift). Using this method, fingerprints of 166 samples of XBJ subjected to different forced degradation conditions (irradiation, high temperature, and a range of pH values) were properly represented. Forty-one chemical components were identified using the iPeak program. In addition, the identified peak area profiling of chemical components were used for multivariate classification analysis. Principal component analysis (PCA) and Ward’s method were used to classify different XBJ degradation samples. The PCA score plot showed that XBJ degradation samples were clustered into four groups, and the results are confirmed by Ward’s method. Ten key chemical markers under different degradation conditions were found and identified by counterpropagation artificial neural networks (CP-ANN), statistical t-tests, and UPLC-Q-TOF-MS. The results suggest that the proposed strategy could be successfully applied to the comprehensive analysis of complex chemical systems.  相似文献   

6.
Quartz crystal microbalance (QCM) gas sensors based on polymeric material were fabricated and their gas response characteristics were examined for four simulant gases of chemical agents, which were dimethyl methyl phosphonate (DMMP), N,N-dimethylacetamide (DMA), 1,5-dichloropentane (DCP) and dichloroethane (DCE). For the selection of appropriate coating materials, both principal component analysis (PCA) and hierarchical cluster methods were applied to a data set collected from 15 QCM sensors for 12 analytes. Four appropriate coating materials were selected after optimizing the correlation between the 15 coating materials and the first four principal component (PC) factors. The four chosen polymers were used as sensitive component for a sensor array, and then PCA is adapted to classify four simulant gases. The results show that the QCM sensor array has high sensitivity and selectivity to four chemical agents.  相似文献   

7.
Exploratory statistical methods are used to elucidate the group structures pertaining to a chemical data set obtaned from 184 samples of five typical white wines from the Venetian Region: Soave Classico, Presecco di Conegliano-Valdobbiadene, Verduzzo del Piave, Tocai di Lison, Tocai delle Grave del Friuli. Analytical data included 19 classical parameters and 9 aroma components. Correlation matrices showed that the same three sets of intercorrelated variables were present in all the groups, and that aroma components were nearly uncorrelated with classical parameters. Principal components analysis allowed a subspace of reduced dimensionality to be derived for each group. However, because of similar pattern of the correlation matrices, a high degree of similarity was also observed among the subspaces. An efficient differentiation of the groups was obtained by canonical variates analysis. The most important chemical parameters were potassium, ash content, total nitrogen, cis-3-hexen-1-o1 and 1-hexanol. Classification of samples by the Euclidean distances from the centroids in the canonical space gave on average 94% correct results.  相似文献   

8.
An in-depth study is presented to better understand how data reduction via averaging impacts retention alignment and the subsequent chemometric analysis of data obtained using gas chromatography (GC). We specifically study the use of signal averaging to reduce GC data, retention time alignment to correct run-to-run retention shifting, and principal component analysis (PCA) to classify chromatographic separations of diesel samples by sample class. Diesel samples were selected because they provide sufficient complexity to study the impact of data reduction on the data analysis strategies. The data reduction process reduces the data sampling ratio, S(R), which is defined as the number of data points across a given chromatographic peak width (i.e., the four standard deviation peak width). Ultimately, sufficient data reduction causes the chromatographic resolution to decrease, however with minimal loss of chemical information via the PCA. Using PCA, the degree of class separation (DCS) is used as a quantitative metric. Three "Paths" of analysis (denoted A-C) are compared to each other in the context of a "benchmark" method to study the impact of the data sampling ratio on preserving chemical information, which is defined by the DCS quantitative metric. The benchmark method is simply aligning data and applying PCA, without data reduction. Path A applies data alignment to collected data, then data reduction, and finally PCA. Path B applies data reduction to collected data, and then data alignment, and finally PCA. The optimized path, namely Path C, is created from Paths A and B, whereby collected data are initially reduced to fewer data points (smaller S(R)), then aligned, and then further reduced to even fewer points and finally analyzed with PCA to provide the DCS metric. Overall, following Path C, one can successfully and efficiently classify chromatographic data by reducing to a S(R) of ~15 before alignment, and then reducing down to S(R) of ~2 before performing PCA. Indeed, following Path C, results from an average of 15 different column length-with-temperature ramp rate combinations spanning a broad range of separation conditions resulted in only a ~15% loss in classification capability (via PCA) when the loss in chromatographic resolution was ~36%.  相似文献   

9.
Nonlinear underdetermined blind separation of nonnegative dependent sources consists in decomposing a set of observed nonlinearly mixed signals into a greater number of original nonnegative and dependent component (source) signals. This hard problem is practically relevant for contemporary metabolic profiling of biological samples, where sources (a.k.a. pure components or analytes) are aimed to be extracted from mass spectra of nonlinear multicomponent mixtures. This paper presents a method for nonlinear underdetermined blind separation of nonnegative dependent sources that comply with a sparse probabilistic model, that is, sources are constrained to be sparse in support and amplitude. This model is validated on experimental pure component mass spectra. Under a sparse prior, a nonlinear problem is converted into an equivalent linear one comprised of original sources and their higher‐order, mostly second‐order, monomials. The influence of these monomials, which stand for error terms, is reduced by preprocessing a matrix of mixtures by means of robust principal component analysis and hard, soft and trimmed thresholding. Preprocessed data matrices are mapped in high‐dimensional reproducible kernel Hilbert space (RKHS) of functions by means of an empirical kernel map. Sparseness‐constrained nonnegative matrix factorizations in RKHS yield sets of separated components. They are assigned to pure components from the library using a maximal correlation criterion. The methodology is exemplified on demanding numerical and experimental examples related respectively to extraction of eight dependent components from three nonlinear mixtures and to extraction of 25 dependent analytes from nine nonlinear mixture mass spectra recorded in nonlinear chemical reaction of peptide synthesis. Copyright © 2014 John Wiley & Sons, Ltd.  相似文献   

10.
基于浓度参量同步荧光光谱技术,对不同溢油类型不同油源原油样品集、引入外扰相似油源样品集进行光谱数据采集,获取其浓度同步荧光光谱矩阵Concentration-Synchronous-Matrix-Fluorescence(CSMF),利用主成分分析方法对两套不同层次的原油相关样品集进行了多类分类识别。结果表明:主成分载荷图可以很好地反映各个原油相关样品在油源上的相似程度,结合支持向量机可以实现不同溢油类型及不同油源原油的准确分类,对于引入风化和海水外扰相似油源溢油样品集,两类分类区分的结果远远高于多类分类识别的结果。通过详细的主成分分析讨论,为溢油油种鉴别提供了一种利用多类分类识别,逐步缩减嫌疑样本数量,最后通过两两分类实现溢油样品准确识别的新思路。  相似文献   

11.
12.
The application of a new method to the multivariate analysis of incomplete data sets is described. The new method, called maximum likelihood principal component analysis (MLPCA), is analogous to conventional principal component analysis (PCA), but incorporates measurement error variance information in the decomposition of multivariate data. Missing measurements can be handled in a reliable and simple manner by assigning large measurement uncertainties to them. The problem of missing data is pervasive in chemistry, and MLPCA is applied to three sets of experimental data to illustrate its utility. For exploratory data analysis, a data set from the analysis of archeological artifacts is used to show that the principal components extracted by MLPCA retain much of the original information even when a significant number of measurements are missing. Maximum likelihood projections of censored data can often preserve original clusters among the samples and can, through the propagation of error, indicate which samples are likely to be projected erroneously. To demonstrate its utility in modeling applications, MLPCA is also applied in the development of a model for chromatographic retention based on a data set which is only 80% complete. MLPCA can predict missing values and assign error estimates to these points. Finally, the problem of calibration transfer between instruments can be regarded as a missing data problem in which entire spectra are missing on the ‘slave’ instrument. Using NIR spectra obtained from two instruments, it is shown that spectra on the slave instrument can be predicted from a small subset of calibration transfer samples even if a different wavelength range is employed. Concentration prediction errors obtained by this approach were comparable to cross-validation errors obtained for the slave instrument when all spectra were available.  相似文献   

13.
Observed data often belong to some specific intervals of values (for instance in case of percentages or proportions) or are higher (lower) than pre‐specified values (for instance, chemical concentrations are higher than zero). The use of classical principal component analysis (PCA) may lead to extract components such that the reconstructed data take unfeasible values. In order to cope with this problem, a constrained generalization of PCA is proposed. The new technique, called bounded principal component analysis (B‐PCA), detects components such that the reconstructed data are constrained to belong to some pre‐specified bounds. This is done by implementing a row‐wise alternating least squares (ALS) algorithm, which exploits the potentialities of the least squares with inequality (LSI) algorithm. The results of a simulation study and two applications to bounded data are discussed for evaluating how the method and the algorithm for solving it work in practice. Copyright © 2007 John Wiley & Sons, Ltd.  相似文献   

14.
ANOVA–simultaneous component analysis (ASCA) is a recently developed tool to analyze multivariate data. In this paper, we enhance the explorative capability of ASCA by introducing a projection of the observations on the principal component subspace to visualize the variation among the measurements. We compare the significance of experimental effects for ASCA and ANOVA–principal component analysis (PCA), a similar tool to explore multivariate data, by using permutation tests. Furthermore, we quantify the quality of the loadings estimate obtained with ASCA and compare this with the loadings estimate obtained with ANOVA–PCA. Copyright © 2011 John Wiley & Sons, Ltd.  相似文献   

15.
ChemCam is a remote laser-induced breakdown spectroscopy (LIBS) instrument that will arrive on Mars in 2012, on-board the Mars Science Laboratory Rover. The LIBS technique is crucial to accurately identify samples and quantify elemental abundances at various distances from the rover. In this study, we compare different linear and nonlinear multivariate techniques to visualize and discriminate clusters in two dimensions (2D) from the data obtained with ChemCam. We have used principal components analysis (PCA) and independent components analysis (ICA) for the linear tools and compared them with the nonlinear Sammon’s map projection technique. We demonstrate that the Sammon’s map gives the best 2D representation of the data set, with optimization values from 2.8% to 4.3% (0% is a perfect representation), together with an entropy value of 0.81 for the purity of the clustering analysis. The linear 2D projections result in three (ICA) and five times (PCA) more stress, and their clustering purity is more than twice higher with entropy values about 1.8. We show that the Sammon’s map algorithm is faster and gives a slightly better representation of the data set if the initial conditions are taken from the ICA projection rather than the PCA projection. We conclude that the nonlinear Sammon’s map projection is the best technique for combining data visualization and clustering assessment of the ChemCam LIBS data in 2D. PCA and ICA projections on more dimensions would improve on these numbers at the cost of the intuitive interpretation of the 2D projection by a human operator.  相似文献   

16.
17.
运用色谱指纹图谱与化学计量学方法对灵芝进行分类   总被引:2,自引:0,他引:2  
张景丽  罗霞  郑林用  许小燕  叶利明 《色谱》2009,27(6):776-780
采用95%乙醇为提取溶剂,运用高效液相色谱(HPLC)指纹图谱技术与化学计量学方法,对11个不同灵芝菌株子实体进行分类。通过相似度分析分别获得提取样品指纹图谱的13个共有峰及每个样品之间的相似度;以相对共有峰面积为分析参数,运用化学计量学方法包括聚类分析(HCA)、主成分分析(PCA)及判别分析(DA)对其进行分类,结果分为紫芝、赤芝和美国大灵芝3类。实验结果表明,用化学计量学的方法对灵芝样品的指纹图谱数据进行分析,是一种可用于其分类的科学方法。  相似文献   

18.
19.
Electronic nose (e-Nose) technology based on metal oxide semiconductor gas sensor array is widely studied for detection of gas components. This paper proposes a new discriminant analysis framework (NDA) for dimension reduction and e-Nose recognition. In a NDA, the between-class and the within-class Laplacian scatter matrix are designed from sample to sample, respectively, to characterize the between-class separability and the within-class compactness by seeking for discriminant matrix to simultaneously maximize the between-class Laplacian scatter and minimize the within-class Laplacian scatter. In terms of the linear separability in high dimensional kernel mapping space and the dimension reduction of principal component analysis (PCA), an effective kernel PCA plus NDA method (KNDA) is proposed for rapid detection of gas mixture components by an e-Nose. The NDA framework is derived in this paper as well as the specific implementations of the proposed KNDA method in training and recognition process. The KNDA is examined on the e-Nose datasets of six kinds of gas components, and compared with state of the art e-Nose classification methods. Experimental results demonstrate that the proposed KNDA method shows the best performance with average recognition rate and total recognition rate as 94.14% and 95.06% which leads to a promising feature extraction and multi-class recognition in e-Nose.  相似文献   

20.
开发了一种鉴别β受体激动剂的新型阵列传感器。该传感器由8种传感物质构成,使用96孔板酶标仪采集响应数据,结合主成分分析(PCA)、分层聚类分析(HCA)、判别分析(LDA)等模式识别方法进行数据处理,对5类β受体激动剂及其混合物进行检测。PCA结果表明,该传感器主要是基于空间结构以及氢键作用实现对β受体激动剂的识别;HCA结果显示,93个分析样本归类正确;LDA结果显示,该传感器对于β受体激动剂识别的准确率达98.9%。本方法在β受体激动剂的检测中有潜在应用价值。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号