首页 | 本学科首页   官方微博 | 高级检索  
     


An efficient spectra processing method for metabolite identification from 1H-NMR metabolomics data
Authors:Daniel Jacob  Catherine Deborde  Annick Moing
Affiliation:1. INRA, UMR1332 Fruit Biology and Pathology, Centre INRA de Bordeaux, 33140, Villenave d’Ornon, France
2. Metabolome Facility of Bordeaux Functional Genomics Center, IBVM, Centre INRA de Bordeaux, 33140, Villenave d’Ornon, France
Abstract:The spectra processing step is crucial in metabolomics approaches, especially for proton NMR metabolomics profiling. During this step, noise reduction, baseline correction, peak alignment and reduction of the 1D 1H-NMR spectral data are required in order to allow biological information to be highlighted through further statistical analyses. Above all, data reduction (binning or bucketing) strongly impacts subsequent statistical data analysis and potential biomarker discovery. Here, we propose an efficient spectra processing method which also provides helpful support for compound identification using a new data reduction algorithm that produces relevant variables, called buckets. These buckets are the result of the extraction of all relevant peaks contained in the complex mixture spectra, rid of any non-significant signal. Taking advantage of the concentration variability of each compound in a series of samples and based on significant correlations that link these buckets together into clusters, the method further proposes automatic assignment of metabolites by matching these clusters with the spectra of reference compounds from the Human Metabolome Database or a home-made database. This new method is applied to a set of simulated 1H-NMR spectra to determine the effect of some processing parameters and, as a proof of concept, to a tomato 1H-NMR dataset to test its ability to recover the fruit extract compositions. The implementation code for both clustering and matching steps is available upon request to the corresponding author.
Figure
Illustration of the processing approach from spectra bucketing to the proposal of candidate compounds, using a set of six simulated NMR spectra. First, the ERVA method of data reduction is applied to the spectra after noise processing, generating buckets as shown for two spectra regions. Second, the correlation matrix between bucket intensities is computed and a correlation threshold is applied for bucket clustering. The cluster shown gathers two sub-clusters (A and B), each being intra-connected with higher correlations (r?>?0.996) than the interconnections (r?
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号