首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Discovery of false identification using similarity difference in GC–MS‐based metabolomics
Authors:Seongho Kim  Xiang Zhang
Abstract:Compound identification is a critical process in metabolomics. The widely used approach for compound identification in gas chromatography–mass spectrometry‐based metabolomics is spectrum matching, in which the mass spectral similarity between an experimental mass spectrum and each mass spectrum in a reference library is calculated. While various similarity measures have been developed to improve the overall accuracy of compound identification, little attention has been paid to reducing the false discovery rate. We, therefore, develop an approach for controlling the false identification rate using the distribution of the difference between the first and second highest spectral similarity scores. We further propose a model‐based approach to achieving a desired true positive rate. The developed method is applied to the National Institute of Standards and Technology mass spectral library, and its performance is compared with that of the conventional approach that uses only the maximum spectral similarity score. The results show that the developed method achieves a significantly higher F1 score and positive predictive value than did the conventional approach. Copyright © 2014 John Wiley & Sons, Ltd.
Keywords:compound identification  gas chromatography–  mass spectrometry (GC–  MS)  metabolomics  similarity  true positive rate
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号