首页 | 本学科首页   官方微博 | 高级检索  
     检索      


PeakSelect: preprocessing tandem mass spectra for better peptide identification
Authors:Zhang Jingfen  He Simin  Ling Charles X  Cao Xingjun  Zeng Rong  Gao Wen
Institution:Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China. zhangjingf@missouri.edu
Abstract:We present a new preprocessing method, PeakSelect, to improve the accuracy and efficiency of Tandem Mass-Spec peptide (protein) identification. The fundamental difference between noise and fragment ions in spectra is that ions have isotopes but noise does not. We propose a new and important concept of an Isotope Pattern Vector (IPV) which characterizes the isotope cluster of fragment ions. Then the noise and real peaks can be distinguished by the quantitative IPV values. PeakSelect first uses a new method of the Gaussian Mixture Model and Expectation-Maximization (EM) algorithm to find the base intensity level (baseline) in a spectrum. Then PeakSelect selects features based on the IPV and baseline, and constructs a decision tree to automatically classify the peaks into different categories such as noise, single ion peaks, and overlapping peaks. Experiments show that PeakSelect can help to reduce the Mascot searching time and increase the reliability of peptide identifications. In particular, PeakSelect performs well on complex spectra with a large number of peaks from large peptides, and supports more sequence identification than other well-known systems.
Keywords:
本文献已被 PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号