首页 | 本学科首页   官方微博 | 高级检索  
     


Using Gaussian model to improve biological sequence comparison
Authors:Qi Dai  Xiaoqing Liu  Lihua Li  Yuhua Yao  Bin Han  Lei Zhu
Abstract:One of the major tasks in biological sequence analysis is to compare biological sequences, which could serve as evidence of structural and functional conservation, as well as of evolutionary relations among the sequences. Numerous efficient methods have been developed for sequence comparison, but challenges remain. In this article, we proposed a novel method to compare biological sequences based on Gaussian model. Instead of comparing the frequencies of k‐words in biological sequences directly, we considered the k‐word frequency distribution under Gaussian model which gives the different expression levels of k‐words. The proposed method was tested by similarity search, evaluation on functionally related genes, and phylogenetic analysis. The performance of our method was further compared with alignment‐based and alignment‐free methods. The results demonstrate that Gaussian model provides more information about k‐word frequencies and improves the efficiency of sequence comparison. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010
Keywords:biological sequence  Gaussian distribution  word frequency  similarity search  phylogenetic analysis
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号