首页 | 本学科首页   官方微博 | 高级检索  
     检索      


An Efficient Algorithm to Identify DNA Motifs
Authors:Mostafa M Abbass  Hazem M Bahig
Institution:1. Department of Basic Science, Faculty of Engineering, Sinai University, Sinai, Egypt
2. KINDI Lab for Computing Research, College of Engineering, Qatar University, Doha, Qatar
3. Computer Science and Software Engineering Department, College of Computer Science and Engineering, Hail University, Hail, Kingdom of Saudi Arabia
4. Computer Science Division, Department of Mathematics, Faculty of Science, Ain Shams University, Cairo, Egypt
Abstract:We consider the problem of identifying motifs that abstracts the task of finding short conserved sites in genomic DNA. The planted (l, d)-motif problem, PMP, is the mathematical abstraction of this problem, which consists of finding a substring of length l that occurs in each s i in a set of input sequences S = {s 1, s 2, . . . ,s t } with at most d substitutions. Our propose algorithm combines the voting algorithm and pattern matching algorithm to find exact motifs. The combined algorithm is achieved by running the voting algorithm on t′ sequences, t′ < t. After that we use the pattern matching on the output of the voting algorithm and the reminder sequences, t ? t′. Two values of t′ are calculated. The first value of t′ makes the running time of our proposed algorithm less than the running time of voting algorithm. The second value of t′ makes the running time of our proposed algorithm is minimal. We show that our proposed algorithm is faster than the voting algorithm by testing both algorithms on simulated data from (9, d ≤ 2) to (19, d ≤ 7). Finally, we test the performance of the combined algorithm on realistic biological data.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号