首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Gene tree correction for reconciliation and species tree inference: Complexity and algorithms
Institution:1. Dipartimento di Scienze Umane e Sociali, Università degli Studi di Bergamo, Bergamo, Italy;2. Département d''Informatique et Recherche Opérationnelle, Université de Montréal, Montréal, Canada;3. Department of Computer Science, McGill, Montréal, Canada
Abstract:Reconciliation consists in mapping a gene tree T into a species tree S, and explaining the incongruence between the two as evidence for duplication, loss and other events shaping the gene family represented by the leaves of T. When S is unknown, the Species Tree Inference Problem is to infer, from a set of gene trees, a species tree leading to a minimum reconciliation cost. As reconciliation is very sensitive to errors in T, gene tree correction prior to reconciliation is a fundamental task. In this paper, we investigate the complexity of four different combinatorial approaches for deleting misplaced leaves from T. First, we consider two problems (Minimum Leaf Removal and Minimum Species Removal) related to the reconciliation of T with a known species tree S. In the former (latter respectively) we want to remove the minimum number of leaves (species respectively) so that T is “MD-consistent” with S. Second, we consider two problems (Minimum Leaf Removal Inference and Minimum Species Removal Inference) related to species tree inference. In the former (latter respectively) we want to remove the minimum number of leaves (species respectively) from T so that there exists a species tree S such that T is MD-consistent with S. We prove that Minimum Leaf Removal and Minimum Species Removal are APX-hard, even when each label has at most two occurrences in the input gene tree, and we present fixed-parameter algorithms for the two problems. We prove that Minimum Leaf Removal Inference is not only NP-hard, but also W2]-hard and inapproximable within factor clnn, where n is the number of leaves in the gene tree. Finally, we show that Minimum Species Removal Inference is NP-hard and W2]-hard, when parameterized by the size of the solution, that is the minimum number of species removals.
Keywords:Bioinformatics  Gene trees  Reconciliation of gene trees and species trees  Algorithms  Computational complexity
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号