On the complexity of SNP block partitioning under the perfect phylogeny model |
| |
Authors: | Jens Gramm Till Tantau |
| |
Affiliation: | a Wilhelm-Schickard-Institut für Informatik, Universität Tübingen, Germany b Department of Computer Science, Bar-Ilan University, Ramat-Gan 52900, Israel c International Computer Science Institute, Berkeley, USA d School of Computer Science, Tel-Aviv University, Tel-Aviv 69978, Israel e Institut für Theoretische Informatik, Universität zu Lübeck, Germany |
| |
Abstract: | Recent technologies for typing single nucleotide polymorphisms (SNPs) across a population are producing genome-wide genotype data for tens of thousands of SNP sites. The emergence of such large data sets underscores the importance of algorithms for large-scale haplotyping. Common haplotyping approaches first partition the SNPs into blocks of high linkage-disequilibrium, and then infer haplotypes for each block separately. We investigate an integrated haplotyping approach where a partition of the SNPs into a minimum number of non-contiguous subsets is sought, such that each subset can be haplotyped under the perfect phylogeny model. We show that finding an optimum partition is -hard even if we are guaranteed that two subsets suffice. On the positive side, we show that a variant of the problem, in which each subset is required to admit a perfect path phylogeny haplotyping, is solvable in polynomial time. |
| |
Keywords: | Perfect phylogeny haplotyping Perfect path phylogeny Partitioning problems |
本文献已被 ScienceDirect 等数据库收录! |
|