首页 | 本学科首页   官方微博 | 高级检索  
     


From NGS assembly challenges to instability of fungal mitochondrial genomes: A case study in genome complexity
Affiliation:1. Cellular & Molecular Biology Unit, Corporación para Investigaciones Biológicas, Medellín, Colombia;2. Institute of Biology, Universidad de Antioquia, Medellín, Colombia;3. Doctoral Program in Biomedical Sciences, Universidad del Rosario, Bogotá, Colombia;4. School of Medicine, Universidad de Antioquia, Medellín, Colombia;5. School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia;1. Department of Health Services Research, CAPHRI School for Public Health and Primary Care, Maastricht University, P.O. Box 616, 6200, Maastricht MD, The Netherlands;2. Fontys University of Applied Sciences, School of Nursing, P.O. Box 347, 5600, Eindhoven AH, The Netherlands;3. Department of Medical Humanities, EMGO Institute for Health and Care research, Vu University Medical Centre, Van der Boeschorststraat 7, 1007, Amsterdam MB, The Netherlands;4. Department of General Practice, CAPHRI School for Public Health and Primary Care, Maastricht University. P.O. Box 616, 6200, Maastricht MD, The Netherlands;1. Beijing Advanced Innovation Center for Big Data and Brain Computing, Beihang University, Beijing 100191, China;2. School of Transportation Science and Engineering, Beijing Key Laboratory for Cooperative Vehicle Infrastructure Systems and Safety Control, Beihang University, Beijing 100191, China;3. Jiangsu Province Collaborative Innovation Center of Modern Urban Traffic Technologies, Nanjing 210096, China;4. Department of Engineering and Design, University of Sussex, Brighton BN 1 9RH, UK;5. The Texas Department of Transportation, Austin TX 78750, USA;6. College of Computer Science and Technology, Jilin University, Changchun 130012, China;1. PhD Candidate, PhD Program in Health Policy, Harvard University, United States;2. RAND Corporation, Division of General Medicine, Brigham and Women''s Hospital, Department of Medicine, Harvard Medical School, United States;3. Department of Medicine, Massachusetts General Hospital, Harvard Medical School, United States;4. Department of Health Policy and Management, Harvard T.H. Chan School of Public Health, Department of Medicine, Harvard Medical School, Mongan Institute for Health Policy, Massachusetts General Hospital, United States;1. Department of Urology, Mount Sinai Hospital, New York, NY;2. Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, New York, NY;3. Department of Surgery, Sidney Kimmel Center for Prostate and Urologic Cancers, Memorial Sloan-Kettering Cancer Center, New York, NY;1. Division of General Internal Medicine and Geriatrics, Northwestern University Feinberg School of Medicine, Chicago, USA;2. Walgreens Co., South Elgin, USA;3. Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, USA
Abstract:The presence of repetitive or non-unique DNA persisting over sizable regions of a eukaryotic genome can hinder the genome's successful de novo assembly from short reads: ambiguities in assigning genome locations to the non-unique subsequences can result in premature termination of contigs and thus overfragmented assemblies. Fungal mitochondrial (mtDNA) genomes are compact (typically less than 100 kb), yet often contain short non-unique sequences that can be shown to impede their successful de novo assembly in silico. Such repeats can also confuse processes in the cell in vivo. A well-studied example is ectopic (out-of-register, illegitimate) recombination associated with repeat pairs, which can lead to deletion of functionally important genes that are located between the repeats. Repeats that remain conserved over micro- or macroevolutionary timescales despite such risks may indicate functionally or structurally (e.g., for replication) important regions. This principle could form the basis of a mining strategy for accelerating discovery of function in genome sequences. We present here our screening of a sample of 11 fully sequenced fungal mitochondrial genomes by observing where exact k-mer repeats occurred several times; initial analyses motivated us to focus on 17-mers occurring more than three times. Based on the diverse repeats we observe, we propose that such screening may serve as an efficient expedient for gaining a rapid but representative first insight into the repeat landscapes of sparsely characterized mitochondrial chromosomes. Our matching of the flagged repeats to previously reported regions of interest supports the idea that systems of persisting, non-trivial repeats in genomes can often highlight features meriting further attention.
Keywords:Next generation sequencing  Repetitive DNA  Fungal mitochondria  Genome assembly  DNA secondary structure
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号