The challenges of statistical patterns of language: The case of Menzerath's law in genomes
Authors: Ramon Ferrer-i-Cancho; Núria Forns; Antoni Hernández-Fernández; Gemma Bel-Enguix; Jaume Baixeries
Affiliation: 1. Departament de Llenguatges i Sistemes Informàtics, Complexity and Quantitative Linguistics Lab, TALP Research Center/LARCA, Universitat Politècnica de Catalunya, Barcelona (Catalonia), Spain; 2. Departament de Microbiologia, Facultat de Biologia; 3. Departament de Lingüística General, Universitat de Barcelona, Barcelona (Catalonia), Spain; 4. Laboratoire d'Informatique Fondamentale, University Aix-Marseille & CNRS, Marseille, France
Abstract: The importance of statistical patterns of language has been debated for decades. Although Zipf's law is perhaps the most popular case, Menzerath's law has recently entered the debate. In many situations, Menzerath's law manifests in language, music, and genomes as a tendency of the mean size of the parts to decrease as the number of parts increases. In genomes, for instance, species with more chromosomes tend to have a smaller mean chromosome size. It has been argued that the instantiation of this law in genomes is not indicative of any parallel between language and genomes because (a) the law is inevitable and (b) noncoding DNA dominates genomes. Here, the mathematical, statistical, and conceptual challenges of these criticisms are discussed. Two major conclusions are drawn: the law is not inevitable, and languages also have a correlate of noncoding DNA. However, the wide range of manifestations of the law in and outside genomes suggests that the striking similarities between noncoding DNA and certain linguistic units could be anecdotal for understanding the recurrence of that statistical law. © 2012 Wiley Periodicals, Inc. Complexity, 2012
Keywords: statistical laws; language; genomes; music; non-coding DNA; Menzerath's law