On the power-law distribution of language family sizes

When the sizes of language families of the world, measured by the number of languages contained in each family, are plotted in descending order on a diagram where the x-axis represents the place of each family in the rank-order (the largest family having rank 1, the next-largest, rank 2, and so on) and the y-axis represents the number of languages in the family determining the rank-ordering, it is seen that the distribution closely approximates a curve defined by the formula y=ax−b. Such ‘power-law’ distributions are known to characterize a wide range of social, biological, and physical phenomena and are essentially of a stochastic nature. It is suggested that the apparent power-law distribution of language family sizes is of relevance when evaluating overall classifications of the world's languages, for the analysis of taxonomic structures, for developing hypotheses concerning the prehistory of the world's languages, and for modelling the future extinction of language families.

[1]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[2]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[3]  Merritt Ruhlen,et al.  A Guide to the World's Languages, Vol. 1: Classification, 2nd Edn , 1993 .

[4]  J. Chu,et al.  A simple explanation for taxon abundance patterns. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[5]  G. Yule,et al.  A Mathematical Theory of Evolution Based on the Conclusions of Dr. J. C. Willis, F.R.S. , 1925 .

[6]  David Glenn Smith,et al.  Examining the Farming/Language Dispersal Hypothesis. , 2005 .

[7]  F. Auerbach Das Gesetz der Bevölkerungskonzentration. , 1913 .

[8]  Béla Bollobás,et al.  The degree sequence of a scale‐free random graph process , 2001, Random Struct. Algorithms.

[9]  F. Galton,et al.  On the Probability of the Extinction of Families , 1875 .

[10]  D. L. Anderson,et al.  Theoretical Basis of Some Empirical Relations in Seismology by Hiroo Kanamori And , 1975 .

[11]  D. Sornette,et al.  Stretched exponential distributions in nature and economy: “fat tails” with characteristic scales , 1998, cond-mat/9801293.

[12]  J. Sepkoski,et al.  Ten years in the library: new data confirm paleontological patterns , 1993, Paleobiology.

[13]  P. Bellwood Farmers, Foragers, Languages, Genes: the Genesis of Agricultural Societies , 2002 .

[14]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[15]  D. Champernowne A Model of Income Distribution , 1953 .

[16]  Steven Glassman,et al.  A Caching Relay for the World Wide Web , 1994, Comput. Networks ISDN Syst..

[17]  Reinhard Köhler,et al.  Zur linguistischen Synergetik : Struktur und Dynamik der Lexik , 1986 .

[18]  M. Ruhlen A Guide to the World’s Languages , 1987 .

[19]  W. Reed,et al.  On the size distribution of live genera. , 2002, Journal of theoretical biology.

[20]  Tetsuya Yomo,et al.  Universality and flexibility in gene expression from bacteria to human. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[21]  David M. Raup,et al.  How Nature Works: The Science of Self-Organized Criticality , 1997 .

[22]  Peter Bellwood EARLY AGRICULTURALIST POPULATION DIASPORAS ?F ARMING ,L ANGUAGES, AND GENES , 2001 .