Fractal power law in literary English

We present in this paper a numerical investigation of literary texts by various well-known English writers, covering the first half of the twentieth century, based upon the results obtained through corpus analysis of the texts. A fractal power law is obtained for the lexical wealth defined as the ratio between the number of different words and the total number of words of a given text. By considering as a signature of each author the exponent and the amplitude of the power law, and the standard deviation of the lexical wealth, it is possible to discriminate works of different genres and writers and show that each writer has a very distinct signature, either considered among other literary writers or compared with writers of non-literary texts. It is also shown that, for a given author, the signature is able to discriminate between short stories and novels.

[1]  Baruch Vilensky,et al.  Can analysis of word frequency distinguish between writings of different authors , 1996 .

[2]  V V Solovyev,et al.  Fractal graphical representation and analysis of DNA and protein sequences. , 1993, Bio Systems.

[3]  W. Ebeling,et al.  Correction algorithm for finite sample statistics , 2003, The European physical journal. E, Soft matter.

[4]  D. Sornette Critical Phenomena in Natural Sciences: Chaos, Fractals, Selforganization and Disorder: Concepts and Tools , 2000 .

[5]  S. Hergarten,et al.  Aspects of risk assessment in power-law distributed natural hazards , 2004 .

[6]  Rosario N. Mantegna,et al.  Book Review: An Introduction to Econophysics, Correlations, and Complexity in Finance, N. Rosario, H. Mantegna, and H. E. Stanley, Cambridge University Press, Cambridge, 2000. , 2000 .

[7]  F. E. Silva,et al.  Characterization of failure mechanism in composite materials through fractal analysis of acoustic emission signals , 2005 .

[8]  Perline Zipf's law, the central limit theorem, and the random division of the unit interval. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[9]  Rosario N. Mantegna,et al.  An Introduction to Econophysics: Contents , 1999 .

[10]  D. Turcotte,et al.  Landslide inventories and their statistical properties , 2004 .

[11]  S. Havlin The distance between Zipf plots , 1995 .

[12]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[13]  W. Ebeling,et al.  Entropy and Long-Range Correlations in Literary English , 1993, cond-mat/0204108.

[14]  Alexander F. Gelbukh,et al.  Zipf and Heaps Laws' Coefficients Depend on Language , 2001, CICLing.

[15]  Albert-László Barabási,et al.  Avalanches and power-law behaviour in lung inflation , 1994, Nature.