Discovering Global Patterns in Linguistic Networks through Spectral Analysis: A Case Study of the Consonant Inventories

Recent research has shown that language and the socio-cognitive phenomena associated with it can be aptly modeled and visualized through networks of linguistic entities. However, most of the existing works on linguistic networks focus only on the local properties of the networks. This study is an attempt to analyze the structure of languages via a purely structural technique, namely spectral analysis, which is ideally suited for discovering the global correlations in a network. Application of this technique to PhoNet, the co-occurrence network of consonants, not only reveals several natural linguistic principles governing the structure of the consonant inventories, but is also able to quantify their relative importance. We believe that this powerful technique can be successfully applied, in general, to study the structure of natural languages.

[1]  Mikhail Belkin,et al.  Using eigenvectors of the bigram graph to infer morpheme identity , 2002, SIGMORPHON.

[2]  S. V. Shanmugam Dental and Alveolar Nasals in Dravidian , 1972 .

[3]  Niloy Ganguly,et al.  MODELING THE CO-OCCURRENCE PRINCIPLES OF THE CONSONANT INVENTORIES: A COMPLEX NETWORK APPROACH , 2006, physics/0606132.

[4]  Ian Maddieson,et al.  Patterns of sounds , 1986 .

[5]  Niloy Ganguly,et al.  Modeling the Structure and Dynamics of the Consonant Inventories: A Complex Network Approach , 2008, COLING.

[6]  Anirban Banerjee,et al.  Graph spectra as a systematic tool in computational biology , 2007, Discret. Appl. Math..

[7]  K. Hayward,et al.  Dental and alveolar stops in Kimvita Swahili: an electropalatographic study , 1989 .

[8]  C. Baltaxe,et al.  Principles of phonology , 1969 .

[9]  A. Barabasi,et al.  Spectra of "real-world" graphs: beyond the semicircle law. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Niloy Ganguly,et al.  Analysis and Synthesis of the Distribution of Consonants over Languages: A Complex Network Approach , 2006, ACL.

[11]  Christian Abry,et al.  The Weight of Phonetic Substance in the Structure of Sound Inventories , 2002 .

[12]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[13]  Peter Ladefoged WPP, No. 104: Features and parameters for different purposes , 2005 .

[14]  Mariano Sigman,et al.  Global organization of the Wordnet lexicon , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Santosh S. Vempala,et al.  Spectral Algorithms , 2009, Found. Trends Theor. Comput. Sci..

[16]  Animesh Mukherjee,et al.  The Structure and Dynamics of Linguistic Networks , 2009 .

[17]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[18]  Christos Gkantsidis,et al.  Spectral analysis of Internet topologies , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[19]  Anirban Banerjee,et al.  Spectral plots and the representation and interpretation of biological data , 2007, Theory in Biosciences.

[20]  George N. Clements The role of features in speech sound inventories , 2009 .