Language documentation twenty-five years on

Abstract:This discussion note reviews responses of the linguistics profession to the grave issues of language endangerment identified a quarter of a century ago in the journal Language by Krauss, Hale, England, Craig, and others (Hale et al. 1992). Two and a half decades of worldwide research not only have given us a much more accurate picture of the number, phylogeny, and typological variety of the world’s languages, but they have also seen the development of a wide range of new approaches, conceptual and technological, to the problem of documenting them. We review these approaches and the manifold discoveries they have unearthed about the enormous variety of linguistic structures. The reach of our knowledge has increased by about 15% of the world’s languages, especially in terms of digitally archived material, with about 500 languages now reasonably documented thanks to such major programs as DoBeS, ELDP, and DEL. But linguists are still falling behind in the race to document the planet’s rapidly dwindling linguistic diversity, with around 35–42% of the world’s languages still substantially undocumented, and in certain countries (such as the US) the call by Krauss (1992) for a significant professional realignment toward language documentation has only been heeded in a few institutions. Apart from the need for an intensified documentarist push in the face of accelerating language loss, we argue that existing language documentation efforts need to do much more to focus on crosslinguistically comparable data sets, sociolinguistic context, semantics, and interpretation of text material, and on methods for bridging the ‘transcription bottleneck’, which is creating a huge gap between the amount we can record and the amount in our transcribed corpora.

[1]  Anthony C. Woodbury,et al.  Finding a way into a family of tone languages: The story and methods of the Chatino Language Documentation Project , 2014 .

[2]  L. Joan Vanishing Voices: The Extinction of the World's Languages. , 2004 .

[3]  M. Brenzinger Language Diversity Endangered , 2008 .

[4]  Laura C. Robinson,et al.  In Defense of the Lone Wolf: Collaboration in Language Documentation , 2013 .

[5]  S. Allen Polysynthesis in the Acquisition of Inuit Languages , 2017 .

[6]  Mark A. Sicoli Repair organization in Chinantec whistled speech , 2016 .

[7]  Colleen M. Fitzgerald,et al.  Training Communities, Training Graduate Students: The 2012 Oklahoma Breath of Life Workshop , 2013 .

[8]  P. Austin,et al.  Dying to be counted: the commodification of endangered languages in documentary linguistics 1 , 2007 .

[9]  Leanne Hinton The Use of Linguistic Archives in Language Revitalization: The Native California Language Restoration Workshop , 2001 .

[10]  Caterina Mauri,et al.  Linguistic Diversity , 2020, The International Encyclopedia of Higher Education Systems and Institutions.

[11]  Ken Hale,et al.  Language endangerment and the human value of linguistic diversity , 2015 .

[12]  Robert Henderson,et al.  More than Words: Towards a Development-Based Approach to Language Revitalization , 2014 .

[13]  Caryl D. Johnson,et al.  The World until Yesterday: What Can We Learn from Traditional Societies? , 2013 .

[14]  Enrique L. Palancar,et al.  Tone and inflection : new facts and new perspectives , 2016 .

[15]  Claire Bowern Language vitality: Theorizing language loss, shift, and reclamation (Response to Mufwene) , 2017 .

[16]  L. Grenoble A response to ‘Assessing levels of endangerment in the Catalogue of Endangered Languages (ELCat) using the Language Endangerment Index (LEI)’, by Nala Huiying Lee & John Van Way , 2016, Language in Society.

[17]  Gary F. Simons,et al.  The world’s languages in crisis , 2013 .

[18]  Sarah Rothstein Space In Language And Cognition Explorations In Cognitive Diversity , 2016 .

[19]  Felicity Meakins,et al.  Gurindji Kriol: A Mixed Language Emerges from Code-switching , 2005 .

[20]  N. Evans,et al.  The grammar of engagement I: framework and initial exemplification , 2017, Language and Cognition.

[21]  Christian T. DiCanio The phonetics of register in Takhian Thong Chong , 2009, Journal of the International Phonetic Association.

[22]  Racquel-María Yamada,et al.  Collaborative Linguistic Fieldwork: Practical Application of the Empowerment Model , 2007 .

[23]  E. Dąbrowska Naive v. expert intuitions: An empirical study of acceptability judgments , 2010 .

[24]  James Stanford 20. Clan as a sociolinguistic variable: Three approaches to Sui clans , 2009 .

[25]  D. Hargreaves,et al.  Agency and Intentional Action in Kathmandu Newar , 2014 .

[26]  S. Levinson,et al.  Differential Ineffability and the Senses , 2014 .

[27]  L. S. Roque Using you to get to me: Addressee perspective and speaker stance in Duna evidential marking , 2015 .

[28]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[29]  Damir Cavar,et al.  Endangered Language Documentation: Bootstrapping a Chatino Speech Corpus, Forced Aligner, ASR , 2016, LREC.

[30]  Frank Seifart,et al.  Nouns slow down speech across structurally and culturally diverse languages , 2018, Proceedings of the National Academy of Sciences.

[31]  The vitality and diversity of multilingual repertoires: Commentary on Mufwene , 2017 .

[32]  Leanne Hinton The Use of Linguistic Archives in Language Revitalization , 2001 .

[33]  Ulrike Zeshan,et al.  Reciprocals constructions in Indo-Pakistani sign language , 2011 .

[34]  Larry M. Hyman Elicitation as Experimental Phonology: Thlantlang Lai Tonology , 2006 .

[35]  Victor Lyle Dowdell,et al.  Anthropology From a Pragmatic Point of View , 1978 .

[36]  Ina Bornkessel-Schlesewsky,et al.  The Neurophysiology of Language Processing Shapes the Evolution of Grammar: Evidence from Case Marking , 2015, PloS one.

[37]  A. Majid,et al.  How Changing Lifestyles Impact Seri Smellscapes and Smell Language , 2017 .

[38]  Christina M. Esposito,et al.  Santa Ana del Valle Zapotec phonation , 2003 .

[39]  K. Hale,et al.  Book Review: The Green Book of Language Revitalization in Practice , 2001 .

[40]  Carmel O’Shannessy The role of multiple sources in the formation of an innovative auxiliary category in Light Warlpiri, a new Australian mixed language , 2013 .

[41]  L. Matthewson On the Methodology of Semantic Fieldwork1 , 2004, International Journal of American Linguistics.

[42]  Nick Thieberger Using language documentation data in a broader context , 2012 .

[43]  Steven Bird,et al.  Aikuma: A Mobile App for Collaborative Language Documentation , 2014 .

[44]  Daniel McCloy,et al.  Revisiting population size vs. phoneme inventory size , 2012 .

[45]  Maria-Josep Solé,et al.  Experimental approaches to phonology , 2007 .

[46]  藤村 靖,et al.  Vocal physiology : voice production, mechanisms, and functions , 1988 .

[47]  Lise M. Dobrin,et al.  From linguistic elicitation to eliciting the linguist: Lessons in community empowerment from Melanesia , 2008 .

[48]  M. Dalrymple,et al.  Reciprocal Expressions and the Concept of Reciprocity , 1998 .

[49]  Maria Polinsky,et al.  Long-Distance Agreement And Topic In Tsez , 2001 .

[50]  John W. Du Bois The Discourse Basis of Ergativity , 1987 .

[51]  Stephen C. Levinson,et al.  Grammars of space: Explorations in cognitive diversity , 2006 .

[52]  S. Levinson,et al.  Structural Phylogenetics and the Reconstruction of Ancient Language History , 2005, Science.

[53]  N. C. England Doing Mayan linguistics in Guatemala , 2015 .

[54]  C. Cieri,et al.  Evaluating phonemic transcription of low-resource tonal languages for language documentation , 2018 .

[55]  M. Kovach,et al.  Indigenous Methodologies: Characteristics, Conversations, and Contexts , 2010 .

[56]  L. Watahomigie,et al.  Local reactions to perceived language decline , 2015 .

[57]  Kevin R. Gregg,et al.  SLA volume 17 issue 1 Cover and Front matter , 1995, Studies in Second Language Acquisition.

[58]  Robert Forkel,et al.  The World Atlas of Language Structures Online , 2009 .

[59]  Christfried Naumann The phoneme inventory of Taa (West !Xoon dialect) , 2013 .

[60]  Sarah L. Nesbeitt Ethnologue: Languages of the World , 1999 .

[61]  Ulrike Mosel,et al.  Essentials of language documentation , 2006 .

[62]  David W. Fleck Evidentiality and double tense in Matses , 2007 .

[63]  Joel Sherzer,et al.  The Archive of the Indigenous Languages of Latin America: An Overview , 2013 .

[64]  J. Loh,et al.  A global index of biocultural diversity , 2005 .

[65]  M. Dunn,et al.  Demonstratives in Cross-Linguistic Perspective , 2018 .

[66]  Pedro Mateo Pedro,et al.  The Acquisition of Inflection in Q’anjob’al Maya , 2015 .

[67]  S. Mufwene Colonisation , Globalisation , and the Future of Languages in the Twenty-first Century , 2002 .

[68]  P. Lewis Ethnologue : languages of the world , 2009 .

[69]  Joshua A. Fishman,et al.  Reversing Language Shift: Theoretical and Empirical Foundations of Assistance to Threatened Languages. Multilingual Matters Series: 76. , 1991 .

[70]  Linda Smith,et al.  Decolonizing Methodologies: Research and Indigenous Peoples , 2000 .

[71]  Shawn Wilson Research Is Ceremony: Indigenous Research Methods , 2008 .

[72]  Ian Maddieson,et al.  Human spoken language diversity and the acoustic adaptation hypothesis , 2015 .

[73]  William J. Sutherland,et al.  Parallel extinction risk and global distribution of languages and species , 2003, Nature.

[74]  Morten H. Christiansen,et al.  The need for quantitative methods in syntax and semantics research , 2013 .

[75]  Martin Haspelmath,et al.  The World Atlas of Language Structures Online , 2013 .

[76]  C. Moseley,et al.  Atlas Of The World’s Languages In Danger , 2015 .

[77]  Roberto Zamparelli,et al.  QUANTIFICATION AND THE NATURE OF CROSSLINGUISTIC VARIATION* , 2001 .

[78]  Anna Margetts,et al.  Potentials of language documentation: methods, analyses, and utilization , 2012 .

[79]  Jeff Good,et al.  Beyond the ancestral code: Towards a model for sociolinguistic language documentation , 2014 .

[80]  Daniel Nettle,et al.  Social scale and structural complexity in human languages , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[81]  Martine Adda-Decker,et al.  Lig-Aikuma: A Mobile App to Collect Parallel Speech for Under-Resourced Language Studies , 2016, INTERSPEECH.

[82]  Elena Mihas,et al.  Responses to Language Endangerment: in honor of Mickey Noonan: new directions in language documentation and language revitalization , 2013 .

[83]  J. Léonard,et al.  Documentation et revitalisation des "langues en danger" : épistémologie et praxis , 2015 .

[84]  Edward L. Keenan,et al.  Handbook of quantifiers in natural language , 2012 .

[85]  Sandy Lovie How the mind works , 1980, Nature.

[86]  David Harmon,et al.  The index of linguistic diversity: A new quantitative measure of trends in the status of the world's languages , 2010 .

[87]  Ewa Czaykowska-Higgins Research Models, Community Engagement, and Linguistic Fieldwork: Reflections on Working within Canadian Indigenous Communities , 2009 .

[88]  Lynn Yong-Shi Hou,et al.  "Making hands" : family sign languages in the San Juan Quiahije community , 2016 .

[89]  Morten H. Christiansen,et al.  Sound–meaning association biases evidenced across thousands of languages , 2016, Proceedings of the National Academy of Sciences.

[90]  Jennifer Green,et al.  Drawn from the Ground: Sound, Sign and Inscription in Central Australian Sand Stories , 2014 .

[91]  Gary F. Simons,et al.  ASSESSING ENDANGERMENT: EXPANDING FISHMAN'S GIDS , 2010 .

[92]  Nick Thieberger,et al.  A Grammar of South Efate: An Oceanic Language of Vanuatu , 2006 .

[93]  John H. Esling,et al.  The valves of the throat and their functioning in tone, vocal register and stress: laryngoscopic case studies , 2006, Phonology.

[94]  Felix K. Ameka,et al.  Catching Language: The Standing Challenge of Grammar Writing , 2006 .

[95]  Sven Grawunder,et al.  Reducing language to rhythm: Amazonian Bora drummed language exploits speech rhythm for long-distance communication , 2018, Royal Society Open Science.

[96]  I. Kant Anthropology from a pragmatic point of view , 1974 .

[97]  K. Gallagher Darwin’s Dangerous Idea: Evolution and the Meanings of Life , 1996 .

[98]  R. Singer,et al.  What practices and ideologies support small-scale multilingualism? A case study of Warruwi Community, northern Australia , 2016 .

[99]  Michael E. Krauss The world's languages in crisis , 2015 .

[100]  JANE H. Hill Reversing Language Shift: Theoretical and Empirical Foundations of Assistance to Threatened Languages , 1994 .

[101]  Lindsay J. Whaley,et al.  Dying words: endangered languages and what they have to tell us , 2011 .

[102]  N. Himmelmann,et al.  Documentary and descriptive linguistics , 1998 .

[103]  Nancy C. Dorian,et al.  Investigating Obsolescence: Studies in Language Contraction and Death , 1990 .

[104]  Lucille J. Watahomigie,et al.  Endangered languages. , 1991, Science.

[105]  Rob Amery,et al.  Phoenix or Relic? Documentation of Languages with Revitalization in Mind , 2009 .

[106]  Clifton Pye The Comparative Method of Language Acquisition Research , 2017 .

[107]  Dennis R. Preston,et al.  Variation in indigenous minority languages , 2009 .

[108]  Nicholas Evans,et al.  The Acquisition of Polysynthetic Verb Forms in Chintang , 2017 .

[109]  P. Pye-Smith The Descent of Man, and Selection in Relation to Sex , 1871, Nature.

[110]  L. Babel,et al.  Searching for meaning in the Library of Babel : field semantics and problems of digital archiving , 2006 .

[111]  David W. Fleck,et al.  A grammar of Matsés , 2003 .

[112]  Cadey Korson,et al.  The World until Yesterday: What Can We Learn from Traditional Societies? , 2017 .

[113]  Peter Austin,et al.  The Cambridge handbook of endangered languages , 2011 .

[114]  Jeffrey Heath Functional grammar of Nunggubuyu , 1984 .

[115]  A. Majid,et al.  Odors are expressible in language, as long as you speak the right language , 2014, Cognition.

[116]  L. Matthewson,et al.  Methodologies in Semantic Fieldwork , 2015 .

[117]  Colleen M. Fitzgerald Understanding language vitality and reclamation as resilience: A framework for language endangerment and ‘loss’ (Commentary on Mufwene) , 2017 .

[118]  S. Levinson,et al.  The myth of language universals: language diversity and its importance for cognitive science. , 2009, The Behavioral and brain sciences.

[119]  Ulrike Mosel,et al.  Chapter 1 Language documentation: What is it and what is it good for? , 2006 .

[120]  Bettina Speckmann,et al.  Simultaneous visualization of language endangerment and language description , 2018 .

[121]  Chien Yuehchen,et al.  Yilan Creole in Taiwan , 2010 .

[122]  S. Mufwene Language vitality: The weak theoretical underpinnings of what can be an exciting research area , 2017 .

[123]  S. Levinson,et al.  Reciprocals and semantic typology , 2011 .

[124]  Friederike Lüpke African(ist) perspectives on vitality: Fluidity, small speaker numbers, and adaptive multilingualism make vibrant ecologies (Response to Mufwene) , 2017 .

[125]  J. Fishman Whorfianism of the third kind: Ethnolinguistic diversity as a worldwide societal asset (The Whorfian Hypothesis: Varieties of validation, confirmation, and disconfirmation II) , 1982, Language in Society.

[126]  Henrik Bergqvist Complex Epistemic Perspective in Kogi (Arwako)1 , 2016, International Journal of American Linguistics.

[127]  Simon J. Greenhill,et al.  Evolved structure of language shows lineage-specific trends in word-order universals , 2011, Nature.

[128]  D. Ladd,et al.  Linguistic tone is related to the population frequency of the adaptive haplogroups of two brain size genes, ASPM and Microcephalin , 2007, Proceedings of the National Academy of Sciences.

[129]  Irit Meir,et al.  The gradual emergence of phonological form in a new language , 2011, Natural language & linguistic theory.

[130]  K. David Harrison,et al.  When languages die : the extinction of the world's languages and the erosion of human knowledge , 2007 .

[131]  Sebastian Sauppe Symmetrical and asymmetrical voice systems and processing load: Pupillometric evidence from sentence production in Tagalog and German , 2017 .

[132]  Patricia Epps Amazonian linguistic diversity and its sociocultural correlates , 2020 .

[133]  N. Farnsworth,et al.  The value of plants used in traditional medicine for drug discovery. , 2001, Environmental health perspectives.

[134]  A. Majid,et al.  Revisiting the limits of language: The odor lexicon of Maniq , 2014, Cognition.

[135]  Victoria Nyst,et al.  A descriptive analysis of Adamorobe sign language (Ghana) , 2007 .

[136]  Bernard Spolsky,et al.  When Languages Die: The Extinction of the World's Languages and the Erosion of Human Knowledge (review) , 2010 .

[137]  Keren Rice,et al.  Let the language tell its story? The role of linguistic theory in writing grammars , 2006 .

[138]  Sebastian Stüker,et al.  Breaking the Unwritten Language Barrier: The BULB Project , 2016, SLTU.

[139]  Lisa Matthewson,et al.  Pronouns, Presuppositions, and Semantic Variation , 2008 .

[140]  Agnieszka E. Konopka,et al.  Word order affects the time course of sentence formulation in Tzeltal , 2013 .

[141]  J. L. Gittleman,et al.  The biodiversity of species and their rates of extinction, distribution, and protection , 2014, Science.

[142]  Stefan Schnell,et al.  The discourse basis of ergativity revisited , 2016 .

[143]  Ulrike Mosel,et al.  Chapter 5 The ethnography of language and language documentation , 2006 .

[144]  Lea Brown,et al.  The verbs for ‘and’ in Walman, a Torricelli language of Papua New Guinea , 2008 .

[145]  Larry M. Hyman Morphological Tonal Assignments in Conflict: Who Wins? , 2013 .

[146]  Stefan Schnell,et al.  Do grammatical relations reflect information status? Reassessing Preferred Argument Structure theory against discourse data from Tondano , 2017 .

[147]  Stephen R. Anderson,et al.  How Many Languages Are There in the World , 2004 .

[148]  Luisa Maffi,et al.  LINGUISTIC, CULTURAL, AND BIOLOGICAL DIVERSITY , 2005 .

[149]  Colleen M. Fitzgerald Creating sustainable models of language documentation and revitalization , 2018 .

[150]  Rachel Nordlinger,et al.  The Acquisition of Murrinhpatha (Northern Australia) , 2017 .

[151]  N. Evans View with a view: Towards a typology of multiple perspective constructions , 2005 .

[152]  N. Evans,et al.  The grammar of engagement II: typology and diachrony , 2017, Language and Cognition.