A thorough evaluation of the Language Environment Analysis (LENA) system.

In the previous decade, dozens of studies involving thousands of children across several research disciplines have made use of a combined daylong audio-recorder and automated algorithmic analysis called the LENAⓇ system, which aims to assess children's language environment. While the system's prevalence in the language acquisition domain is steadily growing, there are only scattered validation efforts on only some of its key characteristics. Here, we assess the LENAⓇ system's accuracy across all of its key measures: speaker classification, Child Vocalization Counts (CVC), Conversational Turn Counts (CTC), and Adult Word Counts (AWC). Our assessment is based on manual annotation of clips that have been randomly or periodically sampled out of daylong recordings, collected from (a) populations similar to the system's original training data (North American English-learning children aged 3-36 months), (b) children learning another dialect of English (UK), and (c) slightly older children growing up in a different linguistic and socio-cultural setting (Tsimane' learners in rural Bolivia). We find reasonably high accuracy in some measures (AWC, CVC), with more problematic levels of performance in others (CTC, precision of male adults and other children). Statistical analyses do not support the view that performance is worse for children who are dissimilar from the LENAⓇ original training set. Whether LENAⓇ results are accurate enough for a given research, educational, or clinical application depends largely on the specifics at hand. We therefore conclude with a set of recommendations to help researchers make this determination for their goals.

[1]  F. Tion,et al.  Reliability of the LENA TM Language Environment Analysis System in Young Children's Natural Home Environment , 2009 .

[2]  A. Eriks-Brophy,et al.  A Concise Protocol for the Validation of Language ENvironment Analysis (LENA) Conversational Turn Counts in Vietnamese , 2018 .

[3]  Hung Thai-Van,et al.  Reliability of the Language ENvironment Analysis system (LENA™) in European French , 2015, Behavior Research Methods.

[4]  Kwang-Il Goh,et al.  Burstiness and memory in complex systems , 2006 .

[5]  J. Gilkerson,et al.  The LENA Natural Language Study , 2009 .

[6]  Ben Baumer,et al.  R Markdown: Integrating A Reproducible Analysis Tool into Introductory Statistics , 2014, 1402.1894.

[7]  J. Gilkerson,et al.  Teaching by Listening: The Importance of Adult-Child Conversations to Language Development , 2009, Pediatrics.

[8]  D K Oller,et al.  Automated vocal analysis of naturalistic recordings from children with autism, language delay, and typical development , 2010, Proceedings of the National Academy of Sciences.

[9]  J. Gilkerson,et al.  Assessing Children’s Home Language Environments Using Automatic Speech Recognition Technology , 2011 .

[10]  L. Polka,et al.  Monolingual and bilingual infants' word segmentation abilities in an inter-mixed dual-language task. , 2019, Infancy : the official journal of the International Society on Infant Studies.

[11]  Jill Gilkerson,et al.  Transcriptional Analyses of the LENA Natural Language Corpus , 2009 .

[12]  Alejandrina Cristia,et al.  Infant-Mother Acoustic-Prosodic Alignment and Developmental Risk. , 2018, Journal of speech, language, and hearing research : JSLHR.

[13]  S. von Stumm,et al.  A naturalistic home observational approach to children's language, cognition, and behavior. , 2019, Developmental psychology.

[14]  Anne Fernald,et al.  Talking to Children Matters , 2013, Psychological science.

[15]  Caroline F. Rowland,et al.  The Language 0-5 Project , 2018 .

[16]  John H L Hansen,et al.  Mapping the Early Language Environment Using All-Day Recordings and Automated Analysis. , 2017, American journal of speech-language pathology.

[17]  Elika Bergelson,et al.  What Do North American Babies Hear? A large-scale cross-corpus analysis. , 2018, Developmental science.

[18]  Astrid van Wieringen,et al.  Correlation and agreement between Language ENvironment Analysis (lena™) and manual transcription for Dutch natural language recordings , 2017, Behavior Research Methods.

[19]  Carmen Peláez-Moreno,et al.  Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Manfred K. Warmuth,et al.  THE CMU SPHINX-4 SPEECH RECOGNITION SYSTEM , 2001 .

[21]  Alejandrina Cristia,et al.  HomeBank: An Online Repository of Daylong Child-Centered Audio Recordings , 2016, Seminars in Speech and Language.

[22]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[23]  Alejandrina Cristià,et al.  A New Workflow for Semi-Automatized Annotations: Tests with Long-Form Naturalistic Recordings of Childrens Language Environments , 2017, INTERSPEECH.

[24]  Mark VanDam,et al.  A modular, extensible approach to massive ecologically valid behavioral data , 2018, Behavior Research Methods.

[25]  Hervé Bredin,et al.  pyannote.metrics: A Toolkit for Reproducible Evaluation, Diagnostic, and Error Analysis of Speaker Diarization Systems , 2017, INTERSPEECH.

[26]  Keith Topping,et al.  Evaluating language environment analysis system performance for Chinese: a pilot study in Shanghai. , 2015, Journal of speech, language, and hearing research : JSLHR.

[27]  Federica Bulgarelli,et al.  Look who’s talking: A comparison of automated and human-generated speaker tags in naturalistic day-long recordings , 2019, Behavior Research Methods.

[28]  Alejandrina Cristia,et al.  Accuracy of the Language Environment Analysis System Segmentation and Metrics: A Systematic Review. , 2020, Journal of speech, language, and hearing research : JSLHR.

[29]  Laura C. Dilley,et al.  Fidelity of automatic speech processing for adult speech classifications using the Language ENvironment Analysis ( LENA ) system , 2018 .