Automatically assessing the ABCs: Verification of children's spoken letter-names and letter-sounds

Automatic literacy assessment is an area of research that has shown significant progress in recent years. Technology can be used to automatically administer reading tasks and to analyze and interpret children's reading skills. It has the potential to transform the classroom dynamic by providing useful information to teachers in a repeatable, consistent, and affordable way. While most previous research has focused on automatically assessing children reading words and sentences, assessment of children's earlier foundational skills is also needed. We address this problem by automatically verifying preliterate children's pronunciations of English letter-names and the sounds each letter represents (“letter-sounds”). The children analyzed in this study came from diverse bilingual backgrounds and were recorded in actual kindergarten through second-grade classrooms. We first manually verified (accept/reject) the letter-name and letter-sound utterances; these judgments serve as the ground truth in this study. Next, we investigated four automatic verification methods based on automatic speech recognition techniques. We attained percent agreement with human evaluations of 90% for the letter-name task and 85% for the letter-sound task. By comparison, human evaluators agree with one another 95% of the time on average for both tasks. We discuss the confounding factors for this assessment task, such as background noise and the presence of disfluencies, that affect automatic verification performance.
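As an illustration of the evaluation metric, percent agreement between automatic and human accept/reject decisions can be computed as the share of utterances on which the two labels match. The following is a minimal sketch, not the authors' implementation; the function name `percent_agreement` and the example labels are hypothetical and are not data from the study.

```python
def percent_agreement(auto_labels, human_labels):
    """Return the percentage of utterances on which two label lists match."""
    if len(auto_labels) != len(human_labels):
        raise ValueError("label lists must have the same length")
    if not auto_labels:
        raise ValueError("label lists must not be empty")
    # Count utterances where the automatic decision equals the human decision.
    matches = sum(a == h for a, h in zip(auto_labels, human_labels))
    return 100.0 * matches / len(auto_labels)


# Hypothetical accept/reject decisions for five utterances (illustration only).
human = ["accept", "accept", "reject", "accept", "reject"]
auto = ["accept", "reject", "reject", "accept", "reject"]

print(percent_agreement(auto, human))  # 4 of 5 labels match -> 80.0
```

The same function applies to a pair of human evaluators, which is how an inter-evaluator agreement figure could be obtained under this definition.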
