The Vowel Game: Continuous Real-Time Visualization for Pronunciation Learning with Vowel Charts

Learning to pronounce new speech sounds is difficult. Visual feedback helps in identifying the errors and indicating the achieved progress. The Vowel Game uses a visualization method that symbolizes the vocal tract. This instructs the user on how to adjust e.g. the tongue position during pronunciation. It gives information about the correctness and goodness of the uttered vowel. Preliminary evaluation suggests that continuous real-time feedback can be obtained, but the effect on learning remains to be tested.

[1]  Y. Tohkura,et al.  A perceptual interference account of acquisition difficulties for non-native phonemes , 2003, Cognition.

[2]  A. Paige,et al.  Calculation of vocal tract length , 1970 .

[3]  Taku Komura,et al.  Topology matching for fully automatic similarity estimation of 3D shapes , 2001, SIGGRAPH.

[4]  Klára Vicsi,et al.  A multilingual, multimodal, speech training system, SPECO , 2001, INTERSPEECH.

[5]  P. Kuhl Human adults and human infants show a “perceptual magnet effect” for the prototypes of speech categories, monkeys do not , 1991, Perception & psychophysics.

[6]  Ronald Pose,et al.  Priority rendering with a virtual reality address recalculation pipeline , 1994, SIGGRAPH.

[7]  Philip N. Day,et al.  Modelling the Effects of Delayed Visual Feedback in Real-Time Operator Control Loops : A Cognitive Perspective , 1999 .

[8]  M. Halle,et al.  Preliminaries to Speech Analysis: The Distinctive Features and Their Correlates , 1961 .

[9]  B. C. Griffith,et al.  The discrimination of speech sounds within and across phoneme boundaries. , 1957, Journal of experimental psychology.

[10]  Michael Carey,et al.  CALL Visual Feedback for Pronunciation of Vowels: Kay Sona-Match , 2015 .

[11]  Felicia Zhang Using an Interactive Feedback Tool to Enhance Pronunciation in Language Learning , 2005 .

[12]  Joanna Light,et al.  Using visible speech to train perception and production of speech for individuals with hearing loss. , 2004, Journal of speech, language, and hearing research : JSLHR.

[13]  Sanjaya Mishra,et al.  Interactive Multimedia in Education and Training , 2004 .

[14]  Steve Benford,et al.  Coping with inconsistency due to network delays in collaborative virtual environments , 1999, VRST '99.

[15]  Simon Carlile,et al.  Synchronizing to real events: subjective audiovisual alignment scales with perceived auditory depth and speed of sound. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[16]  John R. Lindsay Smith,et al.  Learning to Pronounce Vowel Sounds in a Foreign Language using Acoustic Measurements of the Vocal Tract as Feedback in Real Time , 1998 .

[17]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .