Assessing Idiosyncrasies in a Bayesian Model of Speech Communication

Although speakers of one specific language share the same phoneme representations, their productions can differ. We propose to investigate the development of these differences in production , called idiosyncrasies, by using a Bayesian model of communication. Supposing that idiosyncrasies appear during the development of the motor system, we present two versions of the motor learning phase, both based on the guidance of an agent master: " a repetition model " where agents try to imitate the sounds produced by the master and " a communication model " where agents try to replicate the phonemes produced by the master. Our experimental results show that only the " communication model " provides production idiosyncrasies, suggesting that idiosyncrasies are a natural output of a motor learning process based on a communicative goal.

[1]  J. Schwartz,et al.  The Dispersion-Focalization Theory of vowel systems , 1997 .

[2]  J. Werker,et al.  Influences on infant speech processing: toward a new synthesis. , 1999, Annual review of psychology.

[3]  Jean-Luc Schwartz,et al.  Invariance and variability in the production of the height feature in French vowels , 2008, Speech Commun..

[4]  Ian S. Howard,et al.  Creating the cognitive form of phonological units: The speech sound correspondence problem in infancy could be solved by mirrored vocal interactions rather than by imitation , 2015, J. Phonetics.

[5]  Jean-Luc Schwartz,et al.  Adverse conditions improve distinguishability of auditory, motor, and perceptuo-motor theories of speech perception: An exploratory Bayesian modelling study , 2012 .

[6]  P. Kuhl Early language acquisition: cracking the speech code , 2004, Nature Reviews Neuroscience.

[7]  ierre,et al.  Bayesian Robot Programming , 2022 .

[8]  Jean-Luc Schwartz,et al.  A computational model of perceptuo-motor processing in speech perception: learning to imitate and categorize synthetic CV syllables , 2013, INTERSPEECH.

[9]  P. Kuhl,et al.  Phonetic learning as a pathway to language: new data and native language magnet theory expanded (NLM-e) , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[10]  Clément Moulin-Frier,et al.  COSMO ("Communicating about Objects using Sensory-Motor Operations"): A Bayesian modeling framework for studying speech communication and the emergence of phonological systems , 2015, J. Phonetics.

[11]  Shinji Maeda,et al.  Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model , 1990 .

[12]  Jean-Luc Schwartz,et al.  COSMO, a Bayesian computational model of speech communication: Assessing the role of sensory vs. motor knowledge in speech perception , 2015, 2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[13]  Julien Diard,et al.  Bayesian Action–Perception Computational Model: Interaction of Production and Recognition of Cursive Letters , 2011, PloS one.

[14]  Clément Moulin-Frier Rôle des relations perception-action dans la communication parlée et l'émergence des systèmes phonologiques : étude, modélisation computationnelle et simulations , 2011 .

[15]  Louis-Jean Boë,et al.  Auditory normalization of French vowels synthesized by an articulatory model simulating growth from birth to adulthood. , 2002, The Journal of the Acoustical Society of America.