A DIVA-based method for the phonation of the Chinese Diphthongs

Mainly used to simulate and describe the brain involved in speech production and speech comprehension relevant functional areas, the DIVA model simulates channel activity by a neural network model of adaptive words, syllables or phonemes, and it depends on the background of the language are 29 basic English phonemes. Since that the pronunciation of Chinese and English is very different, the basic phoneme of more than 70, while the process is completely different from the brain mechanism. If you want to “read” the Chinese brain thinking process, the need for specialized research on China's background to adapt to the DIVA model. This paper based on the DIVA model can simulate and describe the function of language generation and acquisition of relevant brain areas. By means of linear prediction and cepstrum Mel combination (LPMCC), the double vowel resonance peak frequency was extracted by fitting polynomial, and the DIVA 9 typical Chinese double vowels were simulated. The simulation results show that the model is very effective in distinguishing the vowels of Chinese vowels and the sounds of speech by adjusting the function of the resonance peaks and the corresponding parameters of the vocal organs of the vocal organs. This work provides a great theoretical and practical basis for the widespread use of intelligent robots in society.

[1]  Zhou Qiang Intra-chunk relationship analyse for Chinese base chunk labeling systems , 2009 .

[2]  Developmental aspects of American English diphthong trajectories in the formant space , 2013 .

[3]  Frank H Guenther,et al.  The DIVA model: A neural theory of speech acquisition and production , 2011, Language and cognitive processes.

[4]  Shaobai Zhang,et al.  An application of cerebellar control model for prehension movements , 2012, Neural Computing and Applications.

[5]  Yuan Yujin,et al.  Research of speaker recognition based on combination of LPCC and MFCC , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[6]  Long Hong,et al.  Image medium similarity measure and its applications , 2014, Neurocomputing.

[7]  F. Guenther,et al.  The perceptual magnet effect as an emergent property of neural map formation. , 1996, The Journal of the Acoustical Society of America.

[8]  P. Kuhl,et al.  Influences of phonetic identification and category goodness on American listeners' perception of /r/ and /l/. , 1996, The Journal of the Acoustical Society of America.

[9]  K. Stevens,et al.  Linguistic experience alters phonetic perception in infants by 6 months of age. , 1992, Science.

[10]  Shaobai Zhang,et al.  Research on the mechanism for phonating stressed English syllables based on DIVA model , 2015, Neurocomputing.

[11]  Zhang Shao-bai,et al.  Research on the mechanism for phonating stressed English syllables based on DIVA model , 2015 .

[12]  Jason A. Tourville,et al.  Neural mechanisms underlying auditory feedback control of speech , 2008, NeuroImage.

[13]  Zhang Rui-lin Speaker recognition method using MFCC and LPCC features , 2009 .

[14]  Frank H. Guenther,et al.  A neural network model of speech acquisition and motor equivalent speech production , 2004, Biological Cybernetics.

[15]  P. Birkholz,et al.  Phonemic, sensory, and motor representations in an action-based neurocomputational model of speech production , 2010 .

[16]  Ta-Wen Kuan,et al.  Butterfly framework of LPCC ASIC design for friendly HMI in speaker identification , 2013, 2013 1st International Conference on Orange Technologies (ICOT).

[17]  Satrajit S. Ghosh,et al.  Neural modeling and imaging of the cortical interactions underlying syllable production , 2006, Brain and Language.

[18]  Zhang Shao-bai,et al.  A new constructing method of cerebellum model applying to DIVA model , 2009, 2009 Chinese Control and Decision Conference.

[19]  Alexander Kain,et al.  Estimating phoneme formant targets and coarticulation parameters of conversational and clear speech , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  Alfonso Nieto-Castanon,et al.  A modeling investigation of articulatory variability and acoustic stability during American English /r/ production. , 2005, The Journal of the Acoustical Society of America.

[21]  Xin Liu,et al.  Research on the Chinese vowels pronunciation method based on DIVA model , 2014, SIMULTECH 2014.

[22]  Teuvo Kohonen,et al.  Self-organizing neural projections , 2006, Neural Networks.

[23]  Frank H. Guenther,et al.  Unstable or Insufficiently Activated Internal Models and Feedback-Biased Motor Control as Sources of Dysfluency: A Theoretical Model of Stuttering , 2004 .

[24]  Ningning Zhou,et al.  A new control model for the temporal coordination of arm transport and hand preshape applying to two-dimensional space , 2015, Neurocomputing.

[25]  F. Guenther Cortical interactions underlying the production of speech sounds. , 2006, Journal of communication disorders.

[26]  Xiaogang,et al.  A New Cerebellar Control Scheme and Simulation Based on Kalman Estimator , 2009 .