A mechanical voice system and its adaptive learning for the mimicry of human vocalization

A mechanical model of a human vocal system is being developed based on mechatronics technology. Although various ways of vocal sound production have been actively studied, mechanical construction is considered to advantageously realize natural vocalization with its fluid dynamics. The mechanical vocal system has several motors to manipulate the vocal tract and the vocal cords. It became possible to learn the relations between motor positions and the produced vocal sounds by an auditory feedback, and produce Japanese five vowels (a, i, u, e, o) by mimicking a human speech. In addition, the mechanical model could produce some consonant sounds by attaching a nasal cavity with the dynamic control. This paper introduces an adaptive learning algorithm for the mimicry of human vocalization, and presents a listening experiment of generated sounds for the evaluation.

[1]  Atsuo Takanishi,et al.  Development of a talking robot , 2000, Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000) (Cat. No.00CH37113).

[2]  Xavier Rodet,et al.  Synthesis of the singing voice , 1989 .

[3]  Shuji Hashimoto,et al.  Adaptive Control of a Vocal Cord and Vocal Tract for Computerized Mechanical Singing Instruments , 1996, ICMC.

[4]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[5]  Hideyuki Sawada,et al.  Vocalization Control of a Mechanical Vocal System under Auditory Feedback , 2002, J. Robotics Mechatronics.

[6]  Julius O. Smith,et al.  Viewpoints on the History of Digital Synthesis , 1991, ICMC.

[7]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[8]  Xavier Rodet,et al.  A Virtual Castrato (!?) , 1994, ICMC.

[9]  Masafumi Hagiwara,et al.  Fuzzy inference neural network , 1997, Neurocomputing.

[10]  Jiongtao Huang,et al.  A multi-winners self-organizing neural network , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.