Talking Robot and the Analysis of Autonomous Voice Acquisition

A talking and singing robot that adaptively learns vocalization skills through auditory feedback learning is being developed. In vocalization, the vibration of the vocal cords generates a source sound, and the sound wave is then led to the vocal tract, which works as a resonance filter that determines the spectrum envelope. The robot consists of motor-controlled vocal organs such as vocal cords, a vocal tract, and a nasal cavity, and generates a natural voice that imitates human vocalization. The paper briefly introduces the construction of the vocal cords and vocal tract used to realize the talking robot, and then describes how the robot autonomously acquires vocalization skills through auditory feedback learning by listening to human talking and singing voices. The acquired voices were evaluated in listening experiments.
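
The source-filter view of vocalization described above can be illustrated with a small numerical sketch. This is not the robot's mechanism or the authors' method: it is a digital stand-in that assumes an impulse train as the vocal cord source and a single two-pole resonator as the vocal tract. The sampling rate, the 100 Hz pitch, the 700 Hz resonance, the 100 Hz bandwidth, and all function names are illustrative assumptions.

```python
import math

def impulse_train(f0, fs, n):
    """Idealized vocal cord source: one unit pulse per pitch period."""
    period = round(fs / f0)
    return [1.0 if i % period == 0 else 0.0 for i in range(n)]

def resonator(x, freq, bandwidth, fs):
    """Two-pole resonance filter standing in for one vocal tract formant."""
    r = math.exp(-math.pi * bandwidth / fs)   # pole radius set by bandwidth
    theta = 2.0 * math.pi * freq / fs         # pole angle set by centre frequency
    a1, a2 = 2.0 * r * math.cos(theta), -r * r
    y, y1, y2 = [], 0.0, 0.0
    for s in x:
        out = s + a1 * y1 + a2 * y2
        y.append(out)
        y1, y2 = out, y1
    return y

def magnitude_at(signal, freq, fs):
    """Magnitude of the signal's discrete-time Fourier transform at one frequency."""
    re = sum(s * math.cos(2.0 * math.pi * freq * i / fs) for i, s in enumerate(signal))
    im = sum(s * math.sin(2.0 * math.pi * freq * i / fs) for i, s in enumerate(signal))
    return math.hypot(re, im)

def strongest_harmonic(signal, f0, fs):
    """Harmonic of f0 (below the Nyquist frequency) with the largest magnitude."""
    harmonics = [k * f0 for k in range(1, int(fs / 2 / f0))]
    return max(harmonics, key=lambda f: magnitude_at(signal, f, fs))

FS = 8000   # sampling rate in Hz (assumed)
F0 = 100    # pitch of the source in Hz (assumed)

source = impulse_train(F0, FS, 1600)
voiced = resonator(source, freq=700, bandwidth=100, fs=FS)

# The source has equally strong harmonics; the filter imposes the envelope.
print(magnitude_at(source, 300, FS), magnitude_at(source, 700, FS))
print(strongest_harmonic(voiced, F0, FS))
```

In this sketch the pitch comes entirely from the source, while the position of the spectral peak comes entirely from the filter, which is the separation of roles the abstract attributes to the vocal cords and the vocal tract.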
