论文信息 - Automatic Learning for Supporting Advanced Human-Machine Interfaces

Automatic Learning for Supporting Advanced Human-Machine Interfaces

This paper provides a novel algorithm for supporting automatic learning oriented to advanced human-machine interfaces. The algorithm introduces several points of innovativeness, based on complex similarity metrics involving several features of the whole learning process. A comprehensive experimental assessment and analysis of the proposed algorithm on both synthetic and real-life data sets confirms the benefits deriving from our proposal.

[1] Enzo Mumolo,et al. Automatic 3d virtual cloning of a speaking human face , 2010, SMVC '10.

[2] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[3] Keith Waters,et al. Computer facial animation , 1996 .

[4] Enzo Mumolo,et al. Towards articulatory Control of Talking Heads in Humanoid Robotics Using a Genetic-Fuzzy Imitation Learning Algorithm , 2007, Int. J. Humanoid Robotics.

[5] D H Klatt,et al. Review of text-to-speech conversion for English. , 1987, The Journal of the Acoustical Society of America.

[6] Alfredo Cuzzocrea,et al. Balancing accuracy and privacy of OLAP aggregations on data cubes , 2010, DOLAP '10.

[7] Hani Yehia,et al. Quantitative association of vocal-tract and facial behavior , 1998, Speech Commun..

[8] Emanuele Menegatti,et al. A genetic-fuzzy algorithm for the articulatory imitation of facial movements during vocalization of a humanoid robot , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[9] S. Chiba,et al. Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[10] Andrew Sekey,et al. An Objective Measure for Predicting Subjective Quality of Speech Coders , 1992, IEEE J. Sel. Areas Commun..

[11] Björn Granström,et al. SynFace—Speech-Driven Facial Animation for Virtual Speech-Reading Support , 2009, EURASIP J. Audio Speech Music. Process..

[12] Alfredo Cuzzocrea,et al. Efficient Fragmentation of Large XML Documents , 2007, DEXA.

[13] Irene Albrecht,et al. Automatic Generation of Non-Verbal Facial Expressions from Speech , 2002 .

[14] Abeer Alwan,et al. On the correlation between facial movements, tongue movements and speech acoustics , 2000, INTERSPEECH.

[15] A. Murat Tekalp,et al. Prosody-Driven Head-Gesture Animation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[16] Renato De Mori,et al. Computer Models of Speech Using Fuzzy Algorithms , 1983, Advanced Applications in Pattern Recognition.

[17] M. Stella,et al. Diphone synthesis using multipulse coding and a phase vecoder , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18] Leonardo Fogassi,et al. CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE Mirror Neurons and the Evolution of Embodied Language , 2022 .

[19] R. Brooks,et al. The cog project: building a humanoid robot , 1999 .

[20] Thomas P. Barnwell,et al. MCCREE AND BARNWELL MIXED EXCITAmON LPC VOCODER MODEL LPC SYNTHESIS FILTER 243 SYNTHESIZED SPEECH-PERIODIC PULSE TRAIN-1 PERIODIC POSITION JITTER PULSE 4 , 2004 .

[21] Atsuo Takanishi,et al. Three dimensional tongue with liquid sealing mechanism for improving resonance on an anthropomorphic talking robot , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22] Igor S. Pandzic,et al. Towards Facial Gestures Generation by Speech Signal Analysis Using HUGE Architecture , 2009, COST 2102 School.

[23] Jeffrey D. Ullman,et al. Big data: a research agenda , 2013, IDEAS '13.

[24] Keith Waters,et al. Computer Facial Animation, Second Edition , 1996 .

[25] Gunnar Fant,et al. Speech sounds and features , 1973 .

[26] Wonho Yang,et al. A modified bark spectral distortion measure which uses noise masking threshold , 1997, 1997 IEEE Workshop on Speech Coding for Telecommunications Proceedings. Back to Basics: Attacking Fundamental Problems in Speech Coding.