Command-based voice teleoperation of a mobile robot via a human-robot interface

Verbal communication is the most natural way of human–robot interaction. Such an interaction is usually achieved by means of a human-robot interface (HRI). In this paper, a HRI is presented to teleoperate a robotic platform via the user’s voice. Hence, a speech recognition system is necessary. In this work, a user-dependent acoustic model for Spanish speakers has been developed to teleoperate a robot with a set of commands. Experimental results have been successful, both in terms of a high recognition rate and the navigation of the robot under the control of the user’s voice.

[1]  Viii Supervisor Sonar-Based Real-World Mapping and Navigation , 2001 .

[2]  J. Landry,et al.  Emotional Design: Why We Love (or Hate) Everyday Things (Book) , 2004 .

[3]  Irina Illina,et al.  The automatic news transcription system: ANTS, some real time experiments , 2004, INTERSPEECH.

[4]  Peter I. Corke,et al.  A new framework for force feedback teleoperation of robotic vehicles based on optical flow , 2009, 2009 IEEE International Conference on Robotics and Automation.

[5]  Jakob Nielsen,et al.  Designing web usability , 1999 .

[6]  Jakob Nielsen,et al.  Usability inspection methods , 1994, CHI 95 Conference Companion.

[7]  A. Bandera,et al.  Hierarchical planning in a mobile robot for map learning and navigation , 2003 .

[8]  Henry Hexmoor,et al.  Agent Autonomy , 2003, Multiagent Systems, Artificial Societies, and Simulated Organizations.

[9]  Toshio Tsuji,et al.  A human-assisting manipulator teleoperated by EMG signals and arm motions , 2003, IEEE Trans. Robotics Autom..

[10]  Joel Spolsky,et al.  User Interface Design for Programmers , 2001, Apress.

[11]  Raúl Marín,et al.  Automatic speech recognition to teleoperate a robot via Web , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Jakob Nielsen,et al.  Chapter 4 – The Usability Engineering Lifecycle , 1993 .

[13]  Tatsuya Kawahara,et al.  Recent Development of Open-Source Speech Recognition Engine Julius , 2009 .

[14]  C. Galindo,et al.  Control Architecture for Human–Robot Integration: Application to a Robotic Wheelchair , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[15]  Javier Macias-Guarasa,et al.  Voice Command Generation for Teleoperated Robot Systems , 1998 .

[16]  Anders Green,et al.  Social and collaborative aspects of interaction with a service robot , 2003, Robotics Auton. Syst..

[17]  Christophe Cerisara,et al.  Automatic discovery of topics and acoustic morphemes from speech , 2009, Comput. Speech Lang..

[18]  Oussama Khatib,et al.  Autonomous robotic systems , 1998 .

[19]  Ahmad Akbari,et al.  Sub-band weighted projection measure for sub-band speech recognition in noise , 2006 .

[20]  Susan L. Hura,et al.  Voice User Interfaces , 2022, Encyclopedia of Big Data.

[21]  Li Bo,et al.  Speaker recognition based on dynamic MFCC parameters , 2009, 2009 International Conference on Image Analysis and Signal Processing.

[22]  Yongwon Jeong,et al.  Robust speaker adaptation based on parallel factor analysis of training models , 2011 .

[23]  Jeffrey Johnson,et al.  GUI Bloopers: Don'ts and Do's for Software Developers and Web Designers , 2000 .

[24]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[25]  Yang Dong,et al.  Accent analysis for Mandarin large vocabulary continuous speech recognition , 2008 .

[26]  Steve Young,et al.  The HTK book , 1995 .

[27]  Tomomasa Sato,et al.  Human motion tracking system based on skeleton and surface integration model using pressure sensors distribution bed , 2000, Proceedings Workshop on Human Motion.

[28]  Bruce Tognazzini,et al.  Tog on Interface , 1992 .

[29]  Ali Sekmen,et al.  Human–robot interaction via voice-controllable intelligent user interface , 2007, Robotica.

[30]  Mei-Yuh Hwang,et al.  The SPHINX speech recognition system , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[31]  T. Saitoh,et al.  Voice controlled intelligent wheelchair , 2007, SICE Annual Conference 2007.

[32]  Qiang Huang,et al.  A teleoperation system for a humanoid robot with multiple information feedback and operational modes , 2005, 2005 IEEE International Conference on Robotics and Biomimetics - ROBIO.

[33]  Takayuki Kanda,et al.  Teleoperation of Multiple Social Robots , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[34]  Heungkyu Lee,et al.  Competing models-based text-prompted speaker independent verification algorithm , 2006, Speech Commun..

[35]  Fabien Courreges,et al.  Ergonomic mouse based interface for 3D orientation control of a tele-sonography robot , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[36]  Lee-Min Lee Adaptation of hidden Markov models for half frame rate observations , 2010 .

[37]  Mirjam Sepesy Maucec,et al.  A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition , 2002, INTERSPEECH.

[38]  Jakob Nielsen,et al.  Designing Web Usability: The Practice of Simplicity , 1999 .

[39]  José Maria Azorín,et al.  Steps in the development of a robotic scrub nurse , 2012, Robotics Auton. Syst..

[40]  O.P. Mayorga,et al.  Gaussian Components Optimization for a Robot Controlled by Speech Commands in Mexican Spanish , 2007, Electronics, Robotics and Automotive Mechanics Conference (CERMA 2007).

[41]  Johanna D. Moore,et al.  Proceedings of Interspeech 2008 , 2008 .

[42]  Mark T. Bolas Designing the user in user interfaces , 2014, UIST.

[43]  Raja Parasuraman,et al.  Human-Automation Interaction , 2005 .

[44]  D. Norman Emotional design : why we love (or hate) everyday things , 2004 .

[45]  Tanel Alumäe,et al.  Large vocabulary continuous speech recognition for estonian using morpheme classes , 2004, INTERSPEECH.

[46]  Jadav Das,et al.  Incorporating verbal feedback into a robot-assisted rehabilitation system , 2011, Robotica.

[47]  Kiyotaka Izumi,et al.  A particle-swarm-optimized fuzzy-neural network for voice-controlled robot systems , 2005, IEEE Transactions on Industrial Electronics.

[48]  Kiyohiro Shikano,et al.  Julius - an open source real-time large vocabulary recognition engine , 2001, INTERSPEECH.

[49]  Roland Siegwart,et al.  On developing a voice-enabled interface for interactive tour-guide robots , 2003, Adv. Robotics.