Candidate generation for interactive Chinese speech recognition

Despite progress in automatic speech recognition, user involvement and assistance are still needed to achieve performance similar to human in real-world tasks. Though many interactive speech recognition systems using candidate selection are developed for English and Japanese, similar systems for Chinese face particular difficulties due to characteristics of the Chinese language, especially at the candidate generation stage. In this paper, a method is proposed for candidate generation in interactive Chinese speech recognition, which lists all competitive candidates in forms of Chinese characters in stead of words. The main idea is to align time-overlapped links in Chinese word lattice into clusters based on their phonetic similarity, and then obtain candidates by splitting words attached to links into Chinese characters. Experimental results demonstrated that the candidates generated by the proposed method can lead to high performance in interactive Chinese speech recognition.

[1]  Brian D. O. Anderson,et al.  Localization in sparse networks using sweeps , 2006, MobiCom '06.

[2]  B. R. Badrinath,et al.  Ad hoc positioning system (APS) , 2001, GLOBECOM'01. IEEE Global Telecommunications Conference (Cat. No.01CH37270).

[3]  Brian D. O. Anderson,et al.  A Theory of Network Localization , 2006, IEEE Transactions on Mobile Computing.

[4]  Saurabh Bagchi,et al.  Location Estimation in Ad Hoc Networks with Directional Antennas , 2005, 25th IEEE International Conference on Distributed Computing Systems (ICDCS'05).

[5]  Andreas Stolcke,et al.  Finding consensus in speech recognition: word error minimization and other applications of confusion networks , 2000, Comput. Speech Lang..

[6]  B. Suhm,et al.  Empirical evaluation of interactive multimodal error correction , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[7]  Yuan Li,et al.  Research challenges and applications for underwater sensor networking , 2006, IEEE Wireless Communications and Networking Conference, 2006. WCNC 2006..

[8]  Gang Zhou,et al.  Impact of radio irregularity on wireless sensor networks , 2004, MobiSys '04.

[9]  Mani B. Srivastava,et al.  The bits and flops of the n-hop multilateration primitive for node localization problems , 2002, WSNA '02.

[10]  A. Savvides,et al.  Network localization in partially localizable networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[11]  Lidia Mangu,et al.  Finding consensus in speech recognition , 2000 .

[12]  Jerry D. Gibson,et al.  Mobile Communications Handbook , 1999 .

[13]  Dario Pompili,et al.  Underwater acoustic sensor networks: research challenges , 2005, Ad Hoc Networks.

[14]  Jian Xue,et al.  Improved confusion network algorithm and shortest path search from word lattice , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[15]  Joseph Rice,et al.  Seaweb Acoustic Communication and Navigation Networks , 2005 .

[16]  Masataka Goto,et al.  Speech repair: quick error correction just by using selection operation for speech input interfaces , 2005, INTERSPEECH.

[17]  Mani B. Srivastava,et al.  Dynamic fine-grained localization in Ad-Hoc networks of sensors , 2001, MobiCom '01.

[18]  Daniel B. Horn,et al.  Patterns of entry and correction in large vocabulary continuous speech recognition systems , 1999, CHI '99.