Using the World Wide Web for Learning New Words in Continuous Speech Recognition Tasks: Two Case Studies

In this paper, Web-based lexicon augmentation is addressed: using various strategies for Out-Of-Vocabulary (OOV) word learning, we discuss their relevance in two types of applications: broadcast news, or topic-specific corpora transcription. The Web-based OOV word learning is first tested on the French news corpus ESTER; the same approach is applied to a very specific corpus concerning surgical interventions. These tests allow us to assess the value of the OOV word learning methods proposed, emphasizing their strengths and weaknesses, regarding the particular type of application considered.

[1]  Guillaume Gravier,et al.  Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News , 2004, LREC.

[2]  Roeland Ordelman,et al.  Transcription of conference room meetings: an investigation , 2005, INTERSPEECH.

[3]  Jean-Luc Gauvain,et al.  Developments in continuous speech dictation using the ARPA WSJ task , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[4]  G. Gravier,et al.  STER evaluation campaign of rich transcription of French broadcast news , 2011 .

[5]  Marcello Federico,et al.  Lexicon adaptation for broadcast news transcription , 2001 .

[6]  F. Béchet LIA―PHON: Un système complet de phonétisation de textes , 2001 .

[7]  Alex Waibel,et al.  New developments in automatic meeting transcription , 2000, INTERSPEECH.

[8]  Alexandre Allauzen,et al.  Open vocabulary ASR for audiovisual document indexation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[9]  Paul Taylor,et al.  The architecture of the Festival speech synthesis system , 1998, SSW.

[10]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[11]  Hui Lin,et al.  OOV detection by joint word/phone lattice alignment , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[12]  Georges Linarès,et al.  On-demand new word learning using world wide web , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Hynek Hermansky,et al.  Combination of strongly and weakly constrained recognizers for reliable detection of OOVS , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Dilek Z. Hakkani-Tür,et al.  Spoken language understanding , 2008, IEEE Signal Processing Magazine.