Paper information - Combining multiple subword representations for open-vocabulary spoken document retrieval

This paper describes subword-based approaches to open-vocabulary spoken document retrieval. First, we investigate the feasibility of subword units for spoken document retrieval and compare our previously proposed sub-phonetic segment units with typical subword units such as syllables, phonemes, and triphones. Next, we explore a linear combination of the retrieval scores obtained from multiple subword representations to improve retrieval performance. Experimental evaluation on open-vocabulary spoken document retrieval tasks demonstrates that the proposed sub-phonetic segment units are more effective than typical subword units, and that linearly combining multiple subword representations yields a consistent improvement in F-measure.
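The linear score combination and the F-measure mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the weights, the score normalization, or the decision threshold, so everything concrete here is an assumption made for illustration: the function names `combine_scores` and `f_measure`, the equal weights, the toy scores, and the threshold of 0.35 are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set


def combine_scores(
    scores_by_unit: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> Dict[str, float]:
    """Weighted sum of per-document retrieval scores across subword units.

    scores_by_unit maps a subword representation name (e.g. "phoneme")
    to a {document_id: score} table. Documents missing from one table
    simply contribute nothing for that representation.
    """
    combined: Dict[str, float] = {}
    for unit, doc_scores in scores_by_unit.items():
        weight = weights[unit]
        for doc_id, score in doc_scores.items():
            combined[doc_id] = combined.get(doc_id, 0.0) + weight * score
    return combined


def f_measure(retrieved: Set[str], relevant: Set[str]) -> float:
    """Balanced F-measure: harmonic mean of precision and recall."""
    hits = len(retrieved & relevant)
    if hits == 0:
        return 0.0
    precision = hits / len(retrieved)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Toy scores for three documents under two representations (assumed values).
scores_by_unit = {
    "phoneme": {"d1": 0.8, "d2": 0.4, "d3": 0.1},
    "syllable": {"d1": 0.6, "d2": 0.2, "d3": 0.7},
}
weights = {"phoneme": 0.5, "syllable": 0.5}  # assumed equal weights

combined = combine_scores(scores_by_unit, weights)

# Keep documents whose combined score exceeds an assumed threshold.
threshold = 0.35
retrieved = {doc_id for doc_id, score in combined.items() if score > threshold}
relevant = {"d1", "d3"}  # assumed ground truth for the toy query

single_unit = {d for d, s in scores_by_unit["phoneme"].items() if s > threshold}
f_single = f_measure(single_unit, relevant)
f_combined = f_measure(retrieved, relevant)
```

In this toy case the phoneme scores alone retrieve `d1` and `d2`, while the combined scores retrieve `d1` and `d3`, so the F-measure rises after combination. This only shows the mechanics of the technique and says nothing about the magnitude of the improvement reported in the paper.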
