论文信息 - Multilayer subword units for open-vocabulary spoken document retrieval

Multilayer subword units for open-vocabulary spoken document retrieval

This paper describes the application of subword units in an effort of improving open-vocabulary spoken document retrieval performance in the case of highly corrupted recognition output. This paper presents the developed open-vocabulary spoken document retrieval system including the newly proposed subphonetic segment unit and combining multilayer subword units. Our experiments on Japanese spoken documents show that using the proposed subphonetic segment unit can improve retrieval performance, high precision and recall, and a combination of multilayer subword units is also effective.

Shi-wook Lee | Kazuyo Tanaka | Yoshiaki Itoh

[1] Yoshiaki Itoh,et al. Speech data retrieval system constructed on a universal phonetic code domain , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[2] Ellen M. Voorhees,et al. Overview of the seventh text retrieval conference (trec-7) [on-line] , 1999 .

[3] Steve Renals,et al. Retrieval of broadcast news documents with the THISL system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[4] Shuichi Itahashi,et al. JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research , 1999 .

[5] Shi-wook Lee,et al. Robust Spoken Document Retrieval Based on Multilingual Subphonetic Segment Recognition , 2004, ICEIS.

[6] Kazuyo Tanaka,et al. Automatic labeling and digesting for lecture speech utilizing repeated speech by shift CDP , 2001, INTERSPEECH.

[7] Nobuaki Minematsu,et al. Sharable software repository for Japanese large vocabulary continuous speech recognition , 1998, ICSLP.

[8] Peter Schäuble,et al. New techniques for open-vocabulary spoken document retrieval , 1998, SIGIR '98.