论文信息 - Speech labeling and the most frequent phrase extraction using same section in a presentation speech

Speech labeling and the most frequent phrase extraction using same section in a presentation speech

This paper discusses the possibility of speech labeling by utilizing same sections, such as the same words or same phrases that are repeated in a speech. The same sections are checked and detected in a presentation speech. For this purpose, a new efficient algorithm has been proposed, called Shift Continuous DP, because it is an extension of Continuous DP (CDP). Shift CDP realizes fast matching between arbitrary sections in the reference pattern and the input speech and enables extracting similar sections frame-synchronously. This algorithm is extended and applied to extract the repeated sections in a presentation speech and to identify the most frequent phrase in the talk. Experiments were conducted for presentation speech and the results showed Shift CDP was successful in detecting similar sections and identifying the most frequent phrase in the presentation.

Kazuyo Tanaka | Yoshiaki Itoh

[1] Yoshiaki Itoh. A matching algorithm between arbitrary sections of two speech data sets for speech retrieval , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[2] Kazuyo Tanaka,et al. A speech recognition method with a language-independent intermediate phonetic code , 2000, INTERSPEECH.

[3] Yoshiaki Itoh,et al. Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DP , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4] Yoshiaki Itoh,et al. A proposal for a new algorithm of reference interval-free continuous DP for real-time speech or text retrieval , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[5] Francine R. Chen,et al. The use of emphasis to automatically summarize a spoken discourse , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] Richard P. Lippmann,et al. Techniques for information retrieval from voice messages , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[7] Kazuyo Tanaka,et al. Automatic labeling and digesting for lecture speech utilizing repeated speech by shift CDP , 2001, INTERSPEECH.

[8] Kunio Kashino,et al. Quick audio retrieval using active search , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).