Speech-to-text and speech-to-speech summarization of spontaneous speech

This paper presents techniques for speech-to-text and speech-to-speech automatic summarization based on speech unit extraction and concatenation. For speech-to-text summarization, a two-stage method consisting of important sentence extraction followed by word-based sentence compaction is investigated. Sentence and word units that maximize the weighted sum of linguistic likelihood, amount of information, confidence measure, and grammatical likelihood of the concatenated units are extracted from the speech recognition results and concatenated to produce summaries. For speech-to-speech summarization, sentences, words, and between-filler units are investigated as the units to be extracted from the original speech. These methods are applied to the summarization of unrestricted-domain spontaneous presentations and evaluated with objective and subjective measures. The results confirm that the proposed methods are effective for spontaneous speech summarization.
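As a rough illustration of the sentence-extraction stage, the weighted-sum criterion can be sketched as follows. This is a minimal sketch under stated assumptions, not the paper's implementation: the `Sentence` fields, the default weights, the `ratio` parameter, and the simple top-k selection are all hypothetical, and the grammatical (concatenation) likelihood term and the word-level compaction stage are omitted for brevity.

```python
from dataclasses import dataclass


@dataclass
class Sentence:
    """One recognized sentence with per-sentence scores (hypothetical fields)."""
    text: str
    linguistic: float    # e.g. a length-normalized language-model log-likelihood
    significance: float  # e.g. an averaged amount-of-information score
    confidence: float    # e.g. an averaged recognizer confidence measure


def sentence_score(s, weights=(1.0, 1.0, 1.0)):
    """Weighted sum of the three per-sentence scores (weights are assumptions)."""
    w_ling, w_sig, w_conf = weights
    return w_ling * s.linguistic + w_sig * s.significance + w_conf * s.confidence


def extract_sentences(sentences, ratio=0.5, weights=(1.0, 1.0, 1.0)):
    """Keep the top-scoring fraction of sentences, preserving original order."""
    keep = max(1, round(len(sentences) * ratio))
    # Rank sentence indices by score, highest first.
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: sentence_score(sentences[i], weights),
        reverse=True,
    )
    # Restore the original ordering of the selected sentences.
    chosen = sorted(ranked[:keep])
    return [sentences[i] for i in chosen]


if __name__ == "__main__":
    # Made-up scores, purely for illustration.
    demo = [
        Sentence("a", -1.0, 0.2, 0.5),
        Sentence("b", -0.5, 0.9, 0.9),
        Sentence("c", -2.0, 0.1, 0.3),
        Sentence("d", -0.8, 0.7, 0.8),
    ]
    for s in extract_sentences(demo, ratio=0.5):
        print(s.text)
```

In a fuller system the selected sentences would then be passed to a word-level compaction step that also accounts for how well the extracted words concatenate; that step is not shown here.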
