RECENT ADVANCES IN AUTOMATIC SPEECH SUMMARIZATION

Speech summarization technology, which extracts important information and removes irrelevant information from speech, is expected to play an important role in building speech archives and improving the efficiency of spoken document retrieval. However, speech summarization has a number of significant challenges that distinguish it from general text summarization. Fundamental problems with speech summarization include speech recognition errors, disfluencies, and difficulties of sentence segmentation. Typical speech summarization systems consist of speech recognition, sentence segmentation, sentence extraction, and sentence compaction components. Most of the research has focuses on sentence extraction, using LSA (latent semantic analysis), MMR (maximal marginal relevance), or feature-based approaches, among which no decisive method has yet been found. Proper sentence segmentation is also essential to achieve good summarization performance. How to objectively evaluate speech summarization results is an important issue. Several measures, including families of SumACCY and ROUGE measures, have been proposed, and correlation analyses between subjective and objective evaluation scores have been performed. Although these measures are useful for ranking various summarization methods, they do not correlate well with human evaluations, especially when spontaneous speech is targeted.

[1]  Johanna D. Moore,et al.  Evaluating Automatic Summaries of Meeting Recordings , 2005, IEEvaluation@ACL.

[2]  Julia Hirschberg,et al.  Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization , 2005, INTERSPEECH.

[3]  Chin-Yew Lin,et al.  Looking for a Few Good Metrics: ROUGE and its Evaluation , 2004 .

[4]  Sadaoki Furui,et al.  Advances in automatic speech summarization , 2001, INTERSPEECH.

[5]  Gökhan Tür,et al.  Prosody-based automatic segmentation of speech into sentences and topics , 2000, Speech Commun..

[6]  Jan Alexandersson,et al.  Towards Multilingual Protocol Generation For Spontaneous Speech Dialogues , 1998, INLG.

[7]  Jean Carletta,et al.  Extractive summarization of meeting recordings , 2005, INTERSPEECH.

[8]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[9]  Ellen M. Voorhees,et al.  Spoken Document Retrieval: 1998 Evaluation and Investigation of New Metrics , 1999 .

[10]  Heidi Christensen,et al.  Multi-stage compaction approach to broadcast news summarisation , 2005, INTERSPEECH.

[11]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[12]  Klaus Zechner,et al.  Automatic Summarization of Open-Domain Multiparty Dialogues in Diverse Genres , 2002, CL.

[13]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[14]  Dragutin Petkovic,et al.  Spoken Document Retrieval , 2000 .

[15]  Robin Valenza SUMMARISATION OF SPOKEN AUDIO THROUGH INFORMATION EXTRACTION , 1999 .

[16]  Sadaoki Furui,et al.  Evaluation method for automatic speech summarization , 2003, INTERSPEECH.

[17]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[18]  Heidi Christensen,et al.  From Text Summarisation to Style-Specific Summarisation for Broadcast News , 2004, ECIR.

[19]  Chiori Hori,et al.  Evaluation Measures Considering Sentence Concatenation for Automatic Summarization by Sentence or Word Extraction , 2004, Workshop On Text Summarization Branches Out.

[20]  Takaaki Hori,et al.  Speech summarization using weighted finite-state transducers , 2003, INTERSPEECH.

[21]  Mark T. Maybury,et al.  Advances in Automatic Text Summarization , 1999 .

[22]  Sadaoki Furui,et al.  TWO-STAGE AUTOMATIC SPEECH SUMMARIZATION BY SENTENCE EXTRACTION AND COMPACTION , 2003 .

[23]  Andreas Stolcke,et al.  The ICSI Meeting Corpus , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[24]  Heidi Christensen,et al.  Are extractive text summarisation techniques portable to broadcast news? , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[25]  Heidi Christensen,et al.  Exploring the style-technique interaction in extractive summarization of broadcast news , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[26]  Sadaoki Furui,et al.  Speech-to-text and speech-to-speech summarization of spontaneous speech , 2004, IEEE Transactions on Speech and Audio Processing.

[27]  Alexander H. Waibel,et al.  Minimizing Word Error Rate in Textual Summaries of Spoken Language , 2000, ANLP.

[28]  Klaus Zechner Spoken language condensation in the 21st century , 2003, INTERSPEECH.

[29]  Sadaoki Furui,et al.  Spontaneous speech recognition using a massively parallel decoder , 2004, INTERSPEECH.

[30]  Konstantinos Koumpis,et al.  Transcription and summarization of voicemail speech , 2000, INTERSPEECH.

[31]  Sadaoki Furui,et al.  Automatic Sentence Segmentation of Speech for Automatic Summarization , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[32]  Karel Jezek,et al.  Text Summarization and Singular Value Decomposition , 2004, ADVIS.

[33]  Lin-Shan Lee,et al.  Improved Spoken Document Summarization Using Probabilistic Latent Semantic Analysis (PLSA) , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[34]  Sadaoki Furui,et al.  Sentence-extractive automatic speech summarization and evaluation techniques , 2006, Speech Commun..

[35]  Heidi Christensen,et al.  Maximum entropy segmentation of broadcast news , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..