Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms

This paper proposes an improved approach for spoken lecture summarization, in which random walk is performed on a graph constructed with automatically extracted key terms and probabilistic latent semantic analysis (PLSA). Each sentence of the document is represented as a node of the graph and the edge between two nodes is weighted by the topical similarity between the two sentences. The basic idea is that sentences topically similar to more important sentences should be more important. In this way all sentences in the document can be jointly considered more globally rather than individually. Experimental results showed significant improvement in terms of ROUGE evaluation.

[1]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[2]  Jade Goldstein-Stewart,et al.  Summarizing text documents: sentence selection and evaluation metrics , 1999, SIGIR '99.

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[5]  Sadaoki Furui,et al.  Speech-to-text and speech-to-speech summarization of spontaneous speech , 2004, IEEE Transactions on Speech and Audio Processing.

[6]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[7]  Amy Nicole Langville,et al.  A Survey of Eigenvector Methods for Web Information Retrieval , 2005, SIAM Rev..

[8]  Lin-shan Lee,et al.  Spoken document understanding and organization , 2005, IEEE Signal Processing Magazine.

[9]  Lin-Shan Lee,et al.  Improved Spoken Document Summarization Using Probabilistic Latent Semantic Analysis (PLSA) , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Lin-Shan Lee,et al.  IMPROVED SUMMARIZATION OF CHINESE SPOKEN DOCUMENTS BY PROBABILISTIC LATENT SEMANTIC ANALYSIS (PLSA) WITH FURTHER ANALYSIS AND INTEGRATED SCORING , 2006, 2006 IEEE Spoken Language Technology Workshop.

[11]  Shih-Fu Chang,et al.  Video search reranking through random walk over document-level context graph , 2007, ACM Multimedia.

[12]  James R. Glass,et al.  Recent progress in the MIT spoken lecture processing project , 2007, INTERSPEECH.

[13]  Alexander I. Rudnicky,et al.  An extractive-summarization baseline for the automatic detection of noteworthy utterances in multi-party human-human dialog , 2008, 2008 IEEE Spoken Language Technology Workshop.

[14]  Yu Huang,et al.  Automatic key term extraction from spoken course lectures using branching entropy and prosodic/semantic features , 2010, 2010 IEEE Spoken Language Technology Workshop.

[15]  Lin-Shan Lee,et al.  An integrated framework for transcribing Mandarin-English code-mixed lectures with improved acoustic and language modeling , 2010, 2010 7th International Symposium on Chinese Spoken Language Processing.

[16]  Lin-Shan Lee,et al.  Improved spoken term detection with graph-based re-ranking in feature space , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Lin-Shan Lee,et al.  Bilingual Acoustic Model Adaptation by Unit Merging on Different Levels and Cross-Level Integration , 2011, INTERSPEECH.

[18]  Lin-Shan Lee,et al.  Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).