Supervised Spoken Document Summarization jointly Considering Utterance Importance and Redundancy by Structured Support Vector Machine

In extractive spoken document summarization, it is desired to select important utterances from documents to construct the summary while avoiding redundancy among the selected utterances, but it is not easy to balance the two different goals. In this paper, a supervised spoken document summarization approach is proposed based on structured support vector machine (SVM), in which the above two goals are jointly considered during training. A set of parameters not only describing the ways to evaluate the importance of the utterances but minimizing the redundancy is directly learned from the training set. Encourag-ing results were obtained on a lecture corpus in the preliminary experiments.

[1]  Thorsten Joachims,et al.  Large-Margin Learning of Submodular Summarization Methods , 2011, ArXiv.

[2]  Lin-Shan Lee,et al.  Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Yang Liu,et al.  Improving supervised learning for meeting summarization using sampling and regression , 2010, Comput. Speech Lang..

[4]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[5]  Yong Yu,et al.  Enhancing diversity, coverage and balance for summarization through structure learning , 2009, WWW '09.

[6]  Jian Zhang,et al.  A comparative study on speech summarization of broadcast news and lecture speech , 2007, INTERSPEECH.

[7]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[8]  Dan Klein,et al.  Jointly Learning to Extract and Compress , 2011, ACL.

[9]  Dilek Z. Hakkani-Tür,et al.  Clusterrank: a graph based method for meeting summarization , 2009, INTERSPEECH.

[10]  Berlin Chen,et al.  Leveraging evaluation metric-related training criteria for speech summarization , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Dilek Z. Hakkani-Tür,et al.  A global optimization framework for meeting summarization , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12]  Ryan T. McDonald A Study of Global Inference Algorithms in Multi-document Summarization , 2007, ECIR.

[13]  Dilek Z. Hakkani-Tür,et al.  Integrating prosodic features in extractive meeting summarization , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[14]  Julia Hirschberg,et al.  Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization , 2005, INTERSPEECH.

[15]  Yang Liu,et al.  Using corpus and knowledge-based similarity measure in Maximum Marginal Relevance for meeting summarization , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.