Unsupervised domain adaptation for spoken document summarization with structured support vector machine

Supervised approaches can learn a spoken document summarizer generating high-quality summaries using a set of training examples matched to the domain of target documents. However, preparing a sufficient number of in-domain training examples is expensive. In this paper we propose an approach for unsupervised domain adaptation for spoken document summarization, so no in-domain training examples are needed. A summarizer is first learned from a set of out-of-domain training examples by a supervised summarization approach based on structured support vector machine, and this summarizer is used to generate a set of initial summaries for the target spoken documents. The target documents and their initial machine-generated summaries then serve as extra training examples for learning a new summarizer, which further updates the summaries of the target spoken documents. This process is continued iteratively to incrementally improve the summarizer for the target spoken documents. Moreover, extra approaches transforming the feature representations based on the data distribution in the target domain and augmenting the representations with an extra set of domain-specific features are also proposed. Encouraging results were obtained in summarizing Mandarin-English code-switching course lectures using training examples from Mandarin broadcast news.

[1]  Pascale Fung,et al.  Extractive speech summarization by active learning , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[2]  Sadaoki Furui,et al.  Speech-to-text and speech-to-speech summarization of spontaneous speech , 2004, IEEE Transactions on Speech and Audio Processing.

[3]  Dilek Z. Hakkani-Tür,et al.  Clusterrank: a graph based method for meeting summarization , 2009, INTERSPEECH.

[4]  Florian Metze,et al.  Integrating Intra-Speaker Topic Modeling and Temporal-Based Inter-Speaker Topic Modeling in Random Walk for Improved Multi-Party Meeting Summarization , 2012, INTERSPEECH.

[5]  Julia Hirschberg,et al.  Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization , 2005, INTERSPEECH.

[6]  Hitoshi Nishikawa,et al.  Domain adaptation with augmented space method for multi-domain contact center dialogue summarization , 2012, MLSLP.

[7]  Lin-Shan Lee,et al.  Supervised Spoken Document Summarization jointly Considering Utterance Importance and Redundancy by Structured Support Vector Machine , 2012, INTERSPEECH.

[8]  Yang Liu,et al.  Improving supervised learning for meeting summarization using sampling and regression , 2010, Comput. Speech Lang..

[9]  Geoffrey Zweig,et al.  LATTICE-BASED UNSUPERVISED MLLR FOR SPEAKER ADAPTATION , 2000 .

[10]  Pascale Fung,et al.  Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Mark J. F. Gales,et al.  Constrained discriminative mapping transforms for unsupervised speaker adaptation , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Yu Huang,et al.  Unsupervised two-stage keyword extraction from spoken documents by topic coherence and support vector machine , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Pascale Fung,et al.  Speech Summarization Without Lexical Features for Mandarin Broadcast News , 2007, NAACL.

[14]  Yu Huang,et al.  Spoken Lecture Summarization by Random Walk over a Graph Constructed with Automatically Extracted Key Terms , 2011, INTERSPEECH.

[15]  Hsin-Min Wang,et al.  A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization , 2009, TALIP.

[16]  Wessel Kraaij,et al.  Automatic Summarization of Meeting Data: A Feasibility Study , 2005, CLIN.

[17]  Ryan T. McDonald A Study of Global Inference Algorithms in Multi-document Summarization , 2007, ECIR.

[18]  Berlin Chen,et al.  Leveraging evaluation metric-related training criteria for speech summarization , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  Berlin Chen,et al.  Hybrids of supervised and unsupervised models for extractive speech summarization , 2009, INTERSPEECH.

[20]  Gokhan Tur,et al.  Spoken Language Understanding: Systems for Extracting Semantic Information from Speech , 2011 .

[21]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[22]  Giuseppe Carenini,et al.  Domain Adaptation to Summarize Human Conversations , 2010 .

[23]  Jian Zhang,et al.  A comparative study on speech summarization of broadcast news and lecture speech , 2007, INTERSPEECH.

[24]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[25]  Dilek Z. Hakkani-Tür,et al.  Integrating prosodic features in extractive meeting summarization , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[26]  Lin-Shan Lee,et al.  Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[27]  Dilek Z. Hakkani-Tür,et al.  A global optimization framework for meeting summarization , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[28]  Frank Seide,et al.  Unsupervised speaker adaptation for telephone call transcription , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[29]  Michel Galley,et al.  A Skip-Chain Conditional Random Field for Ranking Meeting Utterances by Importance , 2006, EMNLP.

[30]  Anna Margolis,et al.  A Literature Review of Domain Adaptation with Unlabeled Data , 2011 .

[31]  Julia Hirschberg,et al.  Summarizing Speech Without Text Using Hidden Markov Models , 2006, NAACL.

[32]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[33]  Yang Liu,et al.  Using corpus and knowledge-based similarity measure in Maximum Marginal Relevance for meeting summarization , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[34]  Hui Lin,et al.  Graph-based submodular selection for extractive summarization , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[35]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[36]  Hui Lin,et al.  Semi-supervised extractive speech summarization via co-training algorithm , 2010, INTERSPEECH.