Opinion summarization on spontaneous conversations

HighlightsInvestigated opinion summarization in conversations.Annotated Switchboard conversations with sentiment, summary, and pronoun info.Compared graph-based and supervised summarization methods for opinion summarization.Evaluated summarization performance using both human transcripts and ASR output.Exploited pronoun coreference information for summarization of conversations. In this study we explore opinion summarization on spontaneous conversations using unsupervised and supervised approaches. We annotate a phone conversation corpus with reference extractive and abstractive summaries for a speaker's opinion on a given topic. We investigate two methods: the first is an unsupervised graph-based method, which incorporates topic and sentiment information, as well as sentence-to-sentence relations extracted based on dialogue structure; the second is a supervised method that casts the summarization problem as a classification problem. Furthermore, we investigate the use of pronoun resolution in this summarization task. We develop various features based on pronoun coreference and incorporate them in the supervised opinion summarization system. Our experimental results show that both the graph-based method and the supervised method outperform the baseline approach, and the pronoun related features can help to generate better summaries.

[1]  Claire Cardie,et al.  Multi-Perspective Question Answering Using the OpQA Corpus , 2005, HLT.

[2]  Ralf Krestel,et al.  {Using Knowledge-poor Coreference Resolution for Text Summarization} , 2003 .

[3]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[4]  Kam-Fai Wong,et al.  Extractive Summarization Using Supervised and Semi-Supervised Learning , 2008, COLING.

[5]  M. Maloof Learning When Data Sets are Imbalanced and When Costs are Unequal and Unknown , 2003 .

[6]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[7]  Pascale Fung,et al.  Improving lecture speech summarization using rhetorical information , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[8]  Michael J. Paul,et al.  Summarizing Contrastive Viewpoints in Opinionated Text , 2010, EMNLP.

[9]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[10]  Yang Liu,et al.  Improving supervised learning for meeting summarization using sampling and regression , 2010, Comput. Speech Lang..

[11]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[12]  Dilek Z. Hakkani-Tür,et al.  Clusterrank: a graph based method for meeting summarization , 2009, INTERSPEECH.

[13]  Josef Steinberger,et al.  Improving LSA-based Summarization with Anaphora Resolution , 2005, HLT.

[14]  Sadaoki Furui,et al.  Speech-to-text and speech-to-speech summarization of spontaneous speech , 2004, IEEE Transactions on Speech and Audio Processing.

[15]  Takaaki Hasegawa,et al.  Opinion Summarization with Integer Linear Programming Formulation for Sentence Extraction and Ordering , 2010, COLING.

[16]  Houfeng Wang,et al.  Entity-centric topic-oriented opinion summarization in twitter , 2012, KDD.

[17]  Yang Liu,et al.  What Are Meeting Summaries? An Analysis of Human Extractive Summaries in Meeting Corpus , 2008, SIGDIAL Workshop.

[18]  Jean Carletta,et al.  Extractive summarization of meeting recordings , 2005, INTERSPEECH.

[19]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[20]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[21]  Ellen Riloff,et al.  Creating Subjective and Objective Sentence Classifiers from Unannotated Texts , 2005, CICLing.

[22]  Hui Lin,et al.  Multi-document Summarization via Budgeted Maximization of Submodular Functions , 2010, NAACL.

[23]  Vincent Ng,et al.  Unsupervised Models for Coreference Resolution , 2008, EMNLP.

[24]  Lynette Hirschman,et al.  A Model-Theoretic Coreference Scoring Scheme , 1995, MUC.

[25]  Dong Wang,et al.  A Pilot Study of Opinion Summarization in Conversations , 2011, ACL.

[26]  Xuanjing Huang,et al.  Using query expansion in graph-based approach for query-focused multi-document summarization , 2009, Inf. Process. Manag..

[27]  Klaus Krippendorff,et al.  Answering the Call for a Standard Reliability Measure for Coding Data , 2007 .

[28]  Theresa Wilson Fine-grained subjectivity and sentiment analysis: recognizing the intensity, polarity, and attitudes of private states , 2008 .

[29]  Claire Cardie,et al.  Partially Supervised Coreference Resolution for Opinion Summarization through Structured Rule Learning , 2006, EMNLP.

[30]  Julia Hirschberg,et al.  Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization , 2005, INTERSPEECH.

[31]  Heeyoung Lee,et al.  Stanford’s Multi-Pass Sieve Coreference Resolution System at the CoNLL-2011 Shared Task , 2011, CoNLL Shared Task.

[32]  Robert J. Gaizauskas,et al.  Using Coreference Chains for Text Summarization , 1999, COREF@ACL.

[33]  Hwee Tou Ng,et al.  Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation , 2012, ACL.

[34]  Giuseppe Carenini,et al.  Summarizing Spoken and Written Conversations , 2008, EMNLP.

[35]  Florian Metze,et al.  Intra-Speaker Topic Modeling for Improved Multi-Party Meeting Summarization with Integrated Random Walk , 2012, HLT-NAACL.

[36]  Hsin-Min Wang,et al.  A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization , 2009, TALIP.

[37]  Jiawei Han,et al.  Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions , 2010, COLING.

[38]  Stephani Foraker,et al.  The role of prominence in pronoun resolution: Active versus passive representations , 2007 .

[39]  Ryan T. McDonald,et al.  Contrastive Summarization: An Experiment with Consumer Reviews , 2009, NAACL.

[40]  Vincent Ng,et al.  Examining the Role of Linguistic Knowledge Sources in the Automatic Identification and Classification of Reviews , 2006, ACL.

[41]  Konstantinos Koumpis,et al.  Automatic summarization of voicemail messages using lexical and prosodic features , 2005, TSLP.

[42]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[43]  Dilek Z. Hakkani-Tür,et al.  Integrating prosodic features in extractive meeting summarization , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[44]  Patricio Martínez-Barco,et al.  Going Beyond Traditional QA Systems: Challenges and Keys in Opinion Question Answering , 2010, COLING.

[45]  Giuseppe Carenini,et al.  Detecting subjectivity in multiparty speech , 2009, INTERSPEECH.

[46]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[47]  Claire Cardie,et al.  Coreference Resolution with Reconcile , 2010, ACL.

[48]  Hua Li,et al.  Document Summarization Using Conditional Random Fields , 2007, IJCAI.

[49]  Dilek Z. Hakkani-Tür,et al.  A global optimization framework for meeting summarization , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[50]  Theresa Wilson,et al.  Multimodal Subjectivity Analysis of Multiparty Conversation , 2008, EMNLP.

[51]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[52]  Gerald Penn,et al.  Summarization of spontaneous conversations , 2006, INTERSPEECH.

[53]  Klaus Zechner,et al.  Automatic Summarization of Open-Domain Multiparty Dialogues in Diverse Genres , 2002, CL.

[54]  Janyce Wiebe,et al.  Annotating Opinions in the World Press , 2003, SIGDIAL Workshop.

[55]  Andrew McCallum,et al.  Using Maximum Entropy for Text Classification , 1999 .