Identifying the gist of conversational text: automatic keyword extraction and summarization

With the rapid development of communication technologies and mass storage techniques, many conversational texts have quickly emerged as significant information sources, such as emails, forums, meeting conversation transcripts, chat logs, microblogs, etc. The ability to identify the gist of these conversational texts enables us to quickly browse through the huge amount of data and obtain the essential information. On the other hand, the conversational text style also poses great challenges to the traditional language processing techniques, including redundancies, disfluencies, ill-formed sentence structure, high word error rates, and so forth. In this work, we focus on keyword extraction and summarization on meeting transcripts, and also explore summarizing the Twitter posts (tweets) as another domain of conversational text. We propose to extract keywords using a novel supervised framework that incorporates various knowledge sources: beyond the traditional widely used features (e.g., TF-IDF, position information), we introduce additional rich features including term specificity information, decision-making sentence related features, speaker and prominence based features, and features extracted from system generated summaries. We propose a feedback strategy to reinforce the impact of summary sentences on selecting effective keywords. We conduct analysis to evaluate feature effectiveness using different feature selection processes, and define various measurements to characterize the quality of summaries that can benefit the keyword extraction task. We also evaluate system performance using both human transcripts and different automatic speech recognizer (ASR) output (1-best and n-best), and show promising improved keyword extraction results using n-best ASR output over 1-best hypothesis. For extractive meeting summarization, we explore multiple meeting-specific characteristics. We propose to use topic labels and speaker-dependent characteristics (such as verboseness, gender, native language, role in the meeting) to improve extractive meeting summarization performance. These properties were incorporated in both unsupervised Maximum Marginal Relevance (MMR) approach and the supervised framework. We observe consistent improvements using our proposed approaches, on both human transcripts and ASR output, and using different evaluation metrics including ROUGE, Pyramid, and a DA-level F-measure score. Beyond extractive summarization, we propose to perform sentence compression on the extractive summary to improve its readability and make it more like an abstractive summary. Various automatic compression algorithms are investigated, including the integer linear programming (ILP) based approach with filler phrase detection, a noisy-channel approach using Markovization formulation of grammar rules, as well as the conditional random fields (CRF) based approach. The automatically compressed utterances are compared against both human compression and the abstractive summaries. We also evaluate the impact of using compressed utterances on summarization, and propose a fully automatic summarizer that generates compressed meeting summaries by combing the utterance compression module with an extractive summarization system. We perform exploratory summarization studies on another domain of conversational text – the Twitter posts, to help users quickly browse through any available topics. As an important first step, we propose a novel letter transformation approach to convert the nonstandard tokens in the tweets into standard English words. Different from the prior work, our approach requires neither pre-categorization nor human supervision. The approach models the generation process from the dictionary words to nonstandard tokens under a sequence labeling framework. We also explore summarizing the Twitter topics using the concept-based global optimization approach, and investigate the effect of both noisy nonstandard tokens andlinked web contents on the summarization performance.

[1]  Berlin Chen,et al.  A Risk Minimization Framework for Extractive Speech Summarization , 2010, ACL.

[2]  Giuseppe Carenini,et al.  The impact of ASR on abstractive vs. extractive meeting summaries , 2010, INTERSPEECH.

[3]  Tadashi Nomoto,et al.  Discriminative sentence compression with conditional random fields , 2007, Inf. Process. Manag..

[4]  Giuseppe Carenini,et al.  Summarizing Spoken and Written Conversations , 2008, EMNLP.

[5]  Eugene Charniak,et al.  Supervised and Unsupervised Learning for Sentence Compression , 2005, ACL.

[6]  Peter D. Turney Coherent Keyphrase Extraction via Web Mining , 2003, IJCAI.

[7]  Yang Liu,et al.  Automatic accent detection: effect of base units and boundary information , 2009, INTERSPEECH.

[8]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[9]  Johanna D. Moore,et al.  Evaluating Automatic Summaries of Meeting Recordings , 2005, IEEvaluation@ACL.

[10]  Michael Halliday,et al.  Some grammatical problems in scientific English , 1989 .

[11]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[12]  Pascale Fung,et al.  Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[14]  Dong Yang,et al.  Automatic Chinese Abbreviation Generation Using Conditional Random Field , 2009, NAACL.

[15]  Julia Hirschberg,et al.  Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization , 2005, INTERSPEECH.

[16]  B. Eisenstein,et al.  Feature selection via dynamic programming for text-independent speaker identification , 1978 .

[17]  Mirella Lapata,et al.  Sentence Compression as Tree Transduction , 2009, J. Artif. Intell. Res..

[18]  Xuanjing Huang,et al.  Using query expansion in graph-based approach for query-focused multi-document summarization , 2009, Inf. Process. Manag..

[19]  Sadaoki Furui,et al.  A new approach to automatic speech summarization , 2003, IEEE Trans. Multim..

[20]  Ani Nenkova,et al.  Evaluating Content Selection in Summarization: The Pyramid Method , 2004, NAACL.

[21]  Inderjeet Mani,et al.  SUMMAC: a text summarization evaluation , 2002, Natural Language Engineering.

[22]  Carl Gutwin,et al.  KEA: practical automatic keyphrase extraction , 1999, DL '99.

[23]  Carl Gutwin,et al.  Domain-Specific Keyphrase Extraction , 1999, IJCAI.

[24]  Hongyuan Zha,et al.  Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering , 2002, SIGIR '02.

[25]  Yang Liu,et al.  Normalization of text messages for text-to-speech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Xiaojun Wan,et al.  Manifold-Ranking Based Topic-Focused Multi-Document Summarization , 2007, IJCAI.

[27]  Hal Daumé Notes on CG and LM-BFGS Optimization of Logistic Regression , 2008 .

[28]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[29]  Fei Liu,et al.  Automatic keyword extraction for the meeting corpus using supervised approach and bigram expansion , 2008, 2008 IEEE Spoken Language Technology Workshop.

[30]  Peter Poller,et al.  Extrinsic summarization evaluation: A decision audit task , 2008, TSLP.

[31]  Mitsuru Ishizuka,et al.  Keyword extraction from a single document using word co-occurrence statistical information , 2004, Int. J. Artif. Intell. Tools.

[32]  Furu Wei,et al.  Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization , 2008, SIGIR '08.

[33]  Eric Fosler-Lussier,et al.  Discourse Segmentation of Multi-Party Conversation , 2003, ACL.

[34]  Dragomir R. Radev,et al.  Using Random Walks for Question-focused Sentence Retrieval , 2005, HLT.

[35]  Kirill Kireyev,et al.  Semantic-based Estimation of Term Informativeness , 2009, NAACL.

[36]  Jugal K. Kalita,et al.  Summarizing Microblogs Automatically , 2010, NAACL.

[37]  Thomas Hain,et al.  Recognition and understanding of meetings the AMI and AMIDA projects , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[38]  Tadashi Nomoto A Generic Sentence Trimmer with CRFs , 2008, ACL.

[39]  Fei Liu,et al.  Using n-best recognition output for extractive summarization and keyword extraction in meeting speech , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[40]  Michel Galley,et al.  A Skip-Chain Conditional Random Field for Ranking Meeting Utterances by Importance , 2006, EMNLP.

[41]  Klaus Zechner,et al.  Automatic Summarization of Open-Domain Multiparty Dialogues in Diverse Genres , 2002, CL.

[42]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[43]  Yang Liu,et al.  What Are Meeting Summaries? An Analysis of Human Extractive Summaries in Meeting Corpus , 2008, SIGDIAL Workshop.

[44]  Rada Mihalcea,et al.  Linguistically Motivated Features for Enhanced Back-of-the-Book Indexing , 2008, ACL.

[45]  Steve Renals,et al.  Dialogue act compression via pitch contour preservation , 2006, INTERSPEECH.

[46]  Yang Liu,et al.  Using Confusion Networks for Speech Summarization , 2010, NAACL.

[47]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[48]  Wessel Kraaij,et al.  Automatic Summarization of Meeting Data: A Feasibility Study , 2005, CLIN.

[49]  Jian Su,et al.  A Phrase-Based Statistical Model for SMS Text Normalization , 2006, ACL.

[50]  Tadashi Nomoto A Comparison of Model Free versus Model Intensive Approaches to Sentence Compression , 2009, EMNLP.

[51]  Fei Liu,et al.  A Supervised Framework for Keyword Extraction From Meeting Transcripts , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[52]  Jimmy J. Lin,et al.  Sentence Compression as a Component of a Multi-Document Summarization System , 2006 .

[53]  Feifan Liu,et al.  Unsupervised Approaches for Automatic Keyword Extraction Using Meeting Transcripts , 2009, NAACL.

[54]  Lonneke van der Plas,et al.  Automatic Keyword Extraction from Spoken Text. A Comparison of Two Lexical Resources: EDR and WordNet , 2004, LREC.

[55]  Dragomir R. Radev,et al.  LexRank: Graph-based Centrality as Salience in Text Summarization , 2004 .

[56]  Jugal K. Kalita,et al.  Experiments in Microblog Summarization , 2010, 2010 IEEE Second International Conference on Social Computing.

[57]  François Yvon,et al.  Normalizing SMS: are Two Metaphors Better than One ? , 2008, COLING.

[58]  Stanley Peters,et al.  Modelling and Detecting Decisions in Multi-party Dialogue , 2008, SIGDIAL Workshop.

[59]  Cédrick Fairon,et al.  A Hybrid Rule/Model-Based Finite-State Framework for Normalizing SMS Messages , 2010, ACL.

[60]  Suzanne Stevenson,et al.  C re at iv ity An Unsupervised Model for Text Message Normalization , 2009 .

[61]  Thorsten Brants,et al.  TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[62]  Kristina Toutanova,et al.  Pronunciation Modeling for Improved Spelling Correction , 2002, ACL.

[63]  Berlin Chen,et al.  Improved speech summarization with multiple-hypothesis representations and kullback-leibler divergence measures , 2009, INTERSPEECH.

[64]  Daniel Marcu,et al.  Summarization beyond sentence extraction: A probabilistic approach to sentence compression , 2002, Artif. Intell..

[65]  Giuseppe Carenini,et al.  Interpretation and Transformation for Abstracting Conversations , 2010, HLT-NAACL.

[66]  Diana Inkpen,et al.  Extracting semantically-coherent keyphrases from speech , 2004 .

[67]  Johanna D. Moore,et al.  What Decisions Have You Made: Automatic Decision Detection in Conversational Speech , 2007 .

[68]  Michael S. Bernstein,et al.  Twitinfo: aggregating and visualizing microblogs for event exploration , 2011, CHI.

[69]  Xiaojun Wan,et al.  Towards an Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction , 2007, ACL.

[70]  Jian Zhang,et al.  A comparative study on speech summarization of broadcast news and lecture speech , 2007, INTERSPEECH.

[71]  Elizabeth Shriberg,et al.  The ICSI Meeting Recorder Dialog Act (MRDA) Corpus , 2004, SIGDIAL Workshop.

[72]  Miles Osborne,et al.  The Edinburgh Twitter Corpus , 2010, HLT-NAACL 2010.

[73]  Yaakov HaCohen-Kerner,et al.  Automatic Extraction and Learning of Keyphrases from Scientific Articles , 2005, CICLing.

[74]  Fei Liu,et al.  From Extractive to Abstractive Meeting Summaries: Can It Be Done by Sentence Compression? , 2009, ACL.

[75]  Yaakov HaCohen-Kerner,et al.  Automatic Extraction of Keywords from Abstracts , 2003, KES.

[76]  William E. Moen,et al.  Automatic keyword extraction for learning object repositories , 2008, ASIST.

[77]  Shankar Kumar,et al.  Normalization of non-standard words , 2001, Comput. Speech Lang..

[78]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[79]  Yaakov HaCohen-Kerner,et al.  AUTOMATIC MACHINE LEARNING OF KEYPHRASE EXTRACTION FROM SHORT HTML DOCUMENTS WRITTEN IN HEBREW , 2007, Cybern. Syst..

[80]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[81]  Anette Hulth Reducing false positives by expert combination in automatic keyword indexing , 2003, RANLP.

[82]  Gerald Penn,et al.  Comparing the roles of textual, acoustic and spoken-language features on spontaneous-conversation summarization , 2006, NAACL.

[83]  Joel D. Martin,et al.  Extracting Keyphrases from Spoken Audio Documents , 2001, SIGIR Workshop: Information Retrieval Techniques for Speech Applications.

[84]  Julia Hirschberg,et al.  Summarizing Speech Without Text Using Hidden Markov Models , 2006, NAACL.

[85]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[86]  B. Magnini,et al.  Keyphrase Extraction for Summarization Purposes : The LAKE System at DUC-2004 , 2004 .

[87]  Gordon W. Paynter,et al.  Automatic extraction of document keyphrases for use in digital libraries: Evaluation and applications , 2002, J. Assoc. Inf. Sci. Technol..

[88]  Gerald Penn,et al.  A Critical Reassessment of Evaluation Baselines for Speech Summarization , 2008, ACL.

[89]  J. Clarke,et al.  Global inference for sentence compression : an integer linear programming approach , 2008, J. Artif. Intell. Res..

[90]  Anette Hulth,et al.  Improved Automatic Keyword Extraction Given More Linguistic Knowledge , 2003, EMNLP.

[91]  Dilek Z. Hakkani-Tür,et al.  Integrating prosodic features in extractive meeting summarization , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[92]  Mark T. Maybury,et al.  Advances in Automatic Text Summarization , 1999 .

[93]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[94]  Hui Lin,et al.  Evaluating the effectiveness of features and sampling in extractive meeting summarization , 2008, 2008 IEEE Spoken Language Technology Workshop.

[95]  Brendan T. O'Connor,et al.  TweetMotif: Exploratory Search and Topic Summarization for Twitter , 2010, ICWSM.

[96]  Andreas Stolcke,et al.  Recent innovations in speech-to-text transcription at SRI-ICSI-UW , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[97]  D. Inouye Multiple Post Microblog Summarization , 2010 .

[98]  Dilek Z. Hakkani-Tür,et al.  A global optimization framework for meeting summarization , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[99]  Animesh Mukherjee,et al.  Investigation and modeling of the structure of texting language , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[100]  Kathleen McKeown,et al.  Lexicalized Markov Grammars for Sentence Compression , 2007, NAACL.

[101]  Frank Rudzicz,et al.  Summarizing multiple spoken documents: finding evidence from untranscribed audio , 2009, ACL/IJCNLP.

[102]  Andreas Stolcke,et al.  The ICSI Meeting Corpus , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[103]  Johanna D. Moore,et al.  Incorporating Speaker and Discourse Features into Speech Summarization , 2006, NAACL.

[104]  Grzegorz Kondrak,et al.  Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion , 2007, NAACL.

[105]  Jamie Reilly,et al.  Formal Distinctiveness of High- and Low-Imageability Nouns: Analyses and Theoretical Implications , 2007, Cogn. Sci..

[106]  Eugene Charniak,et al.  Edit Detection and Parsing for Transcribed Speech , 2001, NAACL.

[107]  Dilek Z. Hakkani-Tür,et al.  Probabilistic model-based sentiment analysis of twitter messages , 2010, 2010 IEEE Spoken Language Technology Workshop.

[108]  Dilek Z. Hakkani-Tür,et al.  Clusterrank: a graph based method for meeting summarization , 2009, INTERSPEECH.