Detection of Imperative and Declarative Question-Answer Pairs in Email Conversations

Question--answer pairs extracted from email threads are valuable in constructing summaries of the content of the thread, as well as in providing data for semantic-based assistance with email. Previous work dedicated to extracting question--answer pairs from email threads considers only questions in interrogative form. We extend the scope of question and answer detection and pairing to encompass questions in imperative and declarative forms, and to operate at sentence-level fidelity. Building on prior work, our methods are based on learned models over a set of features that include the content, context, and structure of email threads. On multiple benchmark email corpora, we show that our methods balance precision and recall in extracting question--answer pairs, while maintaining a modest computation time.

[1]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2]  Giuseppe Carenini,et al.  Summarizing email conversations with clue words , 2007, WWW '07.

[3]  Aaron Steinfeld,et al.  Evaluation of an integrated multi-task machine learning system with humans in the loop , 2007 .

[4]  Elizabeth Shriberg,et al.  Switchboard SWBD-DAMSL shallow-discourse-function annotation coders manual , 1997 .

[5]  T. Gelder,et al.  Mind as Motion: Explorations in the Dynamics of Cognition , 1995 .

[6]  Rajesh P. N. Rao,et al.  Hierarchical Learning of Navigational Behaviors in an Autonomous Robot using a Predictive Sparse Distributed Memory , 1998, Machine Learning.

[7]  Eric Brill,et al.  Automatic question answering using the web: Beyond the Factoid , 2006, Information Retrieval.

[8]  Julio Rosenblatt,et al.  DAMN: a distributed architecture for mobile navigation , 1997, J. Exp. Theor. Artif. Intell..

[9]  Lin Sun,et al.  Extracting Chinese question-answer pairs from online forums , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[10]  Lynne E. Parker,et al.  Distributed Algorithms for Multi-Robot Observation of Multiple Moving Targets , 2002, Auton. Robots.

[11]  Brian D. Davison,et al.  A classification-based approach to question answering in discussion boards , 2009, SIGIR.

[12]  Jacques Ferber,et al.  Action selection in an autonomous agent with a hierarchical distributed reactive planning architecture , 1998, AGENTS '98.

[13]  Diana Maynard,et al.  Motivating Intelligent E-mail in Business: An Investigation into Current Trends for E-mail Processing and Communication Research , 2009, 2009 IEEE Conference on Commerce and Enterprise Computing.

[14]  Gökhan Tür,et al.  Extracting question/answer pairs in multi-party meetings , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  William W. Cohen Learning Trees and Rules with Set-Valued Features , 1996, AAAI/IAAI, Vol. 1.

[16]  Owen Rambow,et al.  Using Question-Answer Pairs in Extractive Summarization of Email Conversations , 2007, CICLing.

[17]  Leslie Pack Kaelbling,et al.  Practical Reinforcement Learning in Continuous Spaces , 2000, ICML.

[18]  Stanley J. Rosenschein,et al.  A dynamical systems perspective on agent-environment interaction , 1996 .

[19]  Chin-Yew Lin,et al.  A Structural Support Vector Method for Extracting Contexts and Answers of Questions from Online Forums , 2009, EMNLP 2009.

[20]  Robert E. Kraut,et al.  Email overload at work: an analysis of factors associated with email strain , 2006, IEEE Engineering Management Review.

[21]  G. Carenini,et al.  A Publicly Available Annotated Corpus for Supervised Email Summarization , 2008 .

[22]  Nuno Seco,et al.  Design, Implementation and Evaluation of a New Semantic Similarity Metric Combining Features and Intrinsic Information Content , 2008, OTM Conferences.

[23]  Tom M. Mitchell,et al.  Learning to Classify Email into “Speech Acts” , 2004, EMNLP.

[24]  Rebecca E. Grinter,et al.  Quality versus quantity: e-mail-centric task management and its relation with overload , 2005 .

[25]  Jihie Kim,et al.  Profiling Student Interactions in Threaded Discussions with Speech Act Classifiers , 2007, AIED.

[26]  Ingrid Zukerman,et al.  An Empirical Study of Corpus-Based Response Automation Methods for an E-mail-Based Help-Desk Domain , 2009, CL.

[27]  Viii Supervisor Sonar-Based Real-World Mapping and Navigation , 2001 .

[28]  Elizabeth Shriberg,et al.  Meeting Recorder Project: Dialog Act Labeling Guide , 2004 .

[29]  Bradley R. Schmerl,et al.  Agent-assisted task management that reduces email overload , 2010, IUI '10.

[30]  Ronald C. Arkin,et al.  Motor Schema — Based Mobile Robot Navigation , 1989, Int. J. Robotics Res..

[31]  Marcel Kvassay,et al.  Email Social Network Extraction and Search , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[32]  Alan F. Smeaton,et al.  SeLeCT: a lexical cohesion based news story segmentation system , 2004, AI Commun..

[33]  Jacek Gwizdka,et al.  Everything through Email , 2007 .

[34]  Gary Geunbae Lee,et al.  Semi-supervised Speech Act Recognition in Emails and Forums , 2009, EMNLP.

[35]  Monica N. Nicolescu,et al.  A hierarchical architecture for behavior-based robots , 2002, AAMAS '02.

[36]  Oussama Khatib,et al.  Real-Time Obstacle Avoidance for Manipulators and Mobile Robots , 1985, Autonomous Robot Vehicles.

[37]  Yorick Wilks,et al.  FASIL Email Summarisation System , 2004, COLING.

[38]  Young-In Song,et al.  Finding question-answer pairs from online forums , 2008, SIGIR '08.

[39]  Kathleen McKeown,et al.  Detection of Question-Answer Pairs in Email Conversations , 2004, COLING.

[40]  Shlomo Hershkop,et al.  Automated social hierarchy detection through email network analysis , 2007, WebKDD/SNA-KDD '07.

[41]  Jukka Riekki,et al.  Reactive task execution by combining action maps , 1997, Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97.

[42]  Kian Hsiang Low,et al.  A hybrid mobile robot architecture with integrated planning and control , 2002, AAMAS '02.

[43]  Thiruvengadam Radhakrishnan,et al.  Comparing the Contribution of Syntactic and Semantic Features in Closed versus Open Domain Question Answering , 2007 .

[44]  Yiming Yang,et al.  The Enron Corpus: A New Dataset for Email Classi(cid:12)cation Research , 2004 .

[45]  Giuseppe Carenini,et al.  Summarizing Emails with Conversational Cohesion and Subjectivity , 2008, ACL.

[46]  José del R. Millán,et al.  Continuous-Action Q-Learning , 2002, Machine Learning.

[47]  Siegfried Handschuh,et al.  Classifying Action Items for Semantic Email , 2010, LREC.

[48]  Stephen Wan,et al.  Generating Overview Summaries of Ongoing Email Thread Discussions , 2004, COLING.

[49]  Claude F. Touzet,et al.  Neural reinforcement learning for behaviour synthesis , 1997, Robotics Auton. Syst..

[50]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[51]  Jen-Yuan Yeh,et al.  Email Thread Reassembly Using Similarity Matching , 2006, CEAS.

[52]  V. Braitenberg Vehicles, Experiments in Synthetic Psychology , 1984 .

[53]  Ani Nenkova,et al.  Facilitating email thread access by extractive summary generation , 2003, RANLP.

[54]  조동우 A Bayesian Method for Certainty Grids , 1989 .

[55]  William W. Cohen,et al.  Extracting Personal Names from Email: Applying Named Entity Recognition to Informal Text , 2005, HLT.

[56]  Jacek Gwizdka,et al.  TaskView: design and evaluation of a task-based email interface , 2002, CASCON.

[57]  Michael Freed,et al.  RADAR: A Personal Assistant that Learns to Reduce Email Overload , 2008, AAAI.