Intelligent Email: Aiding Users with AI

Email occupies a central role in the modern workplace. This has led to a vast increase in the number of email messages that users are expected to handle daily. Furthermore, email is no longer simply a tool for asynchronous online communication-email is now used for task management, personal archiving, as well both synchronous and asynchronous online communication (Whittaker and Sidner 1996). This explosion can lead to .. email overload"-many users are overwhelmed by the large quantity of information in their mailboxes. In the human--computer interaction community, there has been much research on tackling email overload. Recently, similar efforts have emerged in the artificial intelligence (AI) and machine learning communities to form an area of research known as intelligent email. In this paper, we take a user-oriented approach to applying AI to email. We identify enhancements to email user interfaces and employ machine learning techniques to support these changes. We focus on three tasks-summary keyword generation, reply prediction and attachment prediction-and summarize recent work in these areas.

[1]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[2]  ChengXiang Zhai,et al.  Instance Weighting for Domain Adaptation in NLP , 2007, ACL.

[3]  Carman Neustaedter,et al.  Understanding sequence and reply relationships within email conversations: a mixed-model visualization , 2003, CHI '03.

[4]  Thomas L. Griffiths,et al.  Integrating Topics and Syntax , 2004, NIPS.

[5]  Alon Lavie,et al.  Increasing the Coherence of Spoken Dialogue Summaries by Cross-Speaker Information Linking , 2001 .

[6]  Jacek Gwizdka,et al.  Individual differences and task-based user interface evaluation: a case study of pending tasks in email , 2004, Interact. Comput..

[7]  John Blitzer,et al.  Intelligent email: reply and attachment prediction , 2008, IUI '08.

[8]  Olle Bälter,et al.  Bifrost inbox organizer: giving users control over the inbox , 2002, NordiCHI '02.

[9]  Stefien Bickel,et al.  ECML-PKDD Discovery Challenge 2006 Overview , 2006 .

[10]  D. Sculley,et al.  Online Active Learning Methods for Fast Label-Efficient Spam Filtering , 2007, CEAS.

[11]  Terry R. Payne,et al.  Interface Agents That Learn an Investigation of Learning Issues in a Mail Agent Interface , 1997, Appl. Artif. Intell..

[12]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[13]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[14]  Thomas G. Dietterich,et al.  The TaskTracer system , 2005, AAAI 2005.

[15]  Richard S. Sutton,et al.  Adapting Bias by Gradient Descent: An Incremental Version of Delta-Bar-Delta , 1992, AAAI.

[16]  Melinda Gervasio,et al.  Learning Email Procedures for the Desktop , 2008 .

[17]  Ke Wang,et al.  Behavior-based modeling and its application to Email analysis , 2006, TOIT.

[18]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[19]  Deborah L. McGuinness,et al.  Explaining Social Relationships , 2008 .

[20]  Owen Rambow,et al.  Summarizing Email Threads , 2004, NAACL.

[21]  Aleks Jakulin Machine Learning Based on Attribute Interactions , 2005 .

[22]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[23]  Andrew McCallum,et al.  Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.

[24]  Victoria Bellotti,et al.  E-mail as habitat: an exploration of embedded personal information management , 2001, INTR.

[25]  Sharma Chakravarthy,et al.  eMailSift: mining-based approaches to email classification , 2004, SIGIR '04.

[26]  Cécile Paris,et al.  The nature of requests and commitments in email messages , 2008, AAAI 2008.

[27]  Wei Li,et al.  Pachinko allocation: DAG-structured mixture models of topic correlations , 2006, ICML.

[28]  Mark Dredze,et al.  Activity-Centric Email: A Machine Learning Approach , 2006, AAAI.

[29]  Michael J. Muller,et al.  One-hundred days in an activity-centric collaboration environment based on shared objects , 2004, CHI.

[30]  Stan Matwin,et al.  Email classification with co-training , 2011, CASCON.

[31]  Marti A. Hearst Clustering versus faceted categories for information exploration , 2006, Commun. ACM.

[32]  Lise Getoor,et al.  Relationship Identification for Social Network Discovery , 2007, AAAI.

[33]  Adwait Ratnaparkhi,et al.  A Simple Introduction to Maximum Entropy Models for Natural Language Processing , 1997 .

[34]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[35]  Giorgio Satta,et al.  Guided Learning for Bidirectional Sequence Classification , 2007, ACL.

[36]  Derek Scott Lam,et al.  Exploiting E-mail Structure to Improve Summarization , 2002 .

[37]  William W. Cohen,et al.  Recommending Recipients in the Enron Email Corpus , 1972 .

[38]  William W. Cohen,et al.  Contextual search and name disambiguation in email using graphs , 2006, SIGIR.

[39]  Ani Nenkova,et al.  Email classification for contact centers , 2003, SAC '03.

[40]  William W. Cohen,et al.  Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.

[41]  Rajat Raina,et al.  Constructing informative priors using transfer learning , 2006, ICML.

[42]  Tom Heskes,et al.  Task Clustering and Gating for Bayesian Multitask Learning , 2003, J. Mach. Learn. Res..

[43]  Kathleen M. Carley,et al.  Exploration of communication networks from the Enron email corpus , 2005 .

[44]  Roger Wattenhofer,et al.  BuzzTrack: topic detection and tracking in email , 2007, IUI '07.

[45]  Weng-Keen Wong,et al.  Integrating rich user feedback into intelligent user interfaces , 2008, IUI '08.

[46]  Jacek Gwizdka,et al.  Email task management styles: the cleaners and the keepers , 2004, CHI EA '04.

[47]  Tom M. Mitchell,et al.  Inferring Ongoing Activities of Workstation Users by Clustering Email , 2004, CEAS.

[48]  William W. Cohen Learning Rules that Classify E-Mail , 1996 .

[49]  Mika Käki,et al.  Findex: search result categories help users when document ranking fails , 2005, CHI.

[50]  Thomas G. Dietterich,et al.  Fewer clicks and less frustration: reducing the cost of reaching the right folder , 2006, IUI '06.

[51]  William W. Cohen,et al.  On the collective classification of email "speech acts" , 2005, SIGIR '05.

[52]  Pannagadatta K. Shivaswamy Ellipsoidal Kernel Machines , 2007 .

[53]  William W. Cohen,et al.  Activity-centred Search in Email , 2008, CEAS.

[54]  Philip M. Long,et al.  Online Multitask Learning , 2006, COLT.

[55]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[56]  Filip Radlinski,et al.  Minimally Invasive Randomization for Collecting Unbiased Preferences from Clickthrough Logs , 2006, AAAI 2006.

[57]  Tomás E. Uribe,et al.  Active preference learning for personalized calendar scheduling assistance , 2005, IUI.

[58]  Gábor Lugosi,et al.  Minimizing regret with label efficient prediction , 2004, IEEE Transactions on Information Theory.

[59]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[60]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[61]  Cécile Paris,et al.  Classifying Speech Acts using Verbal Response Modes , 2006, ALTA.

[62]  Adam Tauman Kalai,et al.  Analysis of Perceptron-Based Active Learning , 2009, COLT.

[63]  Bernardo A. Huberman,et al.  Email as spectroscopy: automated discovery of community structure within organizations , 2003 .

[64]  Nathaniel Good,et al.  TV-ACTA: Embedding an Activity-Centered Interface for Task Management in Email , 2007, CEAS.

[65]  Fernando Pereira,et al.  Generating summary keywords for emails using topics , 2008, IUI '08.

[66]  Mark Dredze,et al.  Managers' email: beyond tasks and to-dos , 2005, CHI EA '05.

[67]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[68]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[69]  Victoria Bellotti,et al.  Ceci n'est pas un Objet? Talking About Objects in E-mail , 2003, Hum. Comput. Interact..

[70]  Daniel G. Bobrow,et al.  What a to-do: studies of task management towards the design of a personal task list manager , 2004, CHI.

[71]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks , 2005, IJCAI.

[72]  Gary Geunbae Lee,et al.  MMR-based Active Machine Learning for Bio Named Entity Recognition , 2006, NAACL.

[73]  Susan T. Dumais,et al.  Fast, Flexible Filtering with Phlat — Personal Search and Organization Made Easy , 2006 .

[74]  John C. Platt,et al.  Online Bayes Point Machines , 2003, PAKDD.

[75]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[76]  Eric K. Ringger,et al.  Active Learning for Part-of-Speech Tagging: Accelerating Corpus Annotation , 2007, LAW@ACL.

[77]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[78]  A. J. Bernheim Brush,et al.  Revisiting Whittaker & Sidner's "email overload" ten years later , 2006, CSCW '06.

[79]  John Blitzer,et al.  "Sorry, I Forgot the Attachment": Email Attachment Prediction , 2006, CEAS.

[80]  Yorick Wilks,et al.  FASIL Email Summarisation System , 2004, COLING.

[81]  Koby Crammer,et al.  Active Learning with Confidence , 2008, ACL.

[82]  John C. Tang,et al.  Tag-it, snag-it, or bag-it: combining tags, threads, and folders in e-mail , 2008, CHI Extended Abstracts.

[83]  Tessa A. Lau,et al.  Automated email activity management: an unsupervised learning approach , 2005, IUI.

[84]  Yiming Yang,et al.  The Enron Corpus: A New Dataset for Email Classi(cid:12)cation Research , 2004 .

[85]  Li-Te Cheng,et al.  Supporting activity-centric collaboration through peer-to-peer shared objects , 2003, GROUP '03.

[86]  Oren Etzioni,et al.  Semantic email: theory and applications , 2004, J. Web Semant..

[87]  Massimiliano Pontil,et al.  Regularized multi--task learning , 2004, KDD.

[88]  Susan T. Dumais,et al.  LSI meets TREC: A Status Report , 1992, TREC.

[89]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[90]  Thomas L. Griffiths,et al.  Probabilistic Topic Models , 2007 .

[91]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[92]  Guy Lapalme,et al.  Using information extraction and natural language generation to answer e-mail , 2001, Data Knowl. Eng..

[93]  Andrew McCallum,et al.  Efficiently Inducing Features of Conditional Random Fields , 2002, UAI.

[94]  Susan T. Dumais,et al.  Searching to eliminate personal information management , 2006, CACM.

[95]  Samy Bengio,et al.  Modeling Interactions from Email Communication , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[96]  Michael Freed,et al.  RADAR: A Personal Assistant that Learns to Reduce Email Overload , 2008, AAAI.

[97]  Tom Bylander Worst-Case Analysis of the Perceptron and Exponentiated Update Algorithms , 1998, Artif. Intell..

[98]  Manfred K. Warmuth,et al.  The Weighted Majority Algorithm , 1994, Inf. Comput..

[99]  Thomas L. Griffiths,et al.  Hierarchical Topic Models and the Nested Chinese Restaurant Process , 2003, NIPS.

[100]  James A. Landay,et al.  Investigating statistical machine learning as a tool for software development , 2008, CHI.

[101]  Koby Crammer,et al.  Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res..

[102]  Jack Park,et al.  IRIS: Integrate. Relate. Infer. Share , 2005, Semantic Desktop Workshop.

[103]  Victoria Bellotti,et al.  Taskmaster: recasting email as task management , 2002 .

[104]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[105]  de,et al.  Prototype Demonstration: Using Link Analysis to Identify Aspects in Faceted Web Search , 2006 .

[106]  Andrew McCallum,et al.  Extracting social networks and contact information from email and the Web , 2004, CEAS.

[107]  Kenrick J. Mock An experimental framework for email categorization and management , 2001, SIGIR '01.

[108]  Giuseppe Carenini,et al.  Summarizing email conversations with clue words , 2007, WWW '07.

[109]  Aaron Steinfeld,et al.  Evaluation of an integrated multi-task machine learning system with humans in the loop , 2007 .

[110]  Kevin Li,et al.  Faceted metadata for image search and browsing , 2003, CHI '03.

[111]  David R. McGee,et al.  Human-centered collaborative interaction , 2006, HCM '06.

[112]  Thomas G. Dietterich,et al.  A hybrid learning system for recognizing user tasks from desktop activities and email messages , 2006, IUI '06.

[113]  Kaare Brandt Petersen,et al.  The Matrix Cookbook , 2006 .

[114]  Josef Kittler,et al.  Combining multiple classifiers by averaging or by multiplying? , 2000, Pattern Recognit..

[115]  Carman Neustaedter,et al.  The Social Network and Relationship Finder: Social Sorting for Email Triage , 2005, CEAS.

[116]  Andrew McCallum,et al.  Automatic Categorization of Email into Folders: Benchmark Experiments on Enron and SRI Corpora , 2005 .

[117]  Kathleen F. McCoy,et al.  Efficiently Computed Lexical Chains as an Intermediate Representation for Automatic Text Summarization , 2002, CL.

[118]  Bradley Malin,et al.  Email alias detection using social network analysis , 2005, LinkKDD '05.

[119]  Michael J. Muller,et al.  Predicting individual priorities of shared activities using support vector machines , 2007, CIKM '07.

[120]  Y. Singer,et al.  Ultraconservative online algorithms for multiclass problems , 2003 .

[121]  Robert E. Kraut,et al.  Understanding email use: predicting action on a message , 2005, CHI.

[122]  Thomas P. Moran,et al.  Unified activity management: supporting people in e-business , 2005, CACM.

[123]  Simone Stumpf,et al.  Predicting Task-Specific Webpages for Revisiting , 2006, AAAI.

[124]  Jeffrey O. Kephart,et al.  Incremental Learning in SwiftFile , 2000, ICML.

[125]  Susan T. Dumais,et al.  Examining Repetition in User Search Behavior , 2007, ECIR.

[126]  Jacek Gwizdka,et al.  TaskView: design and evaluation of a task-based email interface , 2002, CASCON.

[127]  Kevin W. Bowyer,et al.  Combination of Multiple Classifiers Using Local Accuracy Estimates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[128]  Mark Dredze,et al.  User Models for Email Activity Management , 2008 .

[129]  William W. Cohen,et al.  Preventing Information Leaks in Email , 2007, SDM.

[130]  Tom M. Mitchell,et al.  Learning to Classify Email into “Speech Acts” , 2004, EMNLP.

[131]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[132]  Owen Rambow,et al.  Using Question-Answer Pairs in Extractive Summarization of Email Conversations , 2007, CICLing.

[133]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[134]  Christopher Joseph Pal CC Prediction with Graphical Models , 2006, CEAS.

[135]  Lise Getoor,et al.  Name Reference Resolution in Organizational Email Archives , 2006, SDM.

[136]  John Blitzer,et al.  Summarizing archived discussions: a beginning , 2003, IUI '03.

[137]  Jade Goldstein-Stewart,et al.  Using Speech Acts to Categorize Email and Identify Email Genres , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[138]  Judy Kay,et al.  An intelligent interface for sorting electronic mail , 2002, IUI '02.

[139]  John C. Platt,et al.  Automatic Discovery of Personal Topics to Organize Email , 2005, CEAS.

[140]  Andrew Lampert,et al.  Can Requests-for-Action and Commitments-to-Act be Reliably Identified in Email Messages ? , 2007 .

[141]  John C. Tang,et al.  When Can I Expect an Email Response? A Study of Rhythms in Email Usage , 2003, ECSCW.

[142]  Tong Zhang,et al.  Named Entity Recognition through Classifier Combination , 2003, CoNLL.

[143]  Eyal Oren,et al.  Extending Faceted Navigation for RDF Data , 2006, SEMWEB.

[144]  Hanna Wallach,et al.  Structured Topic Models for Language , 2008 .

[145]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[146]  Tommi S. Jaakkola,et al.  Automatic Feature Induction for Text Classification , 2002 .

[147]  Philip Resnik,et al.  Online Large-Margin Training of Syntactic and Structural Translation Features , 2008, EMNLP.

[148]  Oren Etzioni,et al.  Semantic email , 2004, WWW '04.

[149]  Manabu Sassano,et al.  An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation , 2002, ACL.

[150]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[151]  Marti A. Hearst,et al.  Nearly-Automated Metadata Hierarchy Creation , 2004, NAACL.

[152]  K. Fujimura,et al.  BLOGRANGER – A Multi-faceted Blog Search Engine , 2006 .

[153]  Ian Smith,et al.  Taking email to task: the design and evaluation of a task management centered email tool , 2003, CHI '03.

[154]  Nick Craswell,et al.  Overview of the TREC 2006 Enterprise Track , 2006, TREC.

[155]  Andrew Y. Ng,et al.  Transfer learning for text classification , 2005, NIPS.

[156]  Einat Minkov Activity-centric Search in Email , 2008 .

[157]  Victoria Bellotti,et al.  Managing Activities with TV-ACTA: TaskVista and Activity- Centered Task Assistant , 2006 .

[158]  Bernardo A. Huberman,et al.  E-Mail as Spectroscopy: Automated Discovery of Community Structure within Organizations , 2005, Inf. Soc..

[159]  Nick Craswell,et al.  Overview of the TREC 2005 Enterprise Track , 2005, TREC.

[160]  Nicholas Kushmerick,et al.  Email Task Management: An Iterative Relational Learning Approach , 2005, CEAS.

[161]  Wisam Dakka Automatic Discovery of Useful Facet Terms , 2006 .

[162]  Koby Crammer,et al.  Online Large-Margin Training of Dependency Parsers , 2005, ACL.

[163]  Akira Shimazu,et al.  Construction of Deliberation Structure in E‐Mail Communication , 2000, Comput. Intell..

[164]  Sven Schmeier,et al.  Message Classification in the Call Center , 2000, ANLP.

[165]  Steve Whittaker,et al.  Supporting collaborative task management in e-mail , 2005 .

[166]  Ani Nenkova,et al.  Facilitating email thread access by extractive summary generation , 2003, RANLP.

[167]  Thomas L. Griffiths,et al.  A probabilistic approach to semantic representation , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[168]  Michael Muller,et al.  Collaborating Within - not Through - Email: Users Reinvent a Familiar Technology , 2002 .

[169]  Tom M. Mitchell,et al.  Extracting Knowledge about Users' Activities from Raw Workstation Contents , 2006, AAAI.

[170]  Eric Horvitz,et al.  Attention-Sensitive Alerting , 1999, UAI.

[171]  Yi Zhang,et al.  Graph-based ranking algorithms for e-mail expertise analysis , 2003, DMKD '03.

[172]  Koby Crammer,et al.  Flexible Text Segmentation with Structured Multilabel Classification , 2005, HLT.

[173]  Eric Brill,et al.  Learning effective ranking functions for newsgroup search , 2004, SIGIR '04.

[174]  Marti A. Hearst,et al.  Flexible Search and Navigation using Faceted Metadata , 2002 .

[175]  Samy Bengio,et al.  Modeling interactions from email communications , 2006 .

[176]  Anoop Gupta,et al.  Supporting Email Workflow , 2001 .

[177]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[178]  Susan T. Dumais,et al.  A Bayesian Approach to Filtering Junk E-Mail , 1998, AAAI 1998.

[179]  Smaranda Muresan,et al.  Combining linguistic and machine learning techniques for email summarization , 2001, CoNLL.

[180]  Koby Crammer,et al.  Confidence-weighted linear classification , 2008, ICML '08.

[181]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[182]  Wei Li,et al.  Mixtures of hierarchical topics with Pachinko allocation , 2007, ICML '07.

[183]  John Blitzer,et al.  Reply Expectation Prediction for Email Management , 2005, CEAS.

[184]  Foster J. Provost,et al.  Aggregation-based feature invention and relational concept classes , 2003, KDD '03.

[185]  Min Tang,et al.  Active Learning for Statistical Natural Language Parsing , 2002, ACL.

[186]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[187]  Stephen Wan,et al.  Generating Overview Summaries of Ongoing Email Thread Discussions , 2004, COLING.

[188]  Koby Crammer,et al.  A Family of Additive Online Algorithms for Category Ranking , 2003, J. Mach. Learn. Res..

[189]  Aravind K. Joshi,et al.  Ranking and Reranking with Perceptron , 2005, Machine Learning.

[190]  Douglas W. Oard,et al.  Resolving Personal Names in Email Using Context Expansion , 2008, ACL.

[191]  William W. Cohen,et al.  Discovering Leadership Roles in Email Workgroups , 2007, CEAS.

[192]  Koby Crammer Online Learning for Complex Cat-egorial Problems , 2005 .

[193]  Thomas G. Dietterich,et al.  TaskTracer: a desktop environment to support multi-tasking knowledge workers , 2005, IUI.

[194]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[195]  Stephen Wan,et al.  Using Thematic Information in Statistical Headline Generation , 2003, Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering -.

[196]  Philip M. Long,et al.  Worst-case quadratic loss bounds for a generalization of the Widrow-Hoff rule , 1993, COLT '93.

[197]  Xavier Carreras,et al.  TAG, Dynamic Programming, and the Perceptron for Efficient, Feature-Rich Parsing , 2008, CoNLL.

[198]  Michael Gamon,et al.  Task-Focused Summarization of Email , 2004 .

[199]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[200]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[201]  Hongyuan Zha,et al.  Probabilistic models for discovering e-communities , 2006, WWW '06.

[202]  Koby Crammer,et al.  Online Methods for Multi-Domain Learning and Adaptation , 2008, EMNLP.

[203]  Douglas W. Oard,et al.  Modeling Identity in Archival Collections of Email: A Preliminary Study , 2006, CEAS.

[204]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[205]  Marti A. Hearst,et al.  Automating Creation of Hierarchical Faceted Metadata Structures , 2007, NAACL.

[206]  Colin Campbell,et al.  Bayes Point Machines , 2001, J. Mach. Learn. Res..

[207]  Paula S. Newman Email archive overviews using subject indexes , 2002, CHI Extended Abstracts.

[208]  Deborah L. McGuinness,et al.  Toward establishing trust in adaptive agents , 2008, IUI '08.

[209]  Salvatore J. Stolfo,et al.  A temporal based forensic analysis of electronic communication , 2006, DG.O.

[210]  Mary Czerwinski,et al.  FaThumb: a facet-based interface for mobile search , 2006, CHI.

[211]  J. Austin How to do things with words , 1962 .

[212]  Kathleen McKeown,et al.  Detection of Question-Answer Pairs in Email Conversations , 2004, COLING.

[213]  Jeffrey O. Kephart,et al.  SpamGuru: An Enterprise Anti-Spam Filtering System , 2004, CEAS.

[214]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[215]  Giuseppe Carenini,et al.  Summarizing Emails with Conversational Cohesion and Subjectivity , 2008, ACL.

[216]  Eric Wilcox,et al.  Designing remail: reinventing the email client through innovation and integration , 2004, CHI EA '04.

[217]  Yoram Singer,et al.  Online multiclass learning by interclass hypothesis sharing , 2006, ICML.

[218]  William W. Cohen,et al.  CutOnce-Recipient Recommendation and Leak Detection in Action , 2008 .

[219]  William W. Cohen,et al.  Single-pass online learning: performance, voting schemes and online feature selection , 2006, KDD '06.

[220]  Bernard Kerr Thread Arcs: an email thread visualization , 2003, IEEE Symposium on Information Visualization 2003 (IEEE Cat. No.03TH8714).

[221]  J. Sadock Speech acts , 2007 .

[222]  Carman Neustaedter,et al.  Beyond "from" and "received": exploring the dynamics of email triage , 2005, CHI Extended Abstracts.

[223]  Stan Matwin,et al.  Email Classification with Temporal Features , 2004, Intelligent Information Systems.

[224]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[225]  Rebecca E. Grinter,et al.  Quality versus quantity: e-mail-centric task management and its relation with overload , 2005 .

[226]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[227]  Stanley F. Chen,et al.  A Gaussian Prior for Smoothing Maximum Entropy Models , 1999 .

[228]  Desney S. Tan,et al.  FacetMap: A Scalable Search and Browse Visualization , 2006, IEEE Transactions on Visualization and Computer Graphics.

[229]  Mitchell P. Marcus,et al.  Maximum entropy models for natural language ambiguity resolution , 1998 .

[230]  Alex Acero,et al.  Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lo , 2006, Comput. Speech Lang..

[231]  Mark Dredze,et al.  Automatically classifying emails into activities , 2006, IUI '06.

[232]  Koby Crammer,et al.  Feature Design for Transfer Learning , 2006 .

[233]  Martin Wattenberg,et al.  ReMail: a reinvented email prototype , 2004, CHI EA '04.

[234]  Jeffrey O. Kephart,et al.  MailCat: an intelligent assistant for organizing e-mail , 1999, AGENTS '99.

[235]  Andrea Lockerd Thomaz,et al.  DriftCatcher: The Implicit Social Context of Email , 2003, INTERACT.

[236]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[237]  Candace L. Sidner,et al.  Email overload: exploring personal information management of email , 1996, CHI.

[238]  Karin Becker,et al.  Mail-by-example: a visual query interface for email management , 2000, AVI '00.

[239]  Edward F. Harrington,et al.  Online Ranking/Collaborative Filtering Using the Perceptron Algorithm , 2003, ICML.

[240]  Anton Leuski Email is a stage: discovering people roles from email archives , 2004, SIGIR '04.

[241]  Lise Getoor,et al.  Inferring Organizational Titles in Online Communication , 2006, SNA@ICML.

[242]  Jianqiang Shen,et al.  Automatically finding and recommending resources to support knowledge workers' activities , 2008, IUI '08.

[243]  Hwee Tou Ng,et al.  Domain Adaptation with Active Learning for Word Sense Disambiguation , 2007, ACL.

[244]  Christopher Meek,et al.  Challenges of the Email Domain for Text Classification , 2000, ICML.

[245]  Antoine Bordes,et al.  The Huller: A Simple and Efficient Online SVM , 2005, ECML.

[246]  Henry Tirri,et al.  A Scalable Topic-Based Open Source Search Engine , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[247]  Jason D. M. Rennie ifile: An Application of Machine Learning to E-Mail Filtering , 2000 .

[248]  Sebastian Thrun,et al.  Clustering Learning Tasks and the Selective Cross-Task Transfer of Knowledge , 1998, Learning to Learn.

[249]  Aleks Jakulin,et al.  Machine learning based on attribute interactions : phd dissertation , 2005 .

[250]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[251]  Claudio Gentile,et al.  A Second-Order Perceptron Algorithm , 2002, SIAM J. Comput..