Mapping Multichannel Documents to Attentive Tasks: Ensuring Information Gain and Detecting Failure

Our approach of process-driven document analysis (DA) aims at supporting enterprises in managing the complexity of multiple input channels. Within this approach, we proposed earlier to map each incoming document to its corresponding task context - denominated as Attentive Task (AT) - by applying search with Dempster-Shafer theory. In this paper, we extend the search algorithm with methods from machine learning for addressing the challenges in real enterprise domains: (1) information gain trees for optimizing initial evidence selection and (2) five strategies for detecting search failures. We evaluate all proposed methods on a corpus from a financial institution and give an overview on how the approach enables automation services in multichannel management.

[1]  Mark Dredze,et al.  Automatically classifying emails into activities , 2006, IUI '06.

[2]  Andreas Dengel,et al.  Attentive Tasks: Process-Driven Document Analysis for Multichannel Documents , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[3]  Tessa A. Lau,et al.  Automated email activity management: an unsupervised learning approach , 2005, IUI.

[4]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[5]  Steve Whittaker,et al.  Supporting collaborative task management in e-mail , 2005 .

[6]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[7]  Rebecca E. Grinter,et al.  Quality versus quantity: e-mail-centric task management and its relation with overload , 2005 .

[8]  Andreas Dengel,et al.  Searching attentive tasks with document analysis evidences and Dempster-Shafer theory , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[9]  Jan-Peter Krämer,et al.  PIM-Mail: consolidating task and email management , 2010, CHI Extended Abstracts.

[10]  Bradley R. Schmerl,et al.  Agent-assisted task management that reduces email overload , 2010, IUI '10.

[11]  Andreas S. Rath,et al.  Machine Learning based Work Task Classification , 2009, J. Digit. Inf. Manag..

[12]  Tom M. Mitchell,et al.  Learning to Classify Email into “Speech Acts” , 2004, EMNLP.

[13]  Siegfried Handschuh,et al.  Classifying Action Items for Semantic Email , 2010, LREC.

[14]  Andreas Dengel,et al.  The specialist board a technology workbench for document analysis and understanding , 1996 .