An event processing approach to text stream analysis: basic principles of event based information filtering

Information filtering is a crucial task in a world where data is generated steadily and at a high rate, helping users in distinguishing relevant from irrelevant content. This requires efficient processing of continuous streams of textual data. Event processing allows for real time processing of data streams. But up to now event processing has mainly been investigated in the context of business transaction-oriented domains like logistics or finance, but not explicitly in terms of text stream processing and information filtering. The growth of applications that analyze social media streams lets such an approach appear reasonable. Therefore we propose a common vocabulary represented by a text domain event model as well as a reference architecture for text stream processing and information filtering, in order to facilitate the implementation and the assessment of event processing applications for text streams. In addition we describe results from actual use cases that employ this architecture.

[1]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[2]  Sascha Ossowski,et al.  Injecting Semantics into Event-driven Architectures , 2009, ICEIS.

[3]  Alexandre Alves,et al.  Tutorial on advanced design patterns in event processing , 2012, DEBS.

[4]  Edward Curry,et al.  Approximate semantic matching of heterogeneous events , 2012, DEBS.

[5]  Christopher G. Lasater,et al.  Design Patterns , 2008, Wiley Encyclopedia of Computer Science and Engineering.

[6]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[7]  Emanuele Della Valle,et al.  An Introduction to Information Retrieval , 2013 .

[8]  Philippe Krutchen,et al.  Architectural blueprints--the '4+1' view model of software architecture , 1995 .

[9]  Martin Wattenberg,et al.  Stacked Graphs – Geometry & Aesthetics , 2008, IEEE Transactions on Visualization and Computer Graphics.

[10]  Charles L. A. Clarke,et al.  Information Retrieval - Implementing and Evaluating Search Engines , 2010 .

[11]  Schahram Dustdar,et al.  Deriving a unified fault taxonomy for event-based systems , 2012, DEBS.

[12]  Annika Marie Hinze A-MEDIAS: concept and design of an adaptive integrating event notification service , 2003 .

[13]  Marc-Thomas Schmidt,et al.  The Enterprise Service Bus: Making service-oriented architecture real , 2005, IBM Syst. J..

[14]  Nenad Stojanovic,et al.  Using Complex Event Processing for Modeling Semantic Requests in Real-Time Social Media Monitoring , 2012, ICWSM 2012.

[15]  Avigdor Gal,et al.  A Model for Reasoning with Uncertain Rules in Event Composition Systems , 2005, UAI.

[16]  Opher Etzion,et al.  Complex event processing , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[17]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[18]  Avigdor Gal,et al.  Complex event processing over uncertain data , 2008, DEBS.

[19]  Philippe Kruchten,et al.  Architecture blueprints—the “4+1” view model of software architecture , 1995, TRI-Ada '95.

[20]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[21]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[22]  David Crystal,et al.  A dictionary of linguistics and phonetics , 1997 .

[23]  Northrop Grumman,et al.  Reference Architecture Foundation for Service Oriented Architecture Version 1.0 , 2009 .

[24]  David Luckham,et al.  The power of events - an introduction to complex event processing in distributed enterprise systems , 2002, RuleML.

[25]  Jürgen Dunkel,et al.  Situation-aware energy control by combining simple sensors and Complex Event Processing , 2012 .

[26]  Opher Etzion,et al.  Event Processing in Action , 2010 .

[27]  Yushun Fan,et al.  Complex event processing in enterprise information systems based on RFID , 2007, Enterp. Inf. Syst..

[28]  Stephen E. Robertson,et al.  The TREC 2002 Filtering Track Report , 2002, TREC.

[29]  Alastair G. Smith Information Retrieval: Implementing and Evaluating Search Engines , 2011 .