Approximate semantic matching of heterogeneous events

Event-based systems have loose coupling within space, time and synchronization, providing a scalable infrastructure for information exchange and distributed workflows. However, event-based systems are tightly coupled, via event subscriptions and patterns, to the semantics of the underlying event schema and values. The high degree of semantic heterogeneity of events in large and open deployments such as smart cities and the sensor web makes it difficult to develop and maintain event-based systems. In order to address semantic coupling within event-based systems, we propose vocabulary free subscriptions together with the use of approximate semantic matching of events. This paper examines the requirement of event semantic decoupling and discusses approximate semantic event matching and the consequences it implies for event processing systems. We introduce a semantic event matcher and evaluate the suitability of an approximate hybrid matcher based on both thesauri-based and distributional semantics-based similarity and relatedness measures. The matcher is evaluated over a structured representation of Wikipedia and Freebase events. Initial evaluations show that the approach matches events with a maximal combined precision-recall F1 score of 75.89% on average in all experiments with a subscription set of 7 subscriptions. The evaluation shows how a hybrid approach to semantic event matching outperforms a single similarity measure approach.

[1]  David Luckham,et al.  The power of events - an introduction to complex event processing in distributed enterprise systems , 2002, RuleML.

[2]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[3]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[4]  Edward Curry,et al.  An entity-centric approach to green information systems , 2011, ECIS.

[5]  Seán O'Riain,et al.  Querying Heterogeneous Datasets on the Linked Data Web: Challenges, Approaches, and Trends , 2012, IEEE Internet Computing.

[6]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.

[7]  Rajeev Sangal,et al.  Proceedings of the 20th international joint conference on Artifical intelligence , 2007 .

[8]  John Davidson,et al.  Ogc® sensor web enablement:overview and high level achhitecture. , 2007, 2007 IEEE Autotestcon.

[9]  Fatma Ozcan Proceedings of the 2005 ACM SIGMOD international conference on Management of data , 2005, SIGMOD 2005.

[10]  Annika Hinze,et al.  Event-based applications and enabling technologies , 2009, DEBS '09.

[11]  Edward Curry,et al.  Toward Situation Awareness for the Semantic Sensor Web: Complex Event Processing with Dynamic Linked Data Enrichment , 2011, SSN.

[12]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[13]  Karl Aberer,et al.  Top-k/w publish/subscribe: finding k most relevant publications in sliding time window w , 2008, DEBS.

[14]  Ted Pedersen,et al.  Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts , 2006 .

[15]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[16]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.

[17]  Antonio Iera,et al.  The Internet of Things: A survey , 2010, Comput. Networks.

[18]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[19]  Ashwin Machanavajjhala,et al.  Scalable ranked publish/subscribe , 2008, Proc. VLDB Endow..

[20]  Hans-Arno Jacobsen,et al.  S-ToPSS: Semantic Toronto Publish/Subscribe System , 2003, VLDB.

[21]  Avigdor Gal,et al.  Complex event processing over uncertain data , 2008, DEBS.

[22]  Evaggelia Pitoura,et al.  Preference-aware publish/subscribe delivery with diversity , 2009, DEBS '09.

[23]  Erhard Rahm,et al.  Schema and ontology matching with COMA++ , 2005, SIGMOD '05.

[24]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[25]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[26]  Hans-Arno Jacobsen,et al.  A-TOPSS - A Publish/Subscribe System Supporting Approximate Matching , 2002, VLDB.

[27]  Weiwei Zhang,et al.  FOMatch: A Fuzzy Ontology-Based Semantic Matching Algorithm of Publish/Subscribe Systems , 2008, 2008 International Conference on Computational Intelligence for Modelling Control & Automation.

[28]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[29]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .