A dynamic and fast event matching algorithm for a content-based publish/subscribe information dissemination system in Sensor-Grid

In this paper, we discuss one of the most important issues in Sensor-Grid: developing a fast and flexible content-based publish/subscribe information dissemination (CBPSID) system for the automatic fusion, interpretation, sharing and delivery of large volumes of sensor data to consumers in a highly dynamic Sensor-Grid environment. Existing work on CBPSID systems in Sensor-Grid mostly focuses on reducing the effort needed to define and maintain subscriptions and on handling dynamic changes in publishers' and consumers' data. However, the performance of a CBPSID system in Sensor-Grid is bounded by the expensive cost of matching (evaluating) events against subscriptions. Existing event-matching algorithms are not very efficient, especially for subscriptions with interval range predicates or overlapping predicates, which are common in Sensor-Grid as well as in other application areas. We address this challenge and propose a dynamic and fast event-matching algorithm, called CGIM, for the CBPSID system in Sensor-Grid. The algorithm handles range predicates and overlapping predicates well and provides both single and composite event matching. It uses two approaches, called SGIM and DGIM, to group the subscriptions by their predicates, and it dynamically identifies an appropriate number of groups at run time by considering different statistical distributions of subscriptions. We also present an experimental evaluation of the proposed algorithm in a Sensor-Grid based u-Healthcare scenario using synthetic workloads and compare its performance with that of existing algorithms. The experimental results show that our algorithm significantly reduces the evaluation cost compared with existing algorithms (on average by 79% using SGIM and by 88% using DGIM) and scales with the number of subscriptions as well as with the number of predicates and events. In addition, further experiments were conducted by applying the CGIM algorithm in other application areas, e.g., in a publish/subscribe system for online job sites, to show its applicability and scalability beyond Sensor-Grid.
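
The general idea of grouping subscriptions by their range predicates, so that an event is checked against only a small subset of them, can be illustrated with a minimal Python sketch. This is not the CGIM, SGIM or DGIM algorithm, whose details are not given in this abstract. It assumes a fixed number of equal-width groups per attribute and a known value domain for each attribute; the names `RangePredicate`, `GroupedMatcher` and `num_groups` are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class RangePredicate:
    """Interval predicate low <= value <= high on one attribute (illustrative)."""
    attribute: str
    low: float
    high: float

    def matches(self, value: float) -> bool:
        return self.low <= value <= self.high


class GroupedMatcher:
    """Hypothetical matcher: splits each attribute domain into equal-width groups."""

    def __init__(self, domains, num_groups=8):
        # domains: attribute -> (min, max); assumed to be known in advance.
        self.domains = domains
        self.num_groups = num_groups
        # (attribute, group index) -> list of (subscription id, predicate)
        self.groups = defaultdict(list)
        # subscription id -> number of predicates that must all hold
        self.predicate_count = {}

    def _group_index(self, attribute, value):
        lo, hi = self.domains[attribute]
        width = (hi - lo) / self.num_groups
        idx = int((value - lo) / width)
        # Clamp out-of-domain values to the first or last group.
        return min(max(idx, 0), self.num_groups - 1)

    def subscribe(self, sub_id, predicates):
        self.predicate_count[sub_id] = len(predicates)
        for p in predicates:
            first = self._group_index(p.attribute, p.low)
            last = self._group_index(p.attribute, p.high)
            # Register the predicate in every group its interval overlaps.
            for g in range(first, last + 1):
                self.groups[(p.attribute, g)].append((sub_id, p))

    def match(self, event):
        """Return sorted ids of subscriptions whose predicates all hold."""
        hits = defaultdict(int)
        for attribute, value in event.items():
            if attribute not in self.domains:
                continue
            g = self._group_index(attribute, value)
            # Only predicates registered in this one group are evaluated.
            for sub_id, p in self.groups.get((attribute, g), ()):
                if p.matches(value):
                    hits[sub_id] += 1
        return sorted(s for s, n in hits.items()
                      if n == self.predicate_count[s])
```

For example, with made-up u-Healthcare attributes, a subscription `"s1"` on `heart_rate` in [100, 180] and a subscription `"s2"` on `heart_rate` in [60, 120] and `temp` in [38, 42], calling `match({"heart_rate": 110, "temp": 39})` returns both ids, while an event with `temp` of 36.5 returns only `"s1"`. A fixed `num_groups` is the simplest choice here; the abstract states that the proposed algorithm instead chooses the number of groups dynamically from the distribution of subscriptions.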
