Modelling performance optimizations for content-based publish/subscribe

Content-based publish/subscribe (CPS) systems can efficiently deliver messages to large numbers of subscribers with diverse interests and have consequently often been considered an appropriate technology for large-scale, event-based applications. Indeed, a significant amount of existing research addresses the problem of providing scalable CPS services [3, 7, 8, 11]. In these approaches, scalability and high-performance matching are often achieved by exploiting similarities between subscriptions. However, although such optimization techniques are widely used, no model has yet been developed to capture them. Such a model would allow CPS matching algorithms to be studied, analyzed, and optimized at a more fundamental and formal level. In this work-in-progress paper, we present initial results of our work towards modelling and analyzing the matching optimizations frequently used by CPS systems. Using our proposed model, we find that probabilistically optimal CPS matching is possible for certain types of subscription sets and that there is a non-obvious upper bound on the expected cost of some subscription sets. We also provide experimental results that support the proposed model.

[1] Dennis Shasha, et al. Filtering algorithms and implementation for very fast publish/subscribe systems, 2001, SIGMOD '01.

[2] David S. Rosenblum, et al. Design and evaluation of a wide-area event notification service, 2001, TOCS.

[3] Peter R. Pietzuch, et al. Hermes: a distributed event-based middleware architecture, 2002, Proceedings of the 22nd International Conference on Distributed Computing Systems Workshops.

[4] Gero Mühl, et al. Large-scale content-based publish/subscribe systems, 2002.

[5] Marcos K. Aguilera, et al. Matching events in a content-based subscription system, 1999, PODC '99.

[6] Russell Greiner, et al. Finding optimal satisficing strategies for and-or trees, 2006, Artif. Intell.

[7] Hans-Arno Jacobsen, et al. A Unified Approach to Routing, Covering and Merging in Publish/Subscribe Systems Based on Modified Binary Decision Diagrams, 2005, 25th IEEE International Conference on Distributed Computing Systems (ICDCS'05).

[8] Alexander L. Wolf, et al. Forwarding in a content-based network, 2003, SIGCOMM '03.

[9] Herbert A. Simon, et al. Optimal Problem-Solving Search: All-or-None Solutions, 1975, Artif. Intell.

[10] Yannis E. Ioannidis, et al. The History of Histograms (abridged), 2003, VLDB.

[11] Guruduth Banavar, et al. An efficient multicast protocol for content-based publish-subscribe systems, 1999, Proceedings of the 19th IEEE International Conference on Distributed Computing Systems.

[12] Peter R. Pietzuch, et al. Peer-to-peer overlay broker networks in an event-based middleware, 2003, DEBS '03.

[13] Helmut Veith, et al. Efficient filtering in publish-subscribe systems using binary decision diagrams, 2001, Proceedings of the 23rd International Conference on Software Engineering (ICSE 2001).