Scalable Distribution of XML Content with XNet

The XNET XML content network was designed to implement efficient and reliable distribution of structured XML content to very large populations of consumers. For that purpose, our system integrates several technologies: the routing protocol XROUTE makes extensive use of subscription aggregation to limit the size of routing tables while ensuring perfect routing (that is, an event is forwarded to a link only if it leads to an interested consumer). The filtering engine XTRIE uses a sophisticated algorithm to match incoming XML documents against large populations of tree-structured subscriptions, whereas the XSEARCH subscription management algorithm enables the system to efficiently manage large and highly dynamic consumer populations. Finally, our XNET system integrates reliability mechanisms to guarantee that its state is consistent with the consumer population and implements several approaches to fault tolerance to recover from various types of router and link failures. We have analyzed the efficiency of our techniques with various simulations, and to assess the performance of our system in realistic settings and show that it is perfectly suitable for large-scale distributed environments, we have performed a large-scale experimental deployment on the PlanetLab testbed.

[1]  Olga Papaemmanouil,et al.  SemCast: semantic multicast for content-based data dissemination , 2005, 21st International Conference on Data Engineering (ICDE'05).

[2]  Raphael Chand Large scale diffusion of information in publish/subscribe systems , 2005 .

[3]  Pascal Felber,et al.  XNET: a reliable content-based publish/subscribe system , 2004, Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems, 2004..

[4]  Yanlei Diao,et al.  Towards an Internet-Scale XML Dissemination Service , 2004, VLDB.

[5]  Alexander L. Wolf,et al.  A routing scheme for content-based networking , 2004, IEEE INFOCOM 2004.

[6]  Raphael Chand Efficient subscription management in content based networks , 2004, ICSE 2004.

[7]  Masami Hagiya,et al.  XML Schema Containment Checking Based on Semi-implicit Techniques , 2003, CIAA.

[8]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.

[9]  Pascal Felber,et al.  A scalable protocol for content-based routing in overlay networks , 2003, Second IEEE International Symposium on Network Computing and Applications, 2003. NCA 2003..

[10]  Thomas Schwentick,et al.  XPath Containment in the Presence of Disjunction, DTDs, and Variables , 2003, ICDT.

[11]  Peter T. Wood Containment for XPath Fragments under DTD Constraints , 2003, ICDT.

[12]  David S. Rosenblum,et al.  Design and evaluation of a wide-area event notification service , 2001, Foundations of Intrusion Tolerant Systems, 2003 [Organically Assured and Survivable Information Systems].

[13]  Rajeev Rastogi,et al.  Efficient filtering of XML documents with XPath expressions , 2002, Proceedings 18th International Conference on Data Engineering.

[14]  Miguel Castro,et al.  Scribe: a large-scale and decentralized application-level multicast infrastructure , 2002, IEEE J. Sel. Areas Commun..

[15]  Gero Mühl,et al.  Large-scale content based publish, subscribe systems , 2002 .

[16]  Rajeev Rastogi,et al.  Tree Pattern Aggregation for Scalable XML Data Dissemination , 2002, VLDB.

[17]  Yanlei Diao,et al.  YFilter: efficient and scalable filtering of XML documents , 2002, Proceedings 18th International Conference on Data Engineering.

[18]  Peter R. Pietzuch,et al.  Hermes: a distributed event-based middleware architecture , 2002, Proceedings 22nd International Conference on Distributed Computing Systems Workshops.

[19]  Dan Suciu,et al.  Containment and equivalence for an XPath fragment , 2002, PODS.

[20]  Alfonso Fuggetta,et al.  The JEDI Event-Based Infrastructure and Its Application to the Development of the OPSS WFMS , 2001, IEEE Trans. Software Eng..

[21]  Laks V. S. Lakshmanan,et al.  Minimization of tree pattern queries , 2001, SIGMOD '01.

[22]  Ben Y. Zhao,et al.  Bayeux: an architecture for scalable and fault-tolerant wide-area data dissemination , 2001, NOSSDAV '01.

[23]  Bill Segall,et al.  Content Based Routing with Elvin4 , 2000 .

[24]  Guruduth Banavar,et al.  An efficient multicast protocol for content-based publish-subscribe systems , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[25]  Ellen W. Zegura,et al.  How to model an internetwork , 1996, Proceedings of IEEE INFOCOM '96. Conference on Computer Communications.

[26]  Dale Skeen,et al.  The Information Bus: an architecture for extensible distributed systems , 1994, SOSP '93.