Hierarchical Clustering of Message Flows in a Multicast Data Dissemination System

A large-scale data dissemination application is characterized by a large number of information flows and information consumers. Consumers are interested in different, yet overlapping, subsets of the flows. Multicast is used to deliver subsets of the flows to subsets of the consumers. Since multicast groups are a limited resource, several flows must share each group, and each consumer must therefore filter out a large number of unneeded flows. We alleviate the end-node filtering load by hierarchically clustering flows into transport-layer sessions, and sessions into network-layer multicast groups. This scheme allows for hierarchical filtering of flows at the receivers. We formulate a cost function that models and emphasizes the filtering process, and propose algorithms for solving the hierarchical mapping problem. Performance evaluation indicates a significant reduction in end-node filtering cost compared to a non-hierarchical approach.
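To make the idea of hierarchical filtering concrete, the following is a minimal Python sketch under stated assumptions. It is not the cost function or the algorithm of the paper: the function name `filtering_cost`, the per-message weights `c_session` and `c_flow`, and the toy flows, sessions, and groups are all hypothetical. It assumes that discarding a message by its session is cheaper than inspecting it down to the flow level, and it counts only the cost of unneeded messages.

```python
def filtering_cost(interests, flow_to_session, session_to_group, rate,
                   c_session=1.0, c_flow=4.0):
    """Total cost of discarding unneeded messages, summed over consumers.

    Assumed (hypothetical) model: a consumer joins every multicast group
    that carries at least one session it needs. Unneeded traffic in a
    joined group is dropped either cheaply at the session level
    (c_session per message) or, if it shares a session with a needed
    flow, at the flow level (c_flow per message).
    """
    total = 0.0
    for wanted_flows in interests.values():
        wanted_sessions = {flow_to_session[f] for f in wanted_flows}
        joined_groups = {session_to_group[s] for s in wanted_sessions}
        for flow, session in flow_to_session.items():
            if session_to_group[session] not in joined_groups:
                continue  # never delivered to this consumer
            if session not in wanted_sessions:
                total += rate[flow] * c_session  # dropped by session
            elif flow not in wanted_flows:
                total += rate[flow] * c_flow  # dropped by flow inspection
    return total


# Toy instance: four flows of equal rate, one multicast group, two consumers.
rate = {"a": 1.0, "b": 1.0, "c": 1.0, "d": 1.0}
interests = {"u1": {"a", "b"}, "u2": {"c"}}

# Hierarchical mapping: two sessions inside the single group G0.
hier_flow_to_session = {"a": "S0", "b": "S0", "c": "S1", "d": "S1"}
hier_session_to_group = {"S0": "G0", "S1": "G0"}

# Non-hierarchical baseline: the group acts as one undifferentiated session.
flat_flow_to_session = {f: "S" for f in rate}
flat_session_to_group = {"S": "G0"}

hier_cost = filtering_cost(interests, hier_flow_to_session,
                           hier_session_to_group, rate)
flat_cost = filtering_cost(interests, flat_flow_to_session,
                           flat_session_to_group, rate)
print("hierarchical:", hier_cost, "flat:", flat_cost)
```

In this toy instance the hierarchical mapping lets each consumer drop the other session's traffic at the cheaper session level, so its total is lower than the flat baseline. The size of the gap depends entirely on the assumed weights and on how well the sessions match the consumers' interests.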
