Jaccard Coefficients based Clustering of XML Web Messages for Network Traffic Aggregation

This paper presents an efficient static clustering model, based on simple Jaccard coefficients, that supports XML message aggregators in order to potentially reduce network traffic. The proposed model groups only highly similar messages, so that web aggregators receive messages with high redundancy. Web message aggregation has become a significant solution for overcoming network bottlenecks and congestion: it reduces network volume by aggregating messages together and removing their redundant information. The performance of the proposed model is compared with both K-Means and Principal Component Analysis (PCA) combined with K-Means. The Jaccard-based clustering model shows promising performance, consuming only around 32% and 25% of the processing time of K-Means and of PCA combined with K-Means, respectively. On the quality measure (Aggregator Compression Ratio), it outperforms both benchmark models.
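To make the grouping idea concrete, the following is a minimal sketch of Jaccard-based clustering, not the paper's actual procedure, which the abstract does not spell out. It assumes each XML message is reduced to a set of tag names and text tokens, and that a message joins the first cluster whose representative (its first member) reaches a similarity threshold. The function names `tokens`, `jaccard`, and `cluster`, the tokenization rule, and the threshold value are all illustrative assumptions.

```python
import re


def tokens(xml_message: str) -> set:
    """Reduce a message to a set of tag names and text tokens (assumed feature set)."""
    return set(re.findall(r"[A-Za-z0-9_.:-]+", xml_message))


def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |A intersect B| / |A union B|; two empty sets count as identical."""
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)


def cluster(messages: list, threshold: float = 0.8) -> list:
    """Greedy single-pass grouping (an assumption, not the paper's algorithm).

    Returns clusters as lists of message indices. Each message is compared
    with the first member of every existing cluster and joins the first one
    whose similarity is at least `threshold`; otherwise it starts a new cluster.
    """
    features = [tokens(m) for m in messages]
    clusters = []
    for i, feat in enumerate(features):
        for members in clusters:
            if jaccard(feat, features[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


if __name__ == "__main__":
    msgs = [
        "<quote><symbol>IBM</symbol><price>101</price></quote>",
        "<quote><symbol>IBM</symbol><price>102</price></quote>",
        "<order><id>7</id><qty>3</qty></order>",
    ]
    # The two quote messages share 4 of 6 distinct tokens (about 0.67),
    # so a threshold of 0.6 groups them and leaves the order message alone.
    print(cluster(msgs, threshold=0.6))
```

A higher threshold yields smaller, more homogeneous clusters, which matches the stated aim of handing the aggregator only highly redundant messages; the single pass over the messages is also what keeps this kind of scheme cheap compared with iterative methods such as K-Means.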
