Application of Data Mining Algorithms to TCP Throughput Prediction in HTTP Transactions

This paper studies the application of data mining algorithms to the prediction of TCP throughput in HTTP transactions. We use data mining models built from historical network performance measurements gathered with the WING system. These measurements reflect Web performance as experienced by end users located in Wroclaw, Poland. The models are created using the algorithms available in the Microsoft SQL Server 2005 and IBM Intelligent Miner tools. Our results show that data mining based TCP throughput prediction returns accurate results. We propose applying the method to build a so-called "best performance hit" operation mode for search engines.
