Towards fine-grained traffic classification for web applications

Web applications such as video streaming, map services, and office applications have become very popular owing to advances in web technology. Traditional traffic classification methods based on port numbers and payload signatures barely work for them, because the applications share the same port numbers (usually ports 80 and 443) and their payloads are usually encrypted. Furthermore, a single web application may provide multiple functions whose traffic has diverse characteristics. In this work, we use statistical features of application messages to characterize the traffic from individual functions of web applications, and we perform fine-grained classification to identify those functions. The experimental results show that classification accuracy reaches up to 98.30% for the interaction functions and 92.72% for the download functions.
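As a rough illustration of the idea of classifying application functions from message-level statistics, the following minimal sketch summarizes the application-message sizes of a flow and assigns the flow to the nearest class centroid. Everything in it is an assumption made for illustration: the abstract does not specify the feature set or the classifier, so the five features, the nearest-centroid rule, the function names, and the toy message sizes are all hypothetical and are not the method or data of this work.

```python
from statistics import mean, pstdev


def message_features(sizes):
    """Summarize one flow's application-message sizes (bytes) as a feature vector.

    The feature choice (count, mean, std, min, max) is an illustrative assumption.
    """
    return [len(sizes), mean(sizes), pstdev(sizes), min(sizes), max(sizes)]


def train_centroids(labeled_flows):
    """Average the feature vectors of each label to get one centroid per function."""
    grouped = {}
    for label, sizes in labeled_flows:
        grouped.setdefault(label, []).append(message_features(sizes))
    return {
        label: [mean(column) for column in zip(*vectors)]
        for label, vectors in grouped.items()
    }


def classify(sizes, centroids):
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    features = message_features(sizes)

    def distance(centroid):
        return sum((a - b) ** 2 for a, b in zip(features, centroid))

    return min(centroids, key=lambda label: distance(centroids[label]))


# Synthetic toy data, invented for this sketch (not measurements from the paper).
training = [
    ("interaction", [120, 300, 180, 250]),
    ("interaction", [90, 200, 150]),
    ("download", [150000, 200000]),
    ("download", [500000]),
]
centroids = train_centroids(training)
print(classify([100, 220, 160], centroids))
print(classify([300000], centroids))
```

The features are left unscaled here, so the byte-valued ones dominate the distance; a practical classifier would normally normalize the features and would be trained on labeled traffic traces rather than hand-written values.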
