MTRAC - discovering M2M devices in cellular networks from coarse-grained measurements

Machine-to-Machine (M2M) network traffic is becoming highly relevant in nowadays cellular networks. The ever-increasing number of M2M devices is heavily modifying the traffic patterns observed in cellular networks, and the interest in discovering and tracking these devices is rapidly growing among operators. In this paper we introduce MTRAC, a complete approach for M2M TRAffic Classification, capable of discovering M2M devices from coarse-grained measurements. MTRAC uses different Machine Learning (ML) algorithms to unveil M2M devices in cellular networks. It relies on very simple traffic descriptors to characterize the communication patterns of each device. These descriptors are robust to traffic encryption techniques, and improve the portability of the MTRAC approach to other network scenarios. MTRAC is implemented on top of DBStream, a novel Data Stream Warehouse which allows to classify M2M devices in an on-line basis, using different temporal and logical traffic aggregations. We study the performance of MTRAC in the on-line classification of more than two months of traffic observed in a operational, nationwide cellular network, comparing different ML algorithms and different traffic aggregation techniques. To the best of our knowledge, MTRAC is the first ML-based approach for automatic M2M device classification in operational cellular networks.

[1]  Marco Mellia,et al.  Large-scale network traffic monitoring with DBStream, a system for rolling big data analysis , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[2]  Fabio Ricciato,et al.  Traffic analysis at short time-scales: an empirical case study from a 3G cellular network , 2008, IEEE Transactions on Network and Service Management.

[3]  Lukasz Golab,et al.  DBStream: An online aggregation, filtering and processing system for network traffic monitoring , 2014, 2014 International Wireless Communications and Mobile Computing Conference (IWCMC).

[4]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[5]  Pierdomenico Fiadino,et al.  HTTPtag: a flexible on-line HTTP classification system for operational 3g networks , 2013, 2013 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[6]  Antonio Pescapè,et al.  Issues and future directions in traffic classification , 2012, IEEE Network.

[7]  Alessandro D'Alconzo,et al.  Device-Specific Traffic Characterization for Root Cause Analysis in Cellular Networks , 2015, TMA.

[8]  Grenville J. Armitage,et al.  A survey of techniques for internet traffic classification using machine learning , 2008, IEEE Communications Surveys & Tutorials.

[9]  Dario Rossi,et al.  Reviewing Traffic Classification , 2013, Data Traffic Monitoring and Analysis.

[10]  Markus Rupp,et al.  Users in cells: A data traffic analysis , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[11]  Lusheng Ji,et al.  A first look at cellular machine-to-machine traffic: large scale measurement and characterization , 2012, SIGMETRICS '12.

[12]  Marco Mellia,et al.  DNS to the rescue: discerning content and services in a tangled web , 2012, IMC '12.