Understanding and Predicting Data Hotspots in Cellular Networks

The unprecedented growth in mobile data usage is posing significant challenges to cellular operators. One key challenge is how to provide quality of service to subscribers when their residing cell is experiencing a significant amount of traffic, i.e. becoming a traffic hotspot. In this paper, we perform an empirical study on data hotspots in today’s cellular networks using a 9-week cellular dataset with 734K+ users and 5327 cell sites. Our analysis examines in details static and dynamic characteristics, predictability, and causes of data hotspots, and their correlation with call hotspots. We show that using standard machine learning methods, future hotspots can be accurately predicted from past observations. We believe the understanding of these key issues will lead to more efficient and responsive resource management and thus better QoS provision in cellular networks. To the best of our knowledge, our work is the first to empirically characterize traffic hotspots in today’s cellular networks.

[1]  A. Liu,et al.  Characterizing and modeling internet traffic dynamics of cellular devices , 2011, PERV.

[2]  R. Sinnott Virtues of the Haversine , 1984 .

[3]  Ameet Talwalkar,et al.  Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  Samir Ranjan Das,et al.  Understanding traffic dynamics in cellular data networks , 2011, 2011 Proceedings IEEE INFOCOM.

[6]  Xiaoli Chu,et al.  User data traffic analysis for 3G cellular networks , 2013, 2013 8th International Conference on Communications and Networking in China (CHINACOM).

[7]  Zhifeng Zhao,et al.  The predictability of cellular networks traffic , 2012, 2012 International Symposium on Communications and Information Technologies (ISCIT).

[8]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[9]  Markus Rupp,et al.  Users in cells: A data traffic analysis , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Margaret Martonosi,et al.  Identifying Important Places in People's Lives from Cellular Network Data , 2011, Pervasive.

[12]  D. Defays,et al.  An Efficient Algorithm for a Complete Link Method , 1977, Comput. J..

[13]  Samir Ranjan Das,et al.  Understanding spatial relationships in resource usage in cellular data networks , 2012, 2012 Proceedings IEEE INFOCOM Workshops.

[14]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[15]  Lusheng Ji,et al.  Characterizing geospatial dynamics of application usage in a 3G cellular data network , 2012, 2012 Proceedings IEEE INFOCOM.