A Mobility Analytical Framework for Big Mobile Data in Densely Populated Area

Due to the pervasiveness of mobile devices, a vast amount of geolocated data is generated, which allows us to gain deep insight into human behavior. Among other data sources, the analysis of data traffic from mobile Internet enables the study of mobile subscribers' movements over long time periods at large scales, which is paramount to research over a wide range of disciplines, e.g., sociology, transportation, epidemiology, networking, etc. However, to efficiently analyze the massive data traffic from the view of user mobility, several technical challenges have to be tackled before releasing the full potential of such data sources, including data collection, trajectory construction, data noise removing, data storage, and methods for analyzing user mobility. This paper introduces a mobility analytical framework for big mobile data, based on real data traffic collected from second-, third- and fourth-generation networks, which covered nearly 7 million people. To construct a user's history trajectories, we apply different rules to extract users' locations from different data sources and reduce oscillations between the cell towers. The comparison of mobility characteristics between our mobile data and other existing data sources shows the large potential of mobile Internet data traffic to study human mobility. In addition, our experiments discover the changing of city hotspots, the movement patterns during peak hours, and people with similar history trajectories, which uncover the common rules that exist among huge populations in a city.

[1]  Sara Gatmir-Motahari,et al.  Time-Clustering-Based Place Prediction for Wireless Subscribers , 2013, IEEE/ACM Transactions on Networking.

[2]  Yi-Bing Lin,et al.  EasyConnect: A Management System for IoT Devices and Its Applications for Interactive Design and Art , 2015, IEEE Internet of Things Journal.

[3]  Yi-Bing Lin,et al.  Prefetching for mobile web album , 2016, Wirel. Commun. Mob. Comput..

[4]  Dino Pedreschi,et al.  A Complexity Science Perspective on Human Mobility , 2013, Mobility Data.

[5]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[6]  David Konopnicki,et al.  SoLoMo analytics for telco Big Data monetization , 2014, IBM J. Res. Dev..

[7]  Wei Xiang,et al.  Big data-driven optimization for mobile networks toward 5G , 2016, IEEE Network.

[8]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[9]  Zhifeng Zhao,et al.  Human Mobility Patterns in Cellular Networks , 2013, IEEE Communications Letters.

[10]  T. Geisel,et al.  The scaling laws of human travel , 2006, Nature.

[11]  Nei Kato,et al.  Global and individual mobility pattern discovery based on hotspots , 2015, 2015 IEEE International Conference on Communications (ICC).

[12]  Gerhard Fettweis,et al.  A Framework Enabling Spatial Analysis of Mobile Traffic Hot Spots , 2014, IEEE Wireless Communications Letters.

[13]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[14]  Enhong Chen,et al.  Cell Oscillation Resolution in Mobility Profile Building , 2012, ArXiv.

[15]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[16]  Christian Bettstetter,et al.  Smooth is better than sharp: a random mobility model for simulation of wireless networks , 2001, MSWIM '01.

[17]  Zbigniew Smoreda,et al.  Unravelling daily human mobility motifs , 2013, Journal of The Royal Society Interface.

[18]  Balázs Csanád Csáji,et al.  Exploring the Mobility of Mobile Phone Users , 2012, ArXiv.

[19]  Yu Zheng,et al.  Location-Based Social Networks: Users , 2011, Computing with Spatial Trajectories.

[20]  Ning Lu,et al.  Soft-defined heterogeneous vehicular network: architecture and challenges , 2015, IEEE Network.

[21]  Zbigniew Smoreda,et al.  Spatiotemporal Data from Mobile Phones for Personal Mobility Assessment , 2013 .

[22]  M. Barthelemy,et al.  From mobile phone data to the spatial structure of cities , 2014, Scientific Reports.

[23]  Chaoming Song,et al.  Modelling the scaling properties of human mobility , 2010, 1010.0436.

[24]  Murat Ali Bayir,et al.  Mobility profiler: A framework for discovering mobility profiles of cell phone users , 2010, Pervasive Mob. Comput..

[25]  Mahadev Konar,et al.  ZooKeeper: Wait-free Coordination for Internet-scale Systems , 2010, USENIX ATC.

[26]  Wei Wu,et al.  Oscillation Resolution for Mobile Phone Cellular Tower Data to Enable Mobility Modelling , 2014, 2014 IEEE 15th International Conference on Mobile Data Management.

[27]  Margaret Martonosi,et al.  ON CELLULAR , 2022 .

[28]  LuJiaheng,et al.  Big data challenge , 2013 .

[29]  Vanessa Ann Davies,et al.  EVALUATING MOBILITY MODELS WITHIN AN AD HOC NETWORK , 2000 .

[30]  Nei Kato,et al.  Relay-by-smartphone: realizing multihop device-to-device communications , 2014, IEEE Communications Magazine.

[31]  Do Young Eun,et al.  Crossing Over the Bounded Domain: From Exponential to Power-Law Intermeeting Time in Mobile Ad Hoc Networks , 2009, IEEE/ACM Transactions on Networking.

[32]  Klara Nahrstedt,et al.  Jyotish: A novel framework for constructing predictive model of people movement from joint Wifi/Bluetooth trace , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[33]  Cecilia Mascolo,et al.  A Tale of Many Cities: Universal Patterns in Human Urban Mobility , 2011, PloS one.

[34]  David J. Aldous,et al.  Lower bounds for covering times for reversible Markov chains and random walks on graphs , 1989 .

[35]  Cecilia Mascolo,et al.  NextPlace: A Spatio-temporal Prediction Framework for Pervasive Systems , 2011, Pervasive.

[36]  Injong Rhee,et al.  On the levy-walk nature of human mobility , 2011, TNET.

[37]  Yu Zheng,et al.  Computing with Spatial Trajectories , 2011, Computing with Spatial Trajectories.

[38]  Tristan Henderson,et al.  CRAWDAD: a community resource for archiving wireless data at Dartmouth , 2005, CCRV.

[39]  Fang Liu,et al.  Characterizing User Behavior in Mobile Internet , 2015, IEEE Transactions on Emerging Topics in Computing.

[40]  Nei Kato,et al.  Device-to-device communications for enhancing quality of experience in software defined multi-tier LTE-A networks , 2015, IEEE Network.

[41]  Alessandro Vespignani,et al.  Multiscale mobility networks and the spatial spreading of infectious diseases , 2009, Proceedings of the National Academy of Sciences.

[42]  Ying Zhang User mobility from the view of cellular data networks , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[43]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[44]  Tao Zhou,et al.  Diversity of individual mobility patterns and emergence of aggregated scaling laws , 2012, Scientific Reports.

[45]  Wen-Jing Hsu,et al.  Mining GPS data for mobility patterns: A survey , 2014, Pervasive Mob. Comput..

[46]  Marta C. González,et al.  A universal model for mobility and migration patterns , 2011, Nature.

[47]  Zygmunt J. Haas,et al.  Predictive distance-based mobility management for PCS networks , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[48]  Lars George,et al.  HBase: The Definitive Guide , 2011 .

[49]  Feng Liu,et al.  Monitoring and analyzing big traffic data of a large-scale cellular network with Hadoop , 2014, IEEE Network.

[50]  Kevin C. Almeroth,et al.  Towards realistic mobility models for mobile ad hoc networks , 2003, MobiCom '03.

[51]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[52]  Xin Lu,et al.  Approaching the Limit of Predictability in Human Mobility , 2013, Scientific Reports.

[53]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[54]  Cai Qi,et al.  CORRIGENDUM: Analysis of interactions between SNARE proteins using imaging ellipsometer coupled with microfluidic array , 2014, Scientific Reports.

[55]  Mario Gerla,et al.  Wireless network multicasting , 1998 .

[56]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[57]  Hari Balakrishnan,et al.  Accurate, Low-Energy Trajectory Mapping for Mobile Devices , 2011, NSDI.

[58]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[59]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[60]  Suvadip Batabyal,et al.  Mobility Models, Traces and Impact of Mobility on Opportunistic Routing Algorithms: A Survey , 2015, IEEE Communications Surveys & Tutorials.

[61]  Matthias Sax,et al.  Apache Kafka , 2019, Encyclopedia of Big Data Technologies.

[62]  Marcos Dias de Assunção,et al.  Apache Spark , 2019, Encyclopedia of Big Data Technologies.

[63]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.