Zipf's power law in activity schedules and the effect of aggregation

Abstract People’s behavior depends on extremely complex, multidimensional processes. This poses challenges when trying to model their behavior. In the transportation modeling community, great effort is spent to model the activity schedules of people. Remarkably however, the frequency of occurrence of day-long activity schedules obeys a ubiquitous power law distribution, commonly referred to as Zipf’s law. Previous research established the universal nature of this distribution and proposed potential application areas. However, these application areas require additional information about the distribution’s properties. To stress-test this universal power law, this paper discusses the role of aggregation within the phenomenon of Zipf’s law in activity schedules. Aggregation is analyzed in three dimensions: activity type encoding, aggregation over time and the aggregation of individual data. Five data sets are used: the household travel survey from the USA (2009) and from GBR (2009–2014), two six-week travel surveys (DEU MobiDrive 1999 and CHE Thurgau 2003) and a donated 450-day data set from one individual. To analyze the effect of aggregation in the first dimension, five different activity encoding aggregation levels were created, each aggregating the activity types somewhat differently. In the second dimension, the distribution of schedules is compared over multiple years and over the days of the week. Finally, in the third dimension, the analysis moves from study area-wide aggregated data to subsets of the data, and finally to individual (longitudinal) data.

[1]  Soora Rasouli,et al.  Two-regime pattern in human mobility:evidence from GPS taxi trajectory data , 2016 .

[2]  Colin S Gillespie,et al.  Fitting Heavy Tailed Distributions: The poweRlaw Package , 2014, 1407.3492.

[3]  S. Piantadosi,et al.  The ecological fallacy. , 1988, American journal of epidemiology.

[4]  Soora Rasouli,et al.  Activity-based models of travel demand: promises, progress and prospects , 2014 .

[5]  B. Corominas-Murtra,et al.  Universality of Zipf's law. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  Kay W. Axhausen,et al.  Mobidrive: A six week travel diary , 2004 .

[7]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[8]  Yannis M. Ioannides,et al.  Zipf’s law for cities : an empirical examination , 2000 .

[9]  Chaoming Song,et al.  Modelling the scaling properties of human mobility , 2010, 1010.0436.

[10]  R. Hanel,et al.  Fitting power-laws in empirical data with estimators that work for all exponents , 2016, PloS one.

[11]  M. E. J. Newman,et al.  Power laws, Pareto distributions and Zipf's law , 2005 .

[12]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[13]  T. Geisel,et al.  The scaling laws of human travel , 2006, Nature.

[14]  Siddharth Gupta,et al.  The TimeGeo modeling framework for urban mobility without travel surveys , 2016, Proceedings of the National Academy of Sciences.

[15]  M. Marsili,et al.  Interacting Individuals Leading to Zipf's Law , 1998, cond-mat/9801289.

[16]  Wen-chen Chen On the Weak form of Zipf's law , 1980, Journal of Applied Probability.

[17]  Bin Jiang,et al.  Zipf's law for all the natural cities in the United States: a geospatial perspective , 2010, Int. J. Geogr. Inf. Sci..

[18]  Dino Pedreschi,et al.  The purpose of motion: Learning activities from Individual Mobility Networks , 2014, 2014 International Conference on Data Science and Advanced Analytics (DSAA).

[19]  Wentian Li,et al.  Random texts exhibit Zipf's-law-like word frequency distribution , 1992, IEEE Trans. Inf. Theory.

[20]  George Kingsley Zipf,et al.  Human Behaviour and the Principle of Least Effort: an Introduction to Human Ecology , 2012 .

[21]  Carlos M. Urzúa Testing for Zipf’s Law: A Common Pitfall , 2011 .

[22]  Davy Janssens,et al.  Developing an optimised activity type annotation method based on classification accuracy and entropy indices , 2017 .

[23]  Roberto Trasarti,et al.  TOSCA: two-steps clustering algorithm for personal locations detection , 2015, SIGSPATIAL/GIS.

[24]  X. Gabaix Zipf's Law for Cities: An Explanation , 1999 .

[25]  K. Soo Zipf's law for cities: a cross-country investigation , 2005 .

[26]  W. Reed The Pareto, Zipf and other power laws , 2001 .

[27]  Renato Redondi,et al.  A Comparative Study of Airport Connectivity in China, Europe and US: Which Network Provides the Best Service to Passengers? , 2010 .

[28]  Luca Pappalardo,et al.  Data-driven generation of spatio-temporal routines in human mobility , 2016, Data Mining and Knowledge Discovery.

[29]  Gaurav Vyas,et al.  Generation of Mandatory Activities and Formation of Mandatory Tours: Application to the Activity-Based Model for Phoenix, AZ , 2015 .

[30]  Yu Liu,et al.  The promises of big data and small data for travel behavior (aka human mobility) analysis , 2016, Transportation research. Part C, Emerging technologies.

[31]  Shengyong Chen,et al.  Study on some bus transport networks in China with considering spatial characteristics , 2014 .

[32]  Michael Löchl Stability of Travel behaviour: Thurgau 2003 , 2005 .

[33]  Sebastian Bernhardsson,et al.  Zipf's law unzipped , 2011, ArXiv.

[34]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[35]  G. Wets,et al.  Exploratory analysis of Zipf’s universal power law in activity schedules , 2018, Transportation.