Detecting activity locations from raw GPS data: a novel kernel-based algorithm

BackgroundHealth studies and mHealth applications are increasingly resorting to tracking technologies such as Global Positioning Systems (GPS) to study the relation between mobility, exposures, and health. GPS tracking generates large sets of geographic data that need to be transformed to be useful for health research. This paper proposes a method to test the performance of activity place detection algorithms, and compares the performance of a novel kernel-based algorithm with a more traditional time-distance cluster detection method.MethodsA set of 750 artificial GPS tracks containing three stops each were generated, with various levels of noise.. A total of 9,000 tracks were processed to measure the algorithms’ capacity to detect stop locations and estimate stop durations, with varying GPS noise and algorithm parameters.ResultsThe proposed kernel-based algorithm outperformed the traditional algorithm on most criteria associated to activity place detection, and offered a stronger resilience to GPS noise, managing to detect up to 92.3% of actual stops, and estimating stop duration within 5% error margins at all tested noise levels.ConclusionsCapacity to detect activity locations is an important feature in a context of increasing use of GPS devices in health and place research. While further testing with real-life tracks is recommended, testing algorithms’ performance with artificial track sets for which characteristics are controlled is useful. The proposed novel algorithm outperformed the traditional algorithm under these conditions.

[1]  S. Cummins,et al.  Place effects on health: how can we conceptualise, operationalise and measure them? , 2002, Social science & medicine.

[2]  Torsten Hägerstraand WHAT ABOUT PEOPLE IN REGIONAL SCIENCE , 1970 .

[3]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[4]  Xing Xie,et al.  Mining interesting locations and travel sequences from GPS trajectories , 2009, WWW '09.

[5]  Petteri Nurmi,et al.  Identifying Meaningful Places , 2009 .

[6]  Y. Kestens,et al.  Using experienced activity spaces to measure foodscape exposure. , 2010, Health and Place.

[7]  Kentaro Toyama,et al.  Project Lachesis: Parsing and Modeling Location Histories , 2004, GIScience.

[8]  Thad Starner,et al.  Learning Significant Locations and Predicting User Movement with GPS , 2002, Proceedings. Sixth International Symposium on Wearable Computers,.

[9]  Daniel Fuller,et al.  The impact of implementing a public bicycle share program on the likelihood of collisions and near misses in Montreal, Canada. , 2013, Preventive medicine.

[10]  Filip Biljecki,et al.  Automatic segmentation and classification of movement trajectories for transportation modes , 2010 .

[11]  Christian S. Jensen,et al.  Mining significant semantic locations from GPS data , 2010, Proc. VLDB Endow..

[12]  Xun Shi,et al.  INTERNATIONAL JOURNAL OF HEALTH GEOGRAPHICS METHODOLOGY Density estimation and adaptive bandwidths: , 2022 .

[13]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[14]  Y. Kestens,et al.  Conceptualization and measurement of environmental exposure in epidemiology: accounting for activity space related to daily mobility. , 2013, Health & place.

[15]  P. Diggle,et al.  Spatial point pattern analysis and its application in geographical epidemiology , 1996 .

[16]  Basile Chaix,et al.  Geographic life environments and coronary heart disease: a literature review, theoretical contributions, methodological updates, and a research agenda. , 2009, Annual review of public health.

[17]  KangJong Hee,et al.  Extracting places from traces of locations , 2005 .

[18]  Shashi Shekhar,et al.  Mining Personally Important Places from GPS Tracks , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[19]  B. Worton Kernel methods for estimating the utilization distribution in home-range studies , 1989 .

[20]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[21]  Torsten Hägerstrand REFLECTIONS ON “WHAT ABOUT PEOPLE IN REGIONAL SCIENCE?” , 1989 .

[22]  Basile Chaix,et al.  GPS tracking in neighborhood and health studies: a step forward for environmental exposure assessment, a step backward for causal inference? , 2013, Health & place.

[23]  Nobuyuki Enomoto,et al.  Algorithm for Detecting Significant Locations from Raw GPS Data , 2010, Discovery Science.

[24]  Carlos R. García-Alonso,et al.  Spatial analysis to identify hotspots of prevalence of schizophrenia , 2008, Social Psychiatry and Psychiatric Epidemiology.

[25]  Y. Kestens,et al.  Cohort profile: residential and non-residential environments, individual activity spaces and cardiovascular risk factors and diseases--the RECORD Cohort Study. , 2012, International journal of epidemiology.

[26]  Ian Matthews,et al.  Spatial Contouring of Risk: A Tool for Environmental Epidemiology , 2004, Epidemiology.

[27]  Eduardo Mario Nebot,et al.  Mining GPS data for extracting significant places , 2009, 2009 IEEE International Conference on Robotics and Automation.

[28]  Elizabeth Shay,et al.  Identifying walking trips from GPS and accelerometer data in adolescent females. , 2012, Journal of physical activity & health.

[29]  Sunghyun Choi,et al.  Proceedings of the 3rd ACM international workshop on Wireless mobile applications and services on WLAN hotspots , 2005 .

[30]  Itai Kloog,et al.  Using kernel density function as an urban analysis tool: Investigating the association between nightlight exposure and the incidence of breast cancer in Haifa, Israel , 2009, Comput. Environ. Urban Syst..

[31]  R. Shephard Use of a New Public Bicycle Share Program in Montreal, Canada , 2012 .

[32]  Edmund Seto,et al.  A study of community design, greenness, and physical activity in children using satellite, GPS and accelerometer data. , 2012, Health & place.

[33]  Steven Cummins,et al.  Understanding and representing 'place' in health research: a relational approach. , 2007, Social science & medicine.

[34]  Henry A. Kautz,et al.  Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields , 2007, Int. J. Robotics Res..

[35]  N. Levine Crime Mapping and the CrimeStat Program , 2006 .

[36]  Thad Starner,et al.  Using GPS to learn significant locations and predict movement across multiple users , 2003, Personal and Ubiquitous Computing.

[37]  Basile Chaix,et al.  An interactive mapping tool to assess individual mobility patterns in neighborhood studies. , 2012, American journal of preventive medicine.

[38]  Gaetano Borriello,et al.  Extracting places from traces of locations , 2004, MOCO.

[39]  Steven Cummins,et al.  Commentary: investigating neighbourhood effects on health--avoiding the 'local trap'. , 2007, International journal of epidemiology.

[40]  M. Brauer,et al.  The impact of daily mobility on exposure to traffic-related air pollution and health effect estimates , 2011, Journal of Exposure Science and Environmental Epidemiology.

[41]  Shih-Lung Shaw,et al.  Exploring potential human activities in physical and virtual spaces: a spatio‐temporal GIS approach , 2008, Int. J. Geogr. Inf. Sci..

[42]  Daniel Fuller,et al.  Use of a new public bicycle share program in Montreal, Canada. , 2011, American journal of preventive medicine.