Refining Time-Activity Classification of Human Subjects Using the Global Positioning System

Background Detailed spatial location information is important for accurately estimating personal exposure to air pollution. The Global Positioning System (GPS) has been widely used to track personal paths and activities. Previous researchers have developed time-activity classification models based on GPS data, but most of these models were developed for specific regions. An adaptive model for time-location classification could be widely applied to air pollution studies that use GPS to track individual-level time-activity patterns.
Methods Time-activity data were collected for seven days using GPS loggers and accelerometers from thirteen adult participants in Southern California under free-living conditions. We developed an automated model based on random forests to classify major time-activity patterns (i.e., indoor, outdoor static, outdoor walking, and in-vehicle travel). Sensitivity analysis was conducted to examine the contribution of the accelerometer data and the supplemental spatial data (i.e., roadway and tax parcel data) to the accuracy of time-activity classification. The model was evaluated using both leave-one-fold-out and leave-one-subject-out cross-validation.
Results Maximum speeds in averaging time intervals of 7 and 5 minutes, and distance to primary highways with limited access, were the three most important variables in the classification model. Leave-one-fold-out cross-validation showed an overall accuracy of 99.71%. Sensitivities ranged from 84.62% (outdoor walking) to 99.90% (indoor). Specificities ranged from 96.33% (indoor) to 99.98% (outdoor static). Excluding the accelerometer and ambient light sensor variables caused a slight loss in sensitivity for outdoor walking, but little loss in overall accuracy. However, leave-one-subject-out cross-validation showed a considerable loss in sensitivity for the outdoor static and outdoor walking categories.
Conclusions The random forests classification model can achieve high accuracy for the four major time-activity categories. The model also performed well with just GPS, road and tax parcel data. However, caution is warranted when generalizing the model developed from a small number of subjects to other populations.
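
The two evaluation ideas named in the abstract, leave-one-subject-out splitting and per-category sensitivity and specificity, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the category labels as strings, and the toy records are all assumptions made for illustration, and the classifier itself (a random forest in the study) is left out so that the example needs only the standard library.

```python
from collections import defaultdict

# Category labels are assumed spellings of the four classes in the abstract.
CLASSES = ["indoor", "outdoor static", "outdoor walking", "in-vehicle"]


def leave_one_subject_out(subject_ids):
    """Yield (held_out_subject, train_indices, test_indices), one split per subject.

    All records from the held-out subject go to the test set, so the model
    is never evaluated on a person it was trained on.
    """
    by_subject = defaultdict(list)
    for index, subject in enumerate(subject_ids):
        by_subject[subject].append(index)
    for held_out in sorted(by_subject):
        test = by_subject[held_out]
        train = sorted(
            i for s, indices in by_subject.items() if s != held_out for i in indices
        )
        yield held_out, train, test


def sensitivity_specificity(y_true, y_pred, classes=CLASSES):
    """Return {class: (sensitivity, specificity)} using one-vs-rest counts."""
    results = {}
    pairs = list(zip(y_true, y_pred))
    for c in classes:
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        tn = sum(1 for t, p in pairs if t != c and p != c)
        sens = tp / (tp + fn) if (tp + fn) else float("nan")
        spec = tn / (tn + fp) if (tn + fp) else float("nan")
        results[c] = (sens, spec)
    return results


if __name__ == "__main__":
    # Toy records (hypothetical): true label, predicted label, subject ID.
    y_true = ["indoor", "indoor", "indoor", "outdoor walking",
              "outdoor walking", "in-vehicle", "outdoor static", "indoor"]
    y_pred = ["indoor", "indoor", "indoor", "indoor",
              "outdoor walking", "in-vehicle", "outdoor static", "indoor"]
    subjects = ["s1", "s1", "s2", "s2", "s3", "s3", "s1", "s3"]

    for held_out, train, test in leave_one_subject_out(subjects):
        print(held_out, "train:", train, "test:", test)

    for label, (sens, spec) in sensitivity_specificity(y_true, y_pred).items():
        print(f"{label}: sensitivity={sens:.2f}, specificity={spec:.2f}")
```

In the toy data, one outdoor walking record is misclassified as indoor, which lowers sensitivity for outdoor walking and specificity for indoor while leaving overall accuracy fairly high. This mirrors, in miniature, why the abstract reports per-category sensitivity and specificity alongside overall accuracy when one category dominates the data.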
