Bootstrap Based Uncertainty Propagation for Data Quality Estimation in Crowdsensing Systems

The diffusion of mobile devices equipped with sensing, computation, and communication capabilities is opening unprecedented possibilities for high-resolution, spatio-temporal mapping of several phenomena. This novel data generation, collection, and processing paradigm, termed crowdsensing, lays upon complex, distributed cyberphysical systems. Collective data gathering from heterogeneous, spatially distributed devices inherently raises the question of how to manage different quality levels of contributed data. In order to extract meaningful information, it is, therefore, desirable to the introduction of effective methods for evaluating the quality of data. In this paper, we propose an approach aimed at systematic accuracy estimation of quantities provided by end-user devices of a crowd-based sensing system. This is obtained thanks to the combination of statistical bootstrap with uncertainty propagation techniques, leading to a consistent and technically sound methodology. Uncertainty propagation provides a formal framework for combining uncertainties, resulting from different quantities influencing a given measurement activity. Statistical bootstrap enables the characterization of the sampling distribution of a given statistics without any prior assumption on the type of statistical distributions behind the data generation process. The proposed approach is evaluated on synthetic benchmarks and on a real world case study. Cross-validation experiments show that confidence intervals computed by means of the presented technique show a maximum 1.5% variation with respect to interval widths computed by means of controlled standard Monte Carlo methods, under a wide range of operating conditions. In general, experimental results confirm the suitability and validity of the introduced methodology.

[1]  Alessandro Bogliolo,et al.  Geospatial data aggregation and reduction in vehicular sensing applications: The case of road surface monitoring , 2014, 2014 International Conference on Connected Vehicles and Expo (ICCVE).

[2]  Joon-Hyuk Chang,et al.  Improved Gaussian Mixture Regression Based on Pseudo Feature Generation Using Bootstrap in Blood Pressure Estimation , 2016, IEEE Transactions on Industrial Informatics.

[3]  Cecilia Mascolo,et al.  ParkSense: a smartphone based sensing system for on-street parking , 2013, MobiCom.

[4]  Kendrick Kay,et al.  Statistics and Data Analysis in MATLAB , 2014 .

[5]  Karl Aberer,et al.  ExposureSense: Integrating daily activities with air quality using mobile participatory sensing , 2013, 2013 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).

[6]  Alberto Carini,et al.  SmartRoadSense: Collaborative Road Surface Condition Monitoring , 2014 .

[7]  T. Hesterberg,et al.  What Teachers Should Know About the Bootstrap: Resampling in the Undergraduate Statistics Curriculum , 2014, The American statistician.

[8]  Lothar Thiele,et al.  Deriving high-resolution urban air pollution maps using mobile sensor nodes , 2015 .

[9]  John R. Taylor Introduction to Error Analysis, the Study of Uncertainties in Physical Measurements, 2nd Edition , 1997 .

[10]  Peter M. Harris,et al.  Evaluation of Measurement Uncertainty Based on the Propagation of Distributions Using Monte Carlo Simulation , 2003 .

[11]  Zhigang Liu,et al.  Darwin phones: the evolution of sensing and inference on mobile phones , 2010, MobiSys '10.

[12]  Zhu Wang,et al.  Mobile Crowd Sensing and Computing , 2015, ACM Comput. Surv..

[13]  M. G. Cox,et al.  Propagation of distributions by a Monte Carlo method, with an application to ratio models , 2009 .

[14]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[15]  Cecilia Mascolo,et al.  Smart Sensing Systems for the Daily Drive , 2016, IEEE Pervasive Computing.

[16]  Ryan Newton,et al.  The pothole patrol: using a mobile sensor network for road surface monitoring , 2008, MobiSys '08.

[17]  Marco Gruteser,et al.  ParkNet: drive-by sensing of road-side parking statistics , 2010, MobiSys '10.

[18]  Lothar Thiele,et al.  OpenSense: open community driven sensing of environment , 2010, IWGS '10.

[19]  Wen Hu,et al.  On the need for a reputation system in mobile phone based sensing , 2014, Ad Hoc Networks.

[20]  Julie A. McCann,et al.  Lightweight Sensing Uncertainty Metric—Incorporating Accuracy and Trust , 2014, IEEE Sensors Journal.

[21]  Martin Krzywinski,et al.  Points of Significance: Sampling distributions and the bootstrap , 2015, Nature Methods.

[22]  Wen Hu,et al.  Ear-phone: an end-to-end participatory urban noise mapping system , 2010, IPSN '10.

[23]  Hengchang Liu,et al.  SmartRoad , 2015, ACM Trans. Sens. Networks.

[24]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[25]  Emery N. Brown,et al.  Propagation of Uncertainty and the Bootstrap , 2014 .

[26]  Wen Hu,et al.  Are you contributing trustworthy data?: the case for a reputation system in participatory sensing , 2010, MSWIM '10.

[27]  Fan Ye,et al.  Mobile crowdsensing: current state and future challenges , 2011, IEEE Communications Magazine.

[28]  Alessandro Bogliolo,et al.  Efficient Algorithms for Accuracy Improvement in Mobile Crowdsensing Vehicular Applications , 2015 .

[29]  Sivan Toledo,et al.  VTrack: accurate, energy-aware road traffic delay estimation using mobile phones , 2009, SenSys '09.

[30]  Victor C. M. Leung,et al.  Vita: A Crowdsensing-Oriented Mobile Cyber-Physical System , 2013, IEEE Transactions on Emerging Topics in Computing.

[31]  Emiliano Miluzzo,et al.  A survey of mobile phone sensing , 2010, IEEE Communications Magazine.

[32]  Xiang-Yang Li,et al.  Opportunistic coverage for urban vehicular sensing , 2015, Comput. Commun..

[33]  Andreas Schrader,et al.  SoundOfTheCity - Continuous noise monitoring for a healthy city , 2013, 2013 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).