Towards altruistic data quality assessment for mobile sensing

High-quality data is a necessity for successful research and development endeavors. In this article, we review relevant literature for data quality (DQ) assessment methods in different domains and discuss the possibilities, challenges and constraints of applying them to mobile sensing. We identify DQ dimensions directly applicable to sensor data: believability (comparison with the correct operating bounds), completeness (missing values), free-of-error (erroneous values), consistency (over time), timeliness (delay), accuracy (deviation from true value) and precision (granularity of readings) are core aspects of high-quality sensor data. We also emphasize that sensor data must be representative of the originating type of sensor. We propose an altruistic approach to DQ assessment for sensor data that facilitates aggregating and sharing of domain knowledge through a community-contributed library of DQ assessment methods organized by sensor type.

[1]  Laura Sebastian-Coleman,et al.  Measuring Data Quality for Ongoing Improvement: A Data Quality Assessment Framework , 2012 .

[2]  Denzil Ferreira,et al.  Understanding the Challenges of Mobile Phone Usage Data , 2015, MobileHCI.

[3]  Juha Röning,et al.  From User-independent to Personal Human Activity Recognition Models Using Smartphone Sensors , 2016, ESANN.

[4]  Richard Y. Wang,et al.  Data Quality Assessment , 2002 .

[5]  Denzil Ferreira,et al.  Detecting Drinking Episodes in Young Adults Using Smartphone-based Sensors , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[6]  Jorge Gonçalves,et al.  Mobile cloud storage: a contextual experience , 2014, MobileHCI '14.

[7]  Denzil Ferreira,et al.  AWARE: Mobile Context Instrumentation Framework , 2015, Front. ICT.

[8]  Qi Han,et al.  Quality-Aware Sensor Data Management , 2014 .

[9]  Guoliang Xing,et al.  iSleep: unobtrusive sleep quality monitoring using smartphones , 2013, SenSys '13.

[10]  Habib M. Ammari,et al.  The art of wireless sensor networks , 2014 .

[11]  Heidi P. K. Enwald,et al.  Effect of tailored, gamified, mobile physical activity intervention on life satisfaction and self-rated health in young adolescent men: A population-based, randomized controlled trial (MOPO study) , 2017, Comput. Hum. Behav..

[12]  Laura Sebastian-Coleman Introduction: Measuring Data Quality for Ongoing Improvement , 2013 .

[13]  Simo Hosio,et al.  Are Smartphones Ubiquitous?: An in-depth survey of smartphone adoption by seniors , 2017, IEEE Consumer Electronics Magazine.

[14]  Schahram Dustdar,et al.  Data Quality Observation in Pervasive Environments , 2012, 2012 IEEE 15th International Conference on Computational Science and Engineering.

[15]  Wolfgang Lehner,et al.  Representing Data Quality in Sensor Data Streaming Environments , 2009, JDIQ.

[16]  James H. Garrett,et al.  Fault perturbations in building sensor network data streams , 2010, Int. J. Sens. Networks.

[17]  Hajar Mousannif,et al.  Data quality in internet of things: A state-of-the-art survey , 2016, J. Netw. Comput. Appl..

[18]  Minho Shin,et al.  Challenges in Data Quality Assurance in Pervasive Health Monitoring Systems , 2009 .

[19]  Jodi Forlizzi,et al.  Understanding my data, myself: supporting self-reflection with ubicomp technologies , 2011, UbiComp '11.

[20]  Anne Marsden,et al.  International Organization for Standardization , 2014 .

[21]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[22]  Nirvana Meratnia,et al.  Ensuring high sensor data quality through use of online outlier detection techniques , 2010, Int. J. Sens. Networks.

[23]  Vittorio Loreto,et al.  Participatory Sensing, Opinions and Collective Awareness , 2016, Understanding Complex Systems.

[24]  Diane M. Strong,et al.  Data quality in context , 1997, CACM.