Detecting Label Errors in Crowd-Sourced Smartphone Sensor Data

Applications relying on supervised learning algorithms are susceptible to producing false outputs in the presence of label errors, i.e., situations were labels have been corrupted, both deliberately and accidentally. While prior work has focused on detecting and handling label errors for various types of applications, there is a lack of research addressing label errors in smartphone-based crowd-sensing applications, especially when used for action recognition. In this paper, we discuss and address two common types of smartphone-based label errors:mislabeling and multi-action labels. We also compare multiple learning algorithms, including an ensemble of four stratified trained classifiers. The results indicate the importance of the action type for filtering label error. The goal of this work is to provide guidelines for developing effective techniques to discover and remove error labels for action recognition systems.

[1]  Tim J Gabbett,et al.  The Use of Wearable Microsensors to Quantify Sport-Specific Movements , 2015, Sports Medicine.

[3]  Edward G. Carmines,et al.  Reliability and Validity Assessment , 1979 .

[4]  Deborah Estrin,et al.  Improving activity classification for health applications on mobile devices using active and semi-supervised learning , 2010, 2010 4th International Conference on Pervasive Computing Technologies for Healthcare.

[5]  S. Crowder A simple method for studying run-length distribution of exponentially weighted moving average charts , 1987 .

[6]  Hae-Young Kim Statistical notes for clinical researchers: assessing normal distribution (2) using skewness and kurtosis , 2013, Restorative dentistry & endodontics.

[7]  Julien Clinton Sprott,et al.  Some simple chaotic jerk functions , 1997 .

[8]  Delbert S. Elliott,et al.  Reassessing the reliability and validity of self-report delinquency measures , 1986, Journal of Quantitative Criminology.

[9]  Christian Poellabauer,et al.  Quantifying community mobility after stroke using mobile phone technology , 2017, UbiComp/ISWC Adjunct.

[10]  Judith A. McLaughlin Understanding Statistics in the Behavioral Sciences , 2002 .

[11]  George A. F. Seber,et al.  Linear regression analysis , 1977 .

[12]  Sung-Bae Cho,et al.  Activity Recognition Using Hierarchical Hidden Markov Models on a Smartphone with 3D Accelerometer , 2011, HAIS.

[13]  Ling Bao,et al.  Activity Recognition from User-Annotated Acceleration Data , 2004, Pervasive.

[14]  G Atkinson,et al.  Statistical Methods For Assessing Measurement Error (Reliability) in Variables Relevant to Sports Medicine , 1998, Sports medicine.

[15]  Gary M. Weiss,et al.  Activity recognition using cell phone accelerometers , 2011, SKDD.

[16]  K. Gaberson,et al.  Measurement reliability and validity. , 1997, AORN journal.

[17]  Keiji Yanai,et al.  Real-time eating action recognition system on a smartphone , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[18]  A. Petersen,et al.  A self-report measure of pubertal status: Reliability, validity, and initial norms , 1988, Journal of youth and adolescence.

[19]  Florian Michahelles,et al.  Driving behavior analysis with smartphones: insights from a controlled field study , 2012, MUM.

[20]  Christian Poellabauer,et al.  Evaluation of Sensing and Processing Parameters for Human Action Recognition , 2017, 2017 IEEE International Conference on Healthcare Informatics (ICHI).

[21]  Paul J. M. Havinga,et al.  Activity Recognition Using Inertial Sensing for Healthcare, Wellbeing and Sports Applications: A Survey , 2010, ARCS Workshops.

[22]  Christian Poellabauer,et al.  Configurable Integrated Monitoring System for Mobile Devices , 2014, FNC/MobiSPC.

[23]  Christian Poellabauer,et al.  Assessing health trends of college students using smartphones , 2016, 2016 IEEE Healthcare Innovation Point-Of-Care Technologies Conference (HI-POCT).

[24]  Mahesh K. Marina,et al.  Towards multimodal deep learning for activity recognition on mobile devices , 2016, UbiComp Adjunct.

[25]  Kevin Currell,et al.  Validity, Reliability and Sensitivity of Measures of Sporting Performance , 2008, Sports medicine.

[26]  Nicholas D. Lane,et al.  Can Deep Learning Revolutionize Mobile Sensing? , 2015, HotMobile.

[27]  Richard A. Groeneveld,et al.  Measuring Skewness and Kurtosis , 1984 .

[28]  M. Csíkszentmihályi,et al.  Validity and Reliability of the Experience‐Sampling Method , 1987, The Journal of nervous and mental disease.

[29]  Naonori Ueda,et al.  Training human activity recognition for labels with inaccurate time stamps , 2014, UbiComp Adjunct.

[30]  A. Santiago,et al.  Development of a wearable ZigBee sensor system for upper limb rehabilitation robotics , 2012, 2012 4th IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob).

[31]  Elaine M. Bochniewicz,et al.  Using Wearable Sensors and Machine Learning Models to Separate Functional Upper Extremity Use From Walking-Associated Arm Movements. , 2016, Archives of physical medicine and rehabilitation.

[32]  Christian Poellabauer,et al.  Activity Recognition for Persons With Stroke Using Mobile Phone Technology: Toward Improved Performance in a Home Setting , 2017, Journal of medical Internet research.