Recruit Until It Fails

Distinguishing identities is useful for applications such as automated grocery or personalized recommendations. Unfortunately, several recently proposed identification systems were evaluated using poor recruitment practices. We found that 23 of the 30 systems we surveyed used datasets with 20 or fewer participants. Those studies achieved an average classification accuracy of 93%. We show that classifier performance is misleading when the participant count is small, because the finite precision of measurements places an upper limit on the number of users that can be distinguished. To demonstrate this, we built five systems from publicly available datasets collected from human subjects, each with at least 20 participants. In three cases we achieved accuracies greater than 90% merely by applying readily available machine learning software packages, often with default parameters. For datasets with sufficient participants, we evaluated how performance degrades as the number of participants increases. One of the systems suffered a drop in accuracy of over 35% as the participant count increased from 20 to 250. We argue that datasets with small participant counts do not adequately explore variations. Systems trained on such limited data are likely to misidentify users when the user base grows beyond what was tested. We conclude by explaining generalizable reasons for this issue and providing insights on how to conduct more robust system analysis and design.
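
The argument that finite measurement precision caps the number of distinguishable users can be illustrated with a toy simulation. The sketch below is not the paper's method and uses none of its datasets. It assumes a hypothetical system in which each user has a single true feature value in [0, 1), each measurement adds Gaussian noise, and identification picks the enrolled user whose value is nearest. The function names, the noise level, and the trial counts are all assumptions chosen for illustration.

```python
import random


def max_distinguishable_users(levels_per_feature: int, num_features: int) -> int:
    """Pigeonhole bound: the number of distinct quantized feature vectors.

    If each feature resolves to only `levels_per_feature` values, no system
    can separate more users than there are distinct vectors.
    """
    return levels_per_feature ** num_features


def simulated_accuracy(num_users: int, noise_sd: float = 0.01,
                       trials_per_user: int = 10, seed: int = 0) -> float:
    """Nearest-template identification accuracy on synthetic data.

    Hypothetical setup: one scalar feature per user, drawn uniformly from
    [0, 1), observed with Gaussian noise of standard deviation `noise_sd`.
    """
    rng = random.Random(seed)
    templates = [rng.random() for _ in range(num_users)]
    correct = 0
    for user, true_value in enumerate(templates):
        for _ in range(trials_per_user):
            sample = rng.gauss(true_value, noise_sd)
            # Identify the sample as the user with the closest enrolled value.
            predicted = min(range(num_users),
                            key=lambda u: abs(templates[u] - sample))
            correct += predicted == user
    return correct / (num_users * trials_per_user)


# Accuracy as the enrolled population grows while the sensor stays the same.
for n in (20, 50, 100, 250):
    print(n, round(simulated_accuracy(n), 3))
```

In this toy setting the sensor never changes, yet accuracy falls as more users are packed into the same feature range, which is the qualitative effect the abstract describes. The exact numbers depend entirely on the assumed noise level and say nothing about any real system.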
