Representation Learning for Emotion Recognition from Smartphone Keyboard Interactions

Characteristics of typing on smartphone keyboards among different individuals can elicit emotion, similar to speech prosody or facial expressions. Existing works on typing based emotion recognition rely on feature engineering to build machine learning models, while recent speech and facial expression based techniques have shown the efficacy of learning the features automatically. Therefore, in this work, we explore the effectiveness of such learning models in keyboard interaction based emotion detection. In this paper, we propose an end-to-end framework, which first uses a sequence-based encoding method to automatically learn the representation from raw keyboard interaction pattern and subsequently uses this representation to train a multi-task learning based neural network (MTL-NN)to identify different emotions. We carry out a 3-week in-the-wild study involving 24 participants using a custom keyboard capable of tracing users' interaction pattern during text entry. We collect interaction details like touch speed, error rate, pressure and self-reported emotions (happy, sad, stressed, relaxed) during the study. Our analysis on the collected dataset reveals that the representation learnt from the interaction pattern has an average correlation of 0.901 within the same emotion and 0.811 between different emotions. As a result, the representation is effective in distinguishing different emotions with an average accuracy (AUCROC)of 84%.

[1]  Katarzyna Wac,et al.  Individuals’ Stress Assessment Using Human-Smartphone Interaction Analysis , 2018, IEEE Transactions on Affective Computing.

[2]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[3]  Niels van Berkel,et al.  The Experience Sampling Method on Mobile Devices , 2017, ACM Comput. Surv..

[4]  Wu Guo,et al.  An Attention Pooling Based Representation Learning Method for Speech Emotion Recognition , 2018, INTERSPEECH.

[5]  Pradipta De,et al.  Towards designing an intelligent experience sampling method for emotion detection , 2017, 2017 14th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[6]  Erin Antono,et al.  Overcoming data scarcity with transfer learning , 2017, ArXiv.

[7]  Rich Caruana,et al.  Multitask Learning , 1997, Machine-mediated learning.

[8]  Pradipta De,et al.  TapSense: combining self-report patterns and typing characteristics for smartphone based emotion detection , 2017, MobileHCI.

[9]  Hyun-Jun Kim,et al.  Exploring emotional preference for smartphone applications , 2012, 2012 IEEE Consumer Communications and Networking Conference (CCNC).

[10]  Pradipta De,et al.  Effectiveness of Deep Neural Network Model in Typing-based Emotion Detection on Smartphones , 2018, MobiCom.

[11]  Kasper Hornbæk,et al.  An affect detection technique using mobile commodity sensors in the wild , 2016, UbiComp.

[12]  Niloy Ganguly,et al.  Designing an Experience Sampling Method for Smartphone Based Emotion Detection , 2019, IEEE Transactions on Affective Computing.

[13]  M. Csíkszentmihályi,et al.  The Experience Sampling Method , 2014 .

[14]  Louis-Philippe Morency,et al.  Representation Learning for Speech Emotion Recognition , 2016, INTERSPEECH.

[15]  Pradipta De,et al.  EmoKey: An Emotion-aware Smartphone Keyboard for Mental Health Monitoring , 2019, 2019 11th International Conference on Communication Systems & Networks (COMSNETS).

[16]  Niloy Ganguly,et al.  Emotion detection from touch interactions during text entry on smartphones , 2019, Int. J. Hum. Comput. Stud..

[17]  Hosub Lee,et al.  Towards unobtrusive emotion recognition for affective social communication , 2012, 2012 IEEE Consumer Communications and Networking Conference (CCNC).

[18]  Thomas G. Dietterich,et al.  To transfer or not to transfer , 2005, NIPS 2005.

[19]  Yang Liu,et al.  A Multi-Task Learning Framework for Emotion Recognition Using 2D Continuous Space , 2017, IEEE Transactions on Affective Computing.

[20]  Sidney K. D'Mello,et al.  Detecting boredom and engagement during writing with keystroke analysis, task appraisals, and stable traits , 2013, IUI '13.

[21]  Pradipta De,et al.  Evaluating effectiveness of smartphone typing as an indicator of user emotion , 2017, 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).

[22]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[24]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[25]  Peng Xu,et al.  Emo2Vec: Learning Generalized Emotion Representation by Multi-task Training , 2018, WASSA@EMNLP.

[26]  Katarzyna Wac,et al.  iSensestress: Assessing stress through human-smartphone interaction analysis , 2015, 2015 9th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth).

[27]  Pradipta De,et al.  Impact of experience sampling methods on tap pattern based emotion recognition , 2015, UbiComp/ISWC Adjunct.

[28]  J. Russell A circumplex model of affect. , 1980 .