Approaching Human Performance in Behavior Estimation in Couples Therapy Using Deep Sentence Embeddings

Identifying complex behavior in human interactions for observational studies often involves the tedious process of transcribing and annotating large amounts of data. While there is significant work towards accurate transcription in Automatic Speech Recognition, automatic Natural Language Understanding of high-level human behaviors from the transcribed text is still at an early stage of development. In this paper we present a novel approach for modeling human behavior using sentence embeddings and propose an automatic behavior annotation framework. We explore unsupervised methods of extracting semantic information, using seq2seq models, into deep sentence embeddings and demonstrate that these embeddings capture behaviorally meaningful information. Our proposed framework utilizes LSTM Recurrent Neural Networks to estimate behavior trajectories from these sentence embeddings. Finally, we employ fusion to compare our high-resolution behavioral trajectories with the coarse, session-level behavioral ratings of human annotators in Couples Therapy. Our experiments show that behavior annotation using this framework achieves better results than prior methods and approaches or exceeds human performance in terms of annotator agreement.

[1]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[2]  Panayiotis G. Georgiou,et al.  Couples Behavior Modeling and Annotation Using Low-Resource LSTM Language Models , 2016, INTERSPEECH.

[3]  Panayiotis G. Georgiou,et al.  Power-spectral analysis of head motion signal for behavioral modeling in human interaction , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4]  Mikael Bodén,et al.  A guide to recurrent neural networks and backpropagation , 2001 .

[5]  Jörg Tiedemann,et al.  News from OPUS — A collection of multilingual parallel corpora with tools and interfaces , 2009 .

[6]  Quoc V. Le,et al.  A Neural Conversational Model , 2015, ArXiv.

[7]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[8]  Shrikanth S. Narayanan,et al.  "Rate My Therapist": Automated Detection of Empathy in Drug and Alcohol Counseling via Speech and Language Processing , 2015, PloS one.

[9]  Rahul Gupta,et al.  A language-based generative model framework for behavioral analysis of couples' therapy , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Panayiotis G. Georgiou,et al.  An audio-visual approach to learning salient behaviors in couples' problem solving discussions , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[12]  David C. Atkins,et al.  A Comparison of Natural Language Processing Methods for Automated Coding of Motivational Interviewing. , 2016, Journal of substance abuse treatment.

[13]  Rabab Kreidieh Ward,et al.  Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[14]  G. Margolin,et al.  The Nuts and Bolts of Behavioral Observation of Marital and Family Interaction , 1998, Clinical child and family psychology review.

[15]  Panayiotis G. Georgiou,et al.  Behavioral signal processing for understanding (distressed) dyadic interactions: some recent developments , 2011, J-HGBU '11.

[16]  Athanasios Katsamanis,et al.  Automatic classification of married couples' behavior using audio features , 2010, INTERSPEECH.

[17]  Panayiotis G. Georgiou,et al.  Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language , 2013, Proceedings of the IEEE.

[18]  David C. Atkins,et al.  Traditional versus integrative behavioral couple therapy for significantly and chronically distressed married couples. , 2004, Journal of consulting and clinical psychology.

[19]  Shrikanth S. Narayanan,et al.  A dialog act tagging approach to behavioral coding: a case study of addiction counseling conversations , 2015, INTERSPEECH.

[20]  Panayiotis G. Georgiou,et al.  "That's Aggravating, Very Aggravating": Is It Possible to Classify Behaviors in Couple Interactions Using Automatically Derived Lexical Features? , 2011, ACII.

[21]  C. Bryan,et al.  Improving the detection and prediction of suicidal behavior among military personnel by measuring suicidal beliefs: an evaluation of the Suicide Cognitions Scale. , 2014, Journal of affective disorders.

[22]  Panayiotis G. Georgiou,et al.  Behavioral Coding of Therapist Language in Addiction Counseling Using Recurrent Neural Networks , 2016, INTERSPEECH.

[23]  Haoqi Li,et al.  Sparsely Connected and Disjointly Trained Deep Neural Networks for Low Resource Behavioral Annotation: Acoustic Classification in Couples' Therapy , 2016, INTERSPEECH.

[24]  Che-Wei Huang,et al.  Distributed under Creative Commons Cc-by 4.0 a Technology Prototype System for Rating Therapist Empathy from Audio Recordings in Addiction Counseling , 2022 .

[25]  Shrikanth S. Narayanan,et al.  "It sounds like...": A natural language processing approach to detecting counselor reflections in motivational interviewing. , 2016, Journal of counseling psychology.

[26]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[27]  Panayiotis G. Georgiou,et al.  Analyzing speech rate entrainment and its relation to therapist empathy in drug addiction counseling , 2015, INTERSPEECH.

[28]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.