Computational Analysis of Acoustic Descriptors in Psychotic Patients

Various forms of psychotic disorders, including schizophrenia, can influence how we speak. Therefore, clinicians assess speech and language behaviors of their patients. While it is difficult for humans to quantify speech behaviors precisely, acoustic descriptors, such as tenseness of voice and speech rate, can be quantified automatically. In this work, we identify previously unstudied acoustic descriptors related to the severity of psychotic symptoms within a clinical population (N=29). Our dataset consists of semi-structured interviews between patients and clinicians. Psychotic disorders are often characterized by two groups of symptoms: negative and positive. While negative symptoms are also prevalent in disorders such as depression, positive symptoms in psychotic disorders have rarely been studied from an acoustic and computational perspective. Our experiments show relationships between psychotic symptoms and acoustic descriptors related to voice quality consistency, variation of speech rate and volume, vowel space, and a parameter of glottal flow. Further, we show that certain acoustic descriptors can track a patient’s state from admission to discharge. Finally, we demonstrate that measures from the Brief Psychiatric Rating Scale (BPRS) can be estimated with acoustic descriptors.

[1]  Abeer Alwan,et al.  Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics , 2019, INTERSPEECH.

[2]  Albert A. Rizzo,et al.  Self-Reported Symptoms of Depression and PTSD Are Associated with Reduced Vowel Space in Screening Interviews , 2016, IEEE Transactions on Affective Computing.

[3]  John Kane,et al.  Identifying Regions of Non-Modal Phonation Using Features of the Wavelet Transform , 2011, INTERSPEECH.

[4]  Dan J Stein,et al.  Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013 , 2015, The Lancet.

[5]  John J. Foxe,et al.  Sensory Contributions to Impaired Prosodic Processing in Schizophrenia , 2005, Biological Psychiatry.

[6]  Yusuke Kida,et al.  Voice Activity Detection: Merging Source and Filter-based Information , 2016, IEEE Signal Processing Letters.

[7]  W Goldfarb,et al.  Speech and language faults of schizophrenic children , 1972, Journal of autism and childhood schizophrenia.

[8]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Louis-Philippe Morency,et al.  Investigating voice quality as a speaker-independent indicator of depression and PTSD , 2013, INTERSPEECH.

[10]  R. Baker,et al.  The Psychiatric Mental Status Examination , 1993 .

[11]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[12]  Daphna Weinshall,et al.  Automated Facial Expressions Analysis in Schizophrenia: A Continuous Dynamic Approach , 2015, MindCare.

[13]  K. Scherer,et al.  Vocal indicators of psychiatric treatment effects in depressives and schizophrenics. , 1982, Journal of communication disorders.

[14]  C. Kohler,et al.  Facial emotion perception in schizophrenia: a meta-analytic review. , 2010, Schizophrenia bulletin.

[15]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[16]  Panayiotis G. Georgiou,et al.  The SAIL speaker diarization system for analysis of spontaneous meetings , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[17]  J. Overall,et al.  The Brief Psychiatric Rating Scale , 1962 .

[18]  M. Covington,et al.  Phonetic measures of reduced tongue movement correlate with negative symptom severity in hospitalized patients with first-episode schizophrenia-spectrum disorders , 2012, Schizophrenia Research.

[19]  S. Kay,et al.  The positive and negative syndrome scale (PANSS) for schizophrenia. , 1987, Schizophrenia bulletin.

[20]  John Laver,et al.  Individual features in voice quality. , 1987 .

[21]  T. Hacki Klassifizierung von Glottisdysfunktionen mit Hilfe der Elektroglottographie , 1989 .

[22]  Nivja H. Jong,et al.  Praat script to detect syllable nuclei and measure speech rate automatically , 2009, Behavior research methods.

[23]  R. Neufeld,et al.  Variability in BPRS definitions of positive and negative symptoms , 1995, Schizophrenia Research.

[24]  Louis-Philippe Morency,et al.  Computational study of psychosis symptoms and facial expressions , 2016 .