Analyzing acoustic and prosodic fluctuations in free speech to predict psychosis onset in high-risk youths

The diagnosis and treatment of psychiatric disorders depend on a clinical specialist's analysis of behavior as expressed through language. This analysis is inherently subjective and could benefit from automated, objective acoustic and linguistic processing methods. Such an integrated approach could convey a richer representation of patient speech, particularly of emotional expression. In this work, we explore the potential of acoustic and prosodic metrics to infer clinical variables and predict psychosis, a condition that produces measurable derailment and tangentiality in patient language. To that end, we analyzed recordings of 32 young patients at high risk of developing clinical psychosis. The subjects were evaluated using the Structured Interview for Prodromal Syndromes/Scale of Prodromal Symptoms (SIPS/SOPS) criteria. To analyze the recordings, we examined how different acoustic and prosodic metrics vary over time. This preliminary analysis shows that these features can infer negative symptom severity ratings (i.e., SIPS-Btotal), obtaining a cross-validated Pearson correlation of 0.77 across all subjects. In addition, these features can predict the development of psychosis with accuracy above 90%, outperforming classification using clinical variables only. This improved predictive power can ultimately help provide early treatment and improve quality of life for those at risk of developing psychosis.
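The evaluation summarized above (describe how a metric fluctuates over time, then report a cross-validated Pearson correlation against a clinical rating) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic, the function names are hypothetical, and a one-feature linear fit with leave-one-out cross-validation stands in for whatever features and model the study actually used, which the abstract does not specify.

```python
import math
import random
import statistics


def contour_variability(values):
    """Summarize how a frame-level metric (e.g. pitch in Hz) fluctuates over time."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    return {
        "mean": mean,
        "std": std,
        "range": max(values) - min(values),
        "cv": std / mean if mean else 0.0,  # coefficient of variation
    }


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def fit_line(x, y):
    """Ordinary least-squares slope and intercept for a single feature."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return slope, my - slope * mx


def loo_predictions(x, y):
    """Predict each subject's rating from a model fit on all other subjects."""
    preds = []
    for i in range(len(x)):
        slope, intercept = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        preds.append(slope * x[i] + intercept)
    return preds


# Synthetic stand-in for 32 recordings; none of these numbers come from the study.
rng = random.Random(0)
features, ratings = [], []
for _ in range(32):
    spread = rng.uniform(5.0, 40.0)  # hypothetical pitch variability in Hz
    contour = [rng.gauss(180.0, spread) for _ in range(200)]
    feature = contour_variability(contour)["std"]
    features.append(feature)
    # Hypothetical rating that falls as pitch variability rises, plus noise.
    ratings.append(20.0 - 0.4 * feature + rng.gauss(0.0, 2.0))

predicted = loo_predictions(features, ratings)
r = pearson(predicted, ratings)
print(f"cross-validated Pearson r = {r:.2f}")
```

Computing the correlation on held-out predictions, rather than on a fit to all subjects, keeps the reported value from being inflated by overfitting, which matters with a sample as small as 32.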
