Analysis of phonetic markedness and gestural effort measures for acoustic speech-based depression classification

While acoustic-based links between clinical depression and abnormal speech have been established, there is still however little knowledge regarding what kinds of phonological content is most impacted. Moreover, for automatic speech-based depression classification and depression assessment elicitation protocols, even less is understood as to what phonemes or phoneme transitions provide the best analysis. In this paper we analyze articulatory measures to gain further insight into how articulation is affected by depression. In our investigative experiments, by partitioning acoustic speech data based on lower to high densities of specific phonetic markedness and gestural effort, we demonstrate improvements in depressed/non-depressed classification accuracy and F1 scores.

[1]  K. Scherer,et al.  Vocal indicators of affective disorders. , 1988, Psychotherapy and psychosomatics.

[2]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[3]  Björn W. Schuller,et al.  Enhancing Speech-Based Depression Detection Through Gender Dependent Vowel-Level Formant Features , 2017, AIME.

[4]  John Kane,et al.  COVAREP — A collaborative voice analysis repository for speech technologies , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  J. Darby,et al.  Speech and voice parameters of depression: a pilot study. , 1984, Journal of communication disorders.

[6]  David DeVault,et al.  The Distress Analysis Interview Corpus of human and computer interviews , 2014, LREC.

[7]  Dimitra Vergyri,et al.  The SRI AVEC-2014 Evaluation System , 2014, AVEC '14.

[8]  Linda I Shuster,et al.  Ease of articulation: A replication. , 2015, Journal of communication disorders.

[9]  John Crowe,et al.  Objective Methods for Reliable Detection of Concealed Depression , 2015, Front. ICT.

[10]  T. Pozzo,et al.  Psychomotor Retardation in Depression: A Systematic Review of Diagnostic, Pathophysiologic, and Therapeutic Implications , 2013, BioMed research international.

[11]  Louis Goldstein,et al.  Dynamics and articulatory phonology , 1996 .

[12]  J. Locke,et al.  Cost and complexity: selection for speech and language. , 2008, Journal of theoretical biology.

[13]  C. Browman,et al.  Articulatory Phonology: An Overview , 1992, Phonetica.

[14]  Carol Stoel-Gammon,et al.  The Word Complexity Measure: Description and application to developmental phonology and disorders , 2010, Clinical linguistics & phonetics.

[15]  Diane Austin,et al.  The percentage of consonants correct (PCC) metric: extensions and reliability data. , 1997, Journal of speech, language, and hearing research : JSLHR.

[16]  James R. Glass,et al.  Feature-based pronunciation modeling with trainable asynchrony probabilities , 2004, INTERSPEECH.

[17]  Louis Goldstein,et al.  Articulatory gestures as phonological units , 1989, Phonology.

[18]  H. Lindsley,et al.  Optimization of cellular ELISA for assay of surface antigens on human synoviocytes. , 1997, BioTechniques.

[19]  A. Flint,et al.  Abnormal speech articulation, psychomotor retardation, and subcortical dysfunction in major depression. , 1993, Journal of psychiatric research.

[20]  J. Logemann,et al.  Vocal Tract Control in Parkinson's Disease , 1981 .

[21]  Fabien Ringeval,et al.  AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge , 2016, AVEC@ACM Multimedia.

[22]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[23]  Karen Livescu,et al.  Feature-based pronunciation modeling for automatic speech recognition , 2005 .

[24]  T. Strine,et al.  The PHQ-8 as a measure of current depression in the general population. , 2009, Journal of affective disorders.

[25]  P. Ladefoged A course in phonetics , 1975 .

[26]  J. Logemann,et al.  Vocal tract control in Parkinson's disease: phonetic feature analysis of misarticulations. , 1981, The Journal of speech and hearing disorders.

[27]  Thomas F. Quatieri,et al.  A review of depression and suicide risk assessment using speech analysis , 2015, Speech Commun..

[28]  Angelika Königseder,et al.  Walter de Gruyter , 2016 .

[29]  Gábor Kiss,et al.  Physiological and Cognitive Status Monitoring on the Base of Acoustic-Phonetic Speech Parameters , 2014, SLSP.

[30]  Thomas F. Quatieri,et al.  Phonologically-based biomarkers for major depressive disorder , 2011, EURASIP J. Adv. Signal Process..

[31]  Roland Göcke,et al.  Elicitation Design for Acoustic Depression Classification: An Investigation of Articulation Effort, Linguistic Complexity, and Word Affect , 2017, INTERSPEECH.

[32]  A. Miccio Clinical Problem Solving , 2002 .

[33]  Zhenyu Liu,et al.  Evaluation of Depression Severity in Speech , 2016, BIH.

[34]  Louis Goldstein,et al.  Towards an articulatory phonology , 1986, Phonology.