Acoustic and prosodic information for home monitoring of bipolar disorder

Epidemiological studies suggest that bipolar disorder has a prevalence of about 1% in European countries, becoming one of the most disabling illnesses in working age adults, and often long-term and persistent with complex management and treatment. Therefore, the capacity of home monitoring for patients with this disorder is crucial for their quality of life. The current paper introduces the use of speech-based information as an easy-to-record, ubiquitous and non-intrusive health sensor suitable for home monitoring, and its application in the framework on the NYMPHA-MD project. Some preliminary results also show the potential of acoustic and prosodic features to detect and classify bipolar disorder, by predicting the values of the Hamilton Depression Rating Scale (HDRS) and the Young Mania Rating Scale (YMRS) from speech.

[1]  B. Atal Automatic Speaker Recognition Based on Pitch Contours , 1969 .

[2]  C. Katona,et al.  The Hamilton Rating Scale for Depression , 2002 .

[3]  J A STARKWEATHER,et al.  VOICE QUALITY IN DEPRESSION. , 1965, Journal of abnormal psychology.

[4]  Klara Vicsi,et al.  Examination of the sensitivity of acoustic-phonetic parameters of speech to depression , 2012, 2012 IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom).

[5]  Yasuhiro Omiya,et al.  Voice disability index using pitch rate , 2016, 2016 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES).

[6]  Abdelmalek Amine,et al.  Modelling and Implementation of Complex Systems , 2023, Lecture Notes in Networks and Systems.

[7]  S. Nooteboom,et al.  THE PROSODY OF SPEECH: MELODY AND RHYTHM , 2001 .

[8]  Rachel Sharp,et al.  The Hamilton Rating Scale for Depression. , 2015, Occupational medicine.

[9]  Nobuhiro Furuse,et al.  Chaos of Vowel /a/ in Japanese Patients with Depression: A Preliminary Study , 2005, Journal of occupational health.

[10]  Jie Zhu,et al.  Detecting Manic State of Bipolar Disorder Based on Support Vector Machine and Gaussian Mixture Model Using Spontaneous Speech , 2018, Psychiatry investigation.

[11]  Visar Berisha,et al.  Objective Assessment of Social Skills Using Automated Language Analysis for Identification of Schizophrenia and Bipolar Disorder , 2019, INTERSPEECH.

[12]  Maria Faurholt-Jepsen The NYMPHA-MD Project: Next Generation Mobile Platforms for Health, in Mental Disorders , 2017, European Psychiatry.

[13]  Douglas A. Reynolds,et al.  The SuperSID project: exploiting high-level information for high-accuracy speaker recognition , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[14]  S. Paradiso,et al.  Cognitive and neurological impairment in mood disorders. , 2004, The Psychiatric clinics of North America.

[15]  Anil K. Jain,et al.  Handbook of Fingerprint Recognition , 2005, Springer Professional Computing.

[16]  Hélène Kerhervé,et al.  Two-Stage Feature Selection of Voice Parameters for Early Alzheimer's Disease Prediction , 2018, IRBM.

[17]  Sunil Kumar Kopparapu,et al.  Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-Taper Spectral Estimation , 2016, INTERSPEECH.

[18]  F. Goodwin,et al.  The stages of mania. A longitudinal analysis of the manic episode. , 1973, Archives of general psychiatry.

[19]  Leena Mary Extraction and Representation of Prosody for Speaker, Language, Emotion, and Speech Recognition , 2019 .

[20]  Gorkem Serbes,et al.  Analyzing the effectiveness of vocal features in early telediagnosis of Parkinson's disease , 2017, PloS one.

[21]  John H. L. Hansen,et al.  Nonlinear feature based classification of speech under stress , 2001, IEEE Trans. Speech Audio Process..

[22]  Jon Sánchez,et al.  Automatic emotion recognition using prosodic parameters , 2005, INTERSPEECH.

[23]  Enzo Pasquale Scilingo,et al.  Automatic analysis of speech F0 contour for the characterization of mood changes in bipolar patients , 2015, Biomed. Signal Process. Control..

[24]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[25]  R. C. Young,et al.  A Rating Scale for Mania: Reliability, Validity and Sensitivity , 1978, British Journal of Psychiatry.

[26]  Oscar Mayora-Ibarra,et al.  Classification of bipolar disorder episodes based on analysis of voice and motor activity of patients , 2016, Pervasive Mob. Comput..

[27]  Agnes Grünerbl,et al.  Assessing Bipolar Episodes Using Speech Cues Derived from Phone Calls , 2014, MindCare.

[28]  A. Benba,et al.  Hybridization of best acoustic cues for detecting persons with Parkinson's disease , 2014, 2014 Second World Conference on Complex Systems (WCCS).

[29]  Bachir Boudraa,et al.  Parkinson’s Disease Recognition by Speech Acoustic Parameters Classification , 2016 .

[30]  Nivja H. Jong,et al.  Praat script to detect syllable nuclei and measure speech rate automatically , 2009, Behavior research methods.

[31]  Joan Codina-Filba,et al.  Combining Prosodic, Voice Quality and Lexical Features to Automatically Detect Alzheimer's Disease , 2020, ArXiv.

[32]  Louis-Philippe Morency,et al.  Investigating voice quality as a speaker-independent indicator of depression and PTSD , 2013, INTERSPEECH.

[33]  Sharath Pankanti,et al.  Guide to Biometrics , 2003, Springer Professional Computing.

[34]  André Adami,et al.  Modeling prosodic differences for speaker recognition , 2007, Speech Commun..

[35]  Jody Kreiman,et al.  Perception of aperiodicity in pathological voice. , 2005, The Journal of the Acoustical Society of America.

[36]  Sergio Escalera,et al.  Mobile eHealth Platform for Home Monitoring of Bipolar Disorder , 2021, MMM.

[37]  Oscar Mayora-Ibarra,et al.  Mobile phones as medical devices in mental disorder treatment: an overview , 2014, Personal and Ubiquitous Computing.

[38]  L. Sher Manic-Depressive Illness: Bipolar Disorders and Recurrent Depression, Second Edition. , 2008, The American journal of psychiatry.

[39]  O. Kinouchi,et al.  Speech Graphs Provide a Quantitative Measure of Thought Disorder in Psychosis , 2012, PloS one.

[40]  Mireia Farrús,et al.  Using jitter and shimmer in speaker verification , 2009 .

[41]  Giuseppe Tradigo,et al.  Vocal signal analysis in patients affected by Multiple Sclerosis , 2017, ICCS.

[42]  G. B. Gour,et al.  Voice Disorder Analysis of Thyroid Patients , 2015 .

[43]  H. Lôo [On bipolar disorder]. , 2008, L'Encephale.

[44]  S. Strejilevich,et al.  A Longitudinal Mirror-Image Assessment of Morbidity in Bipolar Disorder , 2017, European Psychiatry.

[45]  Emily Mower Provost,et al.  Mood state prediction from speech of varying acoustic quality for individuals with bipolar disorder , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[46]  John Torous,et al.  Current research and trends in the use of smartphone applications for mood disorders , 2015 .

[47]  Ann K. Wennerstrom The Music of Everyday Speech: Prosody and Discourse Analysis , 2001 .

[48]  S. Strakowski,et al.  A Programmatic Approach to Treatment , 2020 .

[49]  Alessandro Puiatti,et al.  Personal health systems for bipolar disorder Anecdotes, challenges and lessons learnt from MONARCA project , 2013, 2013 7th International Conference on Pervasive Computing Technologies for Healthcare and Workshops.

[50]  Jiang Li,et al.  A Deep Transfer Learning Approach for Improved Post-Traumatic Stress Disorder Diagnosis , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[51]  D. Kupfer,et al.  Can bipolar disorder be viewed as a multi-system inflammatory disease? , 2012, Journal of affective disorders.

[52]  Li-Chen Fu,et al.  An Automatic Assessment System for Alzheimer’s Disease Based on Speech Using Feature Sequence Generator and Recurrent Neural Network , 2019, Scientific Reports.