Clinical investigation of speech signal features among patients with schizophrenia

Background A new area of interest in the search for biomarkers for schizophrenia is the study of the acoustic parameters of speech called 'speech signal features'. Several of these features have been shown to be related to emotional responsiveness, a characteristic that is notably restricted in patients with schizophrenia, particularly those with prominent negative symptoms. Aim Assess the relationship of selected acoustic parameters of speech to the severity of clinical symptoms in patients with chronic schizophrenia and compare these characteristics between patients and matched healthy controls. Methods Ten speech signal features-six prosody features, formant bandwidth and amplitude, and two spectral features-were assessed using 15-minute speech samples obtained by smartphone from 26 inpatients with chronic schizophrenia (at enrollment and 1 week later) and from 30 healthy controls (at enrollment only). Clinical symptoms of the patients were also assessed at baseline and 1 week later using the Positive and Negative Syndrome Scale, the Scale for the Assessment of Negative Symptoms, and the Clinical Global Impression-Schizophrenia scale. Results In the patient group the symptoms were stable over the 1-week interval and the 1-week test-retest reliability of the 10 speech features was good (intraclass correlation coefficients [ICC] ranging from 0.55 to 0.88). Comparison of the speech features between patients and controls found no significant differences in the six prosody features or in the formant bandwidth and amplitude features, but the two spectral features were different: the Mel-frequency cepstral coefficient (MFCC) scores were significantly lower in the patient group than in the control group, and the linear prediction coding (LPC) scores were significantly higher in the patient group than in the control group. Within the patient group, 10 of the 170 associations between the 10 speech features considered and the 17 clinical parameters considered were statistically significant at the p<0.05 level. Conclusions This study provides some support for the potential value of speech signal features as indicators (i.e., biomarkers) of the severity of negative symptoms in schizophrenia, but more detailed studies using larger samples of more diverse patients that are followed over time will be needed before the potential utility of such acoustic parameters of speech can be fully assessed.

[1]  Michael F. Green,et al.  Social cognition in schizophrenia, Part 1: performance across phase of illness. , 2012, Schizophrenia bulletin.

[2]  S. Kay,et al.  The positive and negative syndrome scale (PANSS) for schizophrenia. , 1987, Schizophrenia bulletin.

[3]  Petri Laukka,et al.  Getting the cue: sensory contributions to auditory emotion recognition impairments in schizophrenia. , 2010, Schizophrenia bulletin.

[4]  D. Sheehan,et al.  The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. , 1998, The Journal of clinical psychiatry.

[5]  Albino Nogueiras,et al.  Speech emotion recognition using hidden Markov models , 2001, INTERSPEECH.

[6]  G. Toderean,et al.  Analysis and Synthesis of Vowels Using Matlab , 2006, 2006 IEEE International Conference on Automation, Quality and Testing, Robotics.

[7]  Ling Guan,et al.  A neural network approach for human emotion recognition in speech , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[8]  H H Stassen,et al.  Speaking behavior and voice sound characteristics associated with negative schizophrenia. , 1995, Journal of psychiatric research.

[9]  Peter B. Jones,et al.  The Clinical Global Impression–Schizophrenia scale: a simple instrument to measure the diversity of symptoms present in schizophrenia , 2003, Acta psychiatrica Scandinavica. Supplementum.

[10]  H H Stassen,et al.  Speaking behavior and speech sound characteristics in acute schizophrenia. , 1998, Journal of psychiatric research.

[11]  Li Zhao,et al.  A Study on Emotional Feature Analysis and Recognition in Speech Signal , 2009, 2009 International Conference on Measuring Technology and Mechatronics Automation.

[12]  Namrata Dave,et al.  Feature Extraction Methods LPC, PLP and MFCC In Speech Recognition , 2013 .

[13]  N C Andreasen,et al.  Negative symptoms in schizophrenia , 1982 .

[14]  Wang De-fang Analysis and Application of Voice Spectrum , 2010 .

[16]  D. Javitt,et al.  Auditory emotion recognition impairments in schizophrenia: relationship to acoustic features and cognition. , 2012, The American journal of psychiatry.