Difference in voice analysis result by pre- and post- processing of telephone line

The purpose of this study is to verify the impact of a deterioration of the sound quality of voice by a telephone line on estimating Vitality as the extent of depressive tendency based on voice analysis using MIMOSYS. First, the voices of about 1,000 people recorded using a recorder were prepared. Next, each voice was coded and resampled in preparation for transmission over a phone line. Vitalities obtained by analyzing the voices before and after these processes were compared. The results showed high correlation between the Vitality after coding and Vitality before coding, revealing that using a telephone would be an effective way to obtain voices.

[1]  Yasuhiro Omiya,et al.  Validity of the Mind Monitoring System as a Mental Health Indicator , 2016, 2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE).

[2]  Suramya Tomar,et al.  Converting video formats with FFmpeg , 2006 .

[3]  Christian A. Müller,et al.  Multilingual speaker age recognition: Regression analyses on the Lwazi corpus , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[4]  Abderrahim Marzouk,et al.  Performance Evaluation for Voice over LTE by using G.711 as a Codec , 2014 .

[5]  J Gonzalez,et al.  The effect of MPEG audio compression on multidimensional set of voice parameters , 2001, Logopedics, phoniatrics, vocology.

[6]  S. Kuroiwa,et al.  Non-verbal voice emotion analysis system , 2006 .

[7]  Yasuhiro Omiya,et al.  Voice disability index using pitch rate , 2016, 2016 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES).

[8]  미쓰요시순지 Emotion recognizing method, sensibility creating method, device, and software , 2001 .