Challenges with Audio Classification using Image Based Approaches for Health Measurement Applications

Image classification has had huge success in recent years, mainly due to the vast array of databases available. The lack of audio databases presents a problem when it comes to creating a deep neural network classifier aimed at measurement and monitoring of health-related sounds. Such sounds (i.e. cough) can be indicative of worsening health conditions, specifically as it relates to remote monitoring of older adults. The application of pre-existing deep neural network image classifiers to audio classification has been presented as a potential solution. This paper describes some of the issues associated with utilizing audio spectrograms to retrain the AlexNet image classifier for the purpose of remote patient monitoring. The spatial invariance assumption of the classifier is further investigated by creating two different classification tasks based on spectrograms computed from notes on a classical piano at four different noise levels; (1) octave classification and (2) note classification. As expected, the AlexNet classifier with clean data performs better when classifying octaves (98%), when compared to the note classification (83 %). When evaluating on audio with noise, the note classifier performance decreases more than the octave classification performance.

[1]  Sauro Longhi,et al.  Human Monitoring, Smart Health and Assisted Living: Techniques and technologies , 2017 .

[2]  Loris Nanni,et al.  Combining visual and acoustic features for audio classification tasks , 2017, Pattern Recognit. Lett..

[3]  Doroteo Torre Toledano,et al.  Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio dataset , 2019, EURASIP J. Audio Speech Music. Process..

[4]  Frank Knoefel,et al.  Comparison of Silence Removal Methods for the Identification of Audio Cough Events , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[5]  Elmar Nöth,et al.  Segmentation, Classification, and Visualization of Orca Calls Using Deep Learning , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Frank Knoefel,et al.  Feature extraction for the differentiation of dry and wet cough sounds , 2011, 2011 IEEE International Symposium on Medical Measurements and Applications.

[7]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Shengyu Zhang,et al.  Optimization of Center-of-Pressure-Based Indices for Assessing Balance Ability , 2018, 2018 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[11]  Aren Jansen,et al.  Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Zhiyong Xu,et al.  Automatic Bird Vocalization Identification Based on Fusion of Spectral Pattern and Texture Features , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Frank Knoefel,et al.  Smart monitoring of fluid intake and bladder voiding using pressure sensitive mats , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[14]  Lars Lundberg,et al.  Classifying environmental sounds using image recognition networks , 2017, KES.

[15]  Erin M. Bayne,et al.  Pre-processing spectrogram parameters improve the accuracy of bioacoustic classification using convolutional neural networks , 2020, Bioacoustics.

[16]  Rafik A. Goubran,et al.  Breathing signal combining for respiration rate estimation in smart beds , 2017, 2017 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[17]  Emanuele Rizzuto,et al.  Wearable heart rate monitoring as stress response indicator in children with neurodevelopmental disorder , 2018, 2018 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[18]  Eleni Stroulia,et al.  Preliminary Results from Longitudinal Balance Assessment for Older Adults with Cognitive Decline , 2019, 2019 IEEE International Symposium on Medical Measurements and Applications (MeMeA).