A Dual Source-Filter Model of Snore Audio for Snorer Group Classification

Snoring is a common symptom of obstructive sleep apnea (OSA), a serious chronic disease. Knowing the location of the obstruction site in the upper airway (V-Velum, O-Oropharyngeal lateral walls, T-Tongue, E-Epiglottis) is necessary for proper surgical treatment. In this paper we propose a dual source-filter model, similar to the source-filter model of speech, to approximate the generation process of snore audio. The first filter models the vocal tract from the lungs to the point of obstruction, with white noise excitation from the lungs. The second filter models the vocal tract from the obstruction point to the lips/nose, with an impulse train excitation that represents vibrations at the point of obstruction. The filter coefficients are estimated using the closed and open phases of the snore beat cycle. VOTE classification is performed with an SVM classifier that uses the filter coefficients as features. The classification experiments are performed on the development set (283 snore audio recordings) of the MUNICH-PASSAU SNORE SOUND CORPUS (MPSSC). We obtain an unweighted average recall (UAR) of 49.58%, which is ∼3% (absolute) higher than the INTERSPEECH 2017 snoring sub-challenge baseline.
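Because the result is reported as UAR rather than accuracy, a short illustration of the metric may help. UAR is the mean of the per-class recalls, so each of the four VOTE classes contributes equally regardless of how many recordings it has. The sketch below is a minimal pure-Python version; the function name and the toy labels are hypothetical and are not taken from the paper or from MPSSC.

```python
from collections import Counter


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recalls over the classes present in y_true."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one labelled example is required")
    # Number of reference examples per class.
    support = Counter(y_true)
    # Number of correctly predicted examples per class.
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    recalls = [hits[c] / support[c] for c in support]
    return sum(recalls) / len(recalls)


# Toy, made-up labels (hypothetical; not MPSSC data).
y_true = ["V", "V", "V", "V", "O", "O", "T", "E"]
y_pred = ["V", "V", "V", "O", "O", "V", "T", "T"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.4f}, accuracy = {accuracy:.4f}")
```

In this toy case the per-class recalls are 0.75 (V), 0.5 (O), 1.0 (T) and 0.0 (E), so the UAR is 0.5625 while the plain accuracy is 0.625. The gap shows why UAR is the more informative measure when class sizes differ.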
