Local Tetra Pattern Texture Features for Environmental Sound Event Classification

Audio feature extraction and classification are important tool for audio signal analysis in many applications, such as home care system, security surveillance, meeting room sounds and music classification and so on. This paper presents sound classification by combining of image processing and signal processing to classify the data accurately. Firstly, audio signal is converted into time-frequency representation same as texture image in image processing. And then local tetra pattern (LTrP) text feature is used to extract features from this image. Finally, audio signal is classified by using one-vs-one SVM classifiers. Evaluation is tested on ESC-10 dataset.

[1]  I. Jacob,et al.  An Image Indexing and Retrieval Algorithm Using Local Tetra Texture Features , 2013 .

[2]  Jun Huang,et al.  An improved local binary pattern operator for texture classification , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Muhammad Huzaifah,et al.  Comparison of Time-Frequency Representations for Environmental Sound Classification using Convolutional Neural Networks , 2017, ArXiv.

[4]  Jean-Jacques E. Slotine,et al.  Audio classification from time-frequency texture , 2008, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Matti Pietikäinen,et al.  Unsupervised texture segmentation using feature distributions , 1997, Pattern Recognit..

[6]  Myung Jong Kim,et al.  Cross-acoustic transfer learning for sound event classification , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Sridhar Krishnan,et al.  Time–Frequency Matrix Feature Extraction and Classification of Environmental Audio Signals , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Haizhou Li,et al.  Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions , 2011, IEEE Signal Processing Letters.

[9]  Matti Pietikäinen,et al.  Median Robust Extended Local Binary Pattern for Texture Classification , 2016, IEEE Transactions on Image Processing.

[10]  Xudong Jiang,et al.  Sound-Event Classification Using Robust Texture Features for Robot Hearing , 2017, IEEE Transactions on Multimedia.

[11]  VirtanenTuomas,et al.  Detection and Classification of Acoustic Scenes and Events , 2018 .

[12]  Subrahmanyam Murala,et al.  Local Tetra Patterns: A New Feature Descriptor for Content-Based Image Retrieval , 2012, IEEE Transactions on Image Processing.

[13]  Zhenhua Guo,et al.  A Completed Modeling of Local Binary Pattern Operator for Texture Classification , 2010, IEEE Transactions on Image Processing.

[14]  Nabil Alshurafa,et al.  Spectrogram-based audio classification of nutrition intake , 2014, 2014 IEEE Healthcare Innovation Conference (HIC).

[15]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[16]  The Duy Bui,et al.  Speech classification using SIFT features on spectrogram images , 2016, Vietnam Journal of Computer Science.

[17]  Chng Eng Siong,et al.  Analysis of spectrogram image methods for sound event classification , 2014, INTERSPEECH.

[18]  Nitin Narayankar,et al.  Texture Extraction for Image Retrieval UsingLocal Tetra Pattern , 2014 .