Improving Classification Algorithms by Considering Score Series in Wireless Acoustic Sensor Networks

The reduction in size, power consumption, and price of sensor devices has enabled the deployment of sensor networks that monitor and control many aspects of various habitats. In particular, sound analysis has attracted considerable interest in urban and wildlife environments, where classifying the different signals has become a major issue. Various algorithms have been described for this purpose: some split the sound into frames and classify each frame, while others exploit the sequential information embedded in the sound signal. This paper proposes a new algorithm that retains the advantages of frame classification and adds a further phase that classifies the score series obtained after frame labelling. These score series are represented using cepstral coefficients and classified using standard machine-learning classifiers. The proposed algorithm has been applied to a dataset of anuran calls, and its results have been compared with the performance obtained in previous experiments on sensor networks. The main outcome of our research is that considering score series strongly outperforms the other algorithms and attains outstanding performance despite the noisy background commonly encountered in this kind of application.
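
The two-phase idea described above (score each frame, then describe each class's score series by cepstral coefficients) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function names `frame_scores`, `real_cepstrum`, and `score_series_features` are hypothetical, the frame scorer is a simple distance-to-centroid softmax, and the real cepstrum (inverse FFT of the log magnitude spectrum) stands in for whichever cepstral representation the authors actually use.

```python
import numpy as np


def frame_scores(frame_features, class_centroids):
    """Hypothetical frame scorer: one score per class for every frame.

    frame_features:  (n_frames, n_features) array
    class_centroids: (n_classes, n_features) array
    Returns an (n_frames, n_classes) array whose rows sum to 1.
    """
    # Euclidean distance from every frame to every class centroid.
    diff = frame_features[:, None, :] - class_centroids[None, :, :]
    dist = np.linalg.norm(diff, axis=2)
    # Softmax of negative distances: closer centroid -> higher score.
    z = -dist
    z = z - z.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def real_cepstrum(series, n_coeffs):
    """First n_coeffs real-cepstrum coefficients of a 1-D series."""
    spectrum = np.abs(np.fft.rfft(series))
    log_spectrum = np.log(spectrum + 1e-10)  # small offset avoids log(0)
    cepstrum = np.fft.irfft(log_spectrum, n=len(series))
    return cepstrum[:n_coeffs]


def score_series_features(scores, n_coeffs=4):
    """Concatenate the cepstral coefficients of each class's score series."""
    per_class = [real_cepstrum(scores[:, k], n_coeffs)
                 for k in range(scores.shape[1])]
    return np.concatenate(per_class)


# Usage on synthetic data (assumed sizes: 40 frames, 5 features, 3 classes).
rng = np.random.default_rng(0)
centroids = rng.normal(size=(3, 5))
frames = rng.normal(size=(40, 5))
scores = frame_scores(frames, centroids)      # phase 1: frame scores
features = score_series_features(scores, 4)   # phase 2 input: 3 * 4 values
```

The resulting fixed-length vector could then be passed to any standard classifier (for example a nearest-neighbour or decision-tree model) trained on the same representation of labelled sounds; the choice of classifier is left open here because the abstract does not specify it.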
