A Novel Representation of Bioacoustic Events for Content-Based Search in Field Audio Data

Bioacoustic data can provide an important base for environmental monitoring. To explore a large amount of field recordings collected, an automated similarity search algorithm is presented in this paper. A region of an audio defined by frequency and time bounds is provided by a user; the content of the region is used to construct a query. In the retrieving process, our algorithm will automatically scan through recordings to search for similar regions. In detail, we present a feature extraction approach based on the visual content of vocalisations - in this case ridges, and develop a generic regional representation of vocalisations for indexing. Our feature extraction method works best for bird vocalisations showing ridge characteristics. The regional representation method allows the content of an arbitrary region of a continuous recording to be described in a compressed format.

[1]  Daniel P. W. Ellis,et al.  Classifying Music Audio with Timbral and Chroma Features , 2007, ISMIR.

[2]  T Scott Brandes,et al.  Using image processing to detect and classify narrow-band cricket and frog calls. , 2006, The Journal of the Acoustical Society of America.

[3]  Joseph Razik,et al.  Sparse coding for large scale bioacoustic similarity function improved by multiscale scattering , 2013 .

[4]  Paul Roe,et al.  Scaling Acoustic Data Analysis through Collaboration and Automation , 2010, 2010 IEEE Sixth International Conference on e-Science.

[5]  Panu Somervuo,et al.  Classification of the harmonic structure in bird vocalization , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  T. S. Brandes,et al.  Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  M. Muller,et al.  Chroma-based statistical audio features for audio matching , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[8]  Gonzalo Vaca-Castano,et al.  Using syllabic Mel cepstrum features and k-nearest neighbors to identify anurans and birds species , 2010, 2010 IEEE Workshop On Signal Processing Systems.

[9]  Paul Roe,et al.  Archiving nature's heartbeat using smartphones , 2010 .

[10]  Rolf Bardeli,et al.  Similarity Search in Animal Sound Databases , 2009, IEEE Transactions on Multimedia.

[11]  Paul Roe,et al.  A toolbox for animal call recognition , 2012 .

[12]  Andreas Spanias,et al.  Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  E. D. Chesmore,et al.  Application of time domain signal coding and artificial neural networks to passive acoustical identification of animals , 2001 .

[14]  David L. Olson,et al.  Advanced Data Mining Techniques , 2008 .

[15]  Friedhelm Schwenker,et al.  Classification of bioacoustic time series based on the combination of global and local decisions , 2004, Pattern Recognit..