Ensemble extraction for classification and detection of bird species

Advances in technology have enabled new approaches for sensing the environment and collecting data about the world. Once collected, sensor readings can be assembled into data streams and transmitted over computer networks for storage and processing at observatories or to evoke an immediate response from an autonomic computer system. However, such automated collection of sensor data produces an immense quantity of data that is time consuming to organize, search and distill into meaningful information. In this paper, we explore the design and use of distributed pipelines for automated processing of sensor data streams. In particular, we focus on the detection and extraction of meaningful sequences, called ensembles, from acoustic data streamed from natural areas. Our goal is automated detection and classification of various species of birds.

[1]  Linda A. Weir,et al.  North American Amphibian Monitoring Program (NAAMP) , 2005 .

[2]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[3]  Richard E. Grandy,et al.  Orlando, Florida, USA , 2011 .

[4]  Jane S. Paulsen Memory in the Cerebral Cortex: An Empirical Approach to Neural Networks in the Human and Nonhuman Primate , 1996 .

[5]  S. E. Brodie New York, New York, USA , 1996 .

[6]  D K Mellinger,et al.  Recognizing transient low-frequency whale sounds by spectrogram correlation. , 2000, The Journal of the Acoustical Society of America.

[7]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[8]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[9]  D. Bystrak,et al.  The north american breeding bird survey. , 1981 .

[10]  Ralph Arnote,et al.  Hong Kong (China) , 1996, OECD/G20 Base Erosion and Profit Shifting Project.

[11]  Eyke Hüllermeier,et al.  Online clustering of parallel data streams , 2006, Data Knowl. Eng..

[12]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[13]  Philip K. McKinley,et al.  Automated Ensemble Extraction and Analysis of Acoustic Data Streams , 2007, 27th International Conference on Distributed Computing Systems Workshops (ICDCSW'07).

[14]  B. Truax Acoustic Communication , 1985 .

[15]  Dana Ron,et al.  The power of amnesia: Learning probabilistic automata with variable memory length , 1996, Machine Learning.

[16]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[17]  Joseph M. Hellerstein,et al.  Eddies: continuously adaptive query processing , 2000, SIGMOD '00.

[18]  Shashi Shekhar,et al.  Environmental Sensor Networks , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[19]  David G. Stork,et al.  Pattern Classification , 1973 .

[20]  Simon Kasif,et al.  A System for Induction of Oblique Decision Trees , 1994, J. Artif. Intell. Res..

[21]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[22]  Jiong Yang,et al.  AGILE: a general approach to detect transitions in evolving data streams , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[23]  John Anderson,et al.  An analysis of a large scale habitat monitoring application , 2004, SenSys '04.

[24]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[25]  Seppo Ilmari Fagerlund,et al.  Bird Species Recognition Using Support Vector Machines , 2007, EURASIP J. Adv. Signal Process..

[26]  Jessica Lin,et al.  HOT SAX: Finding the Most Unusual Time Series Subsequence: Algorithms and Applications , 2004 .

[27]  J A Kogan,et al.  Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. , 1998, The Journal of the Acoustical Society of America.

[28]  Jennifer Widom,et al.  STREAM: The Stanford Stream Data Manager , 2003, IEEE Data Eng. Bull..

[29]  C. Goose,et al.  Glossary of Terms , 2004, Machine Learning.

[30]  P. Arzberger,et al.  Sensors for Environmental Observatories , 2005 .

[31]  Nitin Kumar,et al.  Time-series Bitmaps: a Practical Visualization Tool for Working with Large Time Series Databases , 2005, SDM.

[32]  R. Anderson,et al.  Studying Wind Energy/Bird Interactions: A Guidance Document , 1999 .

[33]  P. Hanson,et al.  Wireless Sensor Networks for Ecology , 2005 .

[34]  Panu Somervuo,et al.  Bird song recognition based on syllable pair histograms , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[35]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[36]  Beatrice Gralton,et al.  Washington DC - USA , 2008 .

[37]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[38]  Chang-Hsing Lee,et al.  Bird Species Recognition by Comparing the HMMs of the Syllables , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[39]  Deborah Estrin,et al.  Habitat monitoring with sensor networks , 2004, CACM.

[40]  Philip K. McKinley,et al.  MESO: Supporting Online Decision Making in Autonomic Computing Systems , 2007, IEEE Transactions on Knowledge and Data Engineering.

[41]  W. Thorpe,et al.  Bird-Song: The Biology of Vocal Communication and Expression in Birds , 1961 .

[42]  Tarek F. Abdelzaher,et al.  EnviroMic: Towards Cooperative Storage and Retrieval in Audio Sensor Networks , 2007, 27th International Conference on Distributed Computing Systems (ICDCS '07).

[43]  O Tchernichovski,et al.  Studying the Song Development Process: Rationale and Methods , 2004, Annals of the New York Academy of Sciences.

[44]  D Margoliash,et al.  An introduction to birdsong and the avian song system. , 1997, Journal of neurobiology.

[45]  Stephen Emmott,et al.  Towards 2020 Science , 2006 .

[46]  S. Gage,et al.  Soundscape characteristics of an environment: A new ecological indicator of ecosystem health , 2008 .

[47]  Philip K. Chan,et al.  MORPHEUS: motif oriented representations to purge hostile events from unlabeled sequences , 2004, VizSEC/DMSEC '04.

[48]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[49]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[50]  Aki Härmä,et al.  Parametrization of inharmonic bird sounds for automatic recognition , 2005, 2005 13th European Signal Processing Conference.

[51]  P. Slater,et al.  Bird Song: Biological Themes and Variations , 1995 .

[52]  José Luis Carles,et al.  Sound influence on landscape values , 1999 .

[53]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[54]  Carlo Zaniolo,et al.  An adaptive learning approach for noisy data streams , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[55]  Jennifer Widom,et al.  Content-Based Routing: Different Plans for Different Data , 2005, VLDB.

[56]  James P. Egan,et al.  Signal detection theory and ROC analysis , 1975 .

[57]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[58]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[59]  Eliza Ward,et al.  An Introduction to Acoustic Ecology , 2001 .

[60]  J. E. Porter,et al.  Normalizations and selection of speech segments for speaker recognition scoring , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[61]  Charles E. Taylor,et al.  Data Mining Applied to Acoustic Bird Species Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).