A comparison of similarity-based approaches in the classification of flight calls of four species of North American wood-warblers (Parulidae)

Abstract Numerous methods are available for analysis of avian vocalizations, but few research efforts have compared recent methods for calculating and evaluating similarity among calls, particularly those collected in the field. This manuscript compares a suite of methodologies for analyzing flight calls of New World warblers, investigating the effectiveness of four techniques for calculating call similarity: (1) spectrographic cross-correlation, (2) dynamic time warping, (3) Euclidean distance between spectrogram-based feature measurements, and (4) random forest distance between spectrogram-based feature measurements. We tested these methods on flight calls, which are short, structurally simple vocalizations typically used during nocturnal migration, as these signals may contain important ecological or demographic information. Using the four techniques listed above, we classified flight calls from three datasets, one collected from captive birds and two collected from wild birds in the field. Each dataset contained an equal number of calls from four warbler species commonly recorded during acoustic monitoring: American Redstart, Chestnut-sided Warbler, Hooded Warbler, and Ovenbird. Using captive recordings to train the classification models, we created four similarity-based classifiers which were then tested on the captive and field datasets. We show that these classification methods are limited in their ability to successfully classify the calls of these warbler species, and that classification accuracy was lower on field recordings than captive recordings for each of the tested methods. Of the four methods we compared, the random forest technique had the highest classification accuracy, enabling correct classification of 67.6% of field recordings. To compare the performance of the automated techniques to manual classification, the most common method used in flight call research, human experts were also asked to classify calls from each dataset. The experts correctly classified approximately 90% of field recordings, indicating that although the automated techniques are faster, they remain less accurate than manual classification. However, because of the challenges inherent to these data, such as the structural similarity among the flight calls of focal species and the presence of environmental noise in the field recordings, some of the tested automated classification techniques may be acceptable for real-world applications. We believe that this comparison of broadly applicable methodologies provides information that will prove to be useful for analysis, detection and classification of short duration signals. Based on our results, we recommend that a combination of feature measurements and random forest classification can be used to assign flight calls to species, while human experts oversee the process.

[1]  John A Hildebrand,et al.  Classification of behavior using vocalizations of Pacific white-sided dolphins (Lagenorhynchus obliquidens). , 2011, The Journal of the Acoustical Society of America.

[2]  A. Farnsworth,et al.  A comparison of nocturnal call counts of migrating birds and reflectivity measurements on Doppler radar , 2004 .

[3]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[4]  Peter Marler,et al.  Bird Calls: Their Potential for Behavioral Neurobiology , 2004, Annals of the New York Academy of Sciences.

[5]  K. Cortopassi,et al.  GEOGRAPHICAL VARIATION IN THE CONTACT CALLS OF ORANGE-FRONTED PARAKEETS , 2001 .

[6]  Michael Lanzone,et al.  Revealing Undocumented or Poorly Known Flight Calls of Warblers (Parulidae) Using a Novel Method of Recording Birds in Captivity , 2009 .

[7]  M. C. Baker,et al.  Population Differentiation in a Complex Bird Sound: A Comparison of Three Bioacoustical Analysis Procedures , 2003 .

[8]  Philip K. McKinley,et al.  Ensemble extraction for classification and detection of bird species , 2010, Ecol. Informatics.

[9]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10]  Jinhai Cai,et al.  Sensor Network for the Monitoring of Ecosystem: Bird Species Recognition , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[11]  D Margoliash,et al.  Template-based automatic recognition of birdsong syllables from continuous recordings. , 1996, The Journal of the Acoustical Society of America.

[12]  Sarah C. Goslee,et al.  The ecodist Package for Dissimilarity-based Analysis of Ecological Data , 2007 .

[13]  Harold Mills,et al.  Automatic detection and classification of nocturnal migrant bird calls , 1995 .

[14]  O. G. Libby The Nocturnal Flight of Migrating Birds , 1899 .

[15]  Judith C Brown,et al.  Classification of vocalizations of killer whales using dynamic time warping. , 2006, The Journal of the Acoustical Society of America.

[16]  Marc O. Lammers,et al.  Comparison of real-time and post-cruise acoustic species identification of dolphin whistles using ROCCA (Real-time Odontocete Call Classification Algorithm) , 2011 .

[17]  L. Baptista,et al.  ADVANCES IN STUDIES OF AVIAN SOUND COMMUNICATION , 1994 .

[18]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Jean,et al.  Comparaison entre des dénombrements auditifs de passereaux la nuit et la réflectivité d ’ un radar de surveillance météorologique Canadien , 2010 .

[21]  T. K. Vintsyuk Speech discrimination by dynamic programming , 1968 .

[22]  P. Slater,et al.  Bird Song: Biological Themes and Variations , 1995 .

[23]  Geoffrey A. Williamson,et al.  Comparison of methods for automated recognition of avian nocturnal flight calls , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  P. Marler,et al.  Quantitative Analysis of Animal Vocal Phonology: an Application to Swamp Sparrow Song , 2010 .

[25]  M. Morrison,et al.  Assessing Impacts of Wind-Energy Development on Nocturnally Active Birds and Bats: A Guidance Document , 2007 .

[26]  Aaron E. Rosenberg,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .

[27]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[28]  Andrew Farnsworth,et al.  Phylogenetic and ecological effects on interspecific variation in structurally simple avian vocalizations , 2008 .

[29]  C. E. Ho,et al.  A procedure for an automated measurement of song similarity , 2000, Animal Behaviour.

[30]  Andrew Farnsworth,et al.  Evolution of nocturnal flight calls in migrating wood‐warblers: apparent lack of morphological constraints , 2005 .

[31]  K. Cortopassi,et al.  Contact call diversity in wild orange-fronted parakeet pairs, Aratinga canicularis , 2006, Animal Behaviour.

[32]  Andreas M. Ali,et al.  Acoustic monitoring in terrestrial environments using microphone arrays: applications, technological considerations and prospectus , 2011 .

[33]  William A. Watkins,et al.  Characterizing acoustic features of marine animal sounds , 1992 .

[34]  Andrew Farnsworth FLIGHT CALLS OF WOOD-WARBLERS ARE NOT EXCLUSIVELY ASSOCIATED WITH MIGRATORY BEHAVIORS , 2007 .

[35]  Paul Roe,et al.  Sampling environmental acoustic recordings to determine bird species richness. , 2013, Ecological applications : a publication of the Ecological Society of America.

[36]  Ronald P. Larkin,et al.  Nocturnal flight calls of Dickcissels and Doppler radar echoes over south Texas in spring , 2002 .

[37]  Eric P. Kasten,et al.  The remote environmental assessment laboratory's acoustic library: An archive for studying soundscape ecology , 2012, Ecol. Informatics.

[38]  J A Kogan,et al.  Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. , 1998, The Journal of the Acoustical Society of America.

[39]  David W. Armitage,et al.  A comparison of supervised learning techniques in the classification of bat echolocation calls , 2010, Ecol. Informatics.

[40]  Robert W. Russell,et al.  Monitoring flight calls of migrating birds from an oil platform in the northern Gulf of Mexico , 2007 .

[41]  Theodoros Damoulas,et al.  Bayesian Classification of Flight Calls with a Novel Dynamic Time Warping Kernel , 2010, 2010 Ninth International Conference on Machine Learning and Applications.

[42]  Brian D. Ripley,et al.  Modern Applied Statistics with S Fourth edition , 2002 .

[43]  Andrew Farnsworth,et al.  Ecological and Evolutionary Characteristics of Flight-Calls of the Wood-Warblers (Parulidae) , 2006 .

[44]  Seppo Ilmari Fagerlund,et al.  Bird Species Recognition Using Support Vector Machines , 2007, EURASIP J. Adv. Signal Process..

[45]  Andrew Farnsworth,et al.  FLIGHT CALLS AND THEIR VALUE FOR FUTURE ORNITHOLOGICAL STUDIES AND CONSERVATION RESEARCH , 2005 .

[46]  P. Marler Chapter 5 – Bird calls: a cornucopia for communication , 2004 .

[47]  P. Mundinger,et al.  Vocal Imitation and Individual Recognition of Finch Calls , 1970, Science.

[48]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[49]  Panu Somervuo,et al.  Parametric Representations of Bird Sounds for Automatic Species Recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[50]  T. Scott Brandes,et al.  Automated sound recording and analysis techniques for bird surveys and conservation , 2008, Bird Conservation International.

[51]  J. Bradbury,et al.  Responses to playback of local vs. distant contact calls in the orange-fronted conure, Aratinga canicularis , 2003 .

[52]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[53]  Charles E Taylor,et al.  Automated species recognition of antbirds in a Mexican rainforest using hidden Markov models. , 2008, The Journal of the Acoustical Society of America.

[54]  Paul E. Allen,et al.  Random Forest for improved analysis efficiency in passive acoustic monitoring , 2014, Ecol. Informatics.

[55]  K. Cortopassi,et al.  THE COMPARISON OF HARMONICALLY RICH SOUNDS USING SPECTROGRAPHIC CROSS-CORRELATION AND PRINCIPAL COORDINATES ANALYSIS , 2000 .

[56]  Aki Härmä,et al.  Parametrization of inharmonic bird sounds for automatic recognition , 2005, 2005 13th European Signal Processing Conference.

[57]  Louis Ranjard,et al.  Unsupervised bird song syllable classification using evolving neural networks. , 2008, The Journal of the Acoustical Society of America.

[58]  William R. Evans,et al.  Nocturnal flight call of bicknell's thrush , 1994 .

[59]  William A. Watkins,et al.  Marine animal sound classification , 1993 .