The Navigation and Visualisation of Environmental Audio Using Zooming Spectrograms

Acoustic recordings play an increasingly important role in monitoring terrestrial and aquatic environments. However, rapid advances in technology make it possible to accumulate thousands of hours of recordings, more than ecologists can ever listen to. Our approach to this big-data challenge is to visualize the content of long-duration audio recordings on multiple scales, from minutes, hours, days to years. The visualization should facilitate navigation and yield ecologically meaningful information prior to listening to the audio. To construct images, we calculate acoustic indices, statistics that describe the distribution of acoustic energy and reflect content of ecological interest. We combine various indices to produce false-color spectrogram images that reveal acoustic content and facilitate navigation. The technical challenge we investigate in this work is how to navigate recordings that are days or even months in duration. We introduce a method of zooming through multiple temporal scales, analogous to Google Maps. However, the "landscape" to be navigated is not geographical and not therefore intrinsically visual, but rather a graphical representation of the underlying audio. We describe solutions to navigating spectrograms that range over three orders of magnitude of temporal scale. We make three sets of observations: 1. We determine that at least ten intermediate scale steps are required to zoom over three orders of magnitude of temporal scale, 2. We determine that three different visual representations are required to cover the range of temporal scales, 3. We present a solution to the problem of maintaining visual continuity when stepping between different visual representations. Finally, we demonstrate the utility of the approach with four case studies.

[1]  Arco J. van Strien,et al.  Wild Bird Indicators: Using Composite Population Trends of Birds as Measures of Environmental Health , 2010 .

[2]  Yves Guiard,et al.  Beyond the 10-bit Barrier: Fitts' Law in Multi-Scale Electronic Worlds , 2001, BCS HCI/IHM.

[3]  Paul Roe,et al.  Practical Analysis of Big Acoustic Sensor Data for Environmental Monitoring , 2014, 2014 IEEE Fourth International Conference on Big Data and Cloud Computing.

[4]  Kristoffer Jensen,et al.  Retrieving and Recreating Musical Form , 2007, CMMR.

[5]  Almo Farina,et al.  A new methodology to infer the singing activity of an avian community: The Acoustic Complexity Index (ACI) , 2011 .

[6]  Huamin Qu,et al.  Visualizing the Semantic Structure in Classical Music Works , 2010, IEEE Transactions on Visualization and Computer Graphics.

[7]  D. A. Green,et al.  A colour scheme for the display of astronomical intensity images , 2011, 1108.5083.

[8]  Doris Dransch,et al.  A Visual Analytics Approach to Multiscale Exploration of Environmental Time Series , 2012, IEEE Transactions on Visualization and Computer Graphics.

[9]  Eric P. Kasten,et al.  The remote environmental assessment laboratory's acoustic library: An archive for studying soundscape ecology , 2012, Ecol. Informatics.

[10]  Samuel S. Silva,et al.  There is More to Color Scales than Meets the Eye: A Review on the Use of Color in Visualization , 2007, 2007 11th International Conference Information Visualization (IV '07).

[11]  Hans G. Kaper,et al.  Data sonification and sound visualization , 1999, Comput. Sci. Eng..

[12]  Michael W. Towsey,et al.  Visualization of Long-duration Acoustic Recordings of the Environment , 2014, ICCS.

[13]  Min Chen,et al.  SoundRiver: Semantically‐Rich Sound Illustration , 2010, Comput. Graph. Forum.

[14]  Mohamad Adnan Al-Alaoui,et al.  Sound Visualization for the Hearing Impaired , 2007, iJET.

[15]  Sandrine Pavoine,et al.  Rapid Acoustic Survey for Biodiversity Appraisal , 2008, PloS one.

[16]  Dan Stowell,et al.  Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning , 2014, PeerJ.

[17]  Paul Roe,et al.  Sampling environmental acoustic recordings to determine bird species richness. , 2013, Ecological applications : a publication of the Ecological Society of America.

[18]  Michael W. Towsey,et al.  Managing and Analysing Big Audio Data for Environmental Monitoring , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.

[19]  Paul Roe,et al.  The use of acoustic indices to determine avian species richness in audio-recordings of the environment , 2014, Ecol. Informatics.

[20]  Michael Towsey Noise removal from wave-forms and spectrograms derived from natural recordings of the environment , 2013 .

[21]  P. Fitts,et al.  INFORMATION CAPACITY OF DISCRETE MOTOR RESPONSES. , 1964, Journal of experimental psychology.

[22]  Michael Towsey,et al.  A practical comparison of manual and autonomous methods for acoustic monitoring , 2013 .

[23]  Sarah L. Dumyahn,et al.  What is soundscape ecology? An introduction and overview of an emerging new science , 2011, Landscape Ecology.

[24]  Sandrine Pavoine,et al.  Author's Personal Copy Ecological Indicators Monitoring Animal Diversity Using Acoustic Indices: Implementation in a Temperate Woodland , 2022 .

[25]  Michael Towsey,et al.  Report on a workshop to investigate the current status of environmental bio-acoustic monitoring , 2012 .

[26]  Sandrine Pavoine,et al.  Biodiversity Sampling Using a Global Acoustic Approach: Contrasting Sites with Microendemics in New Caledonia , 2013, PloS one.

[27]  Sanjay Jha,et al.  The design and evaluation of a hybrid sensor network for cane-toad monitoring , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[28]  Daniel A. Keim,et al.  Visual Analytics , 2009, Encyclopedia of Database Systems.

[29]  Wen Hu,et al.  Lightweight acoustic classification for cane-toad monitoring , 2008, 2008 42nd Asilomar Conference on Signals, Systems and Computers.