SPREAD: sound propagation and perception for autonomous agents in dynamic environments

The perception of sensory information and its impact on behavior is a fundamental component of being human. While visual perception is considered for navigation, collision, and behavior selection, the acoustic domain is relatively unexplored. Recent work in acoustics focuses on synthesizing sound in 3D environments; however, the perception of acoustic signals by a virtual agent is a useful and realistic adjunct to any behavior selection mechanism. In this paper, we present SPREAD, a novel agent-based sound perception model using a discretized sound packet representation with acoustic features including amplitude, frequency range, and duration. SPREAD simulates how sound packets are propagated, attenuated, and degraded as they traverse the virtual environment. Agents perceive and classify the sounds based on the locally-received packet set using a hierarchical clustering scheme, and have individualized hearing and understanding of their surroundings. Using this model, we demonstrate several simulations that greatly enrich controls and outcomes.

[1]  Thomas A. Funkhouser,et al.  A beam tracing approach to acoustic modeling for interactive virtual environments , 1998, SIGGRAPH.

[2]  M. A. Bee,et al.  The cocktail party problem: what is it? How can it be solved? And why should animal behaviorists study it? , 2008, Journal of comparative psychology.

[3]  Tapio Lokki,et al.  Creating Interactive Virtual Acoustic Environments , 1999 .

[4]  Yoram Singer,et al.  An Online Algorithm for Hierarchical Phoneme Classification , 2004, MLMI.

[5]  Dinesh Manocha,et al.  AD-Frustum: Adaptive Frustum Tracing for Interactive Sound Propagation , 2008, IEEE Transactions on Visualization and Computer Graphics.

[6]  T. D. Mast Empirical relationships between acoustic parameters in human soft tissues , 2000 .

[7]  Angélica de Antonio Jiménez,et al.  Introducing human-like hearing perception in intelligent virtual agents , 2003, AAMAS '03.

[8]  Torben Dabelsteen,et al.  Degradation of wren Troglodytes troglodytes song: Implications for information transfer and ranging , 1998 .

[9]  Terri L. Bonebright Perceptual structure of everyday sounds: A multidimensional scaling approach , 2001 .

[10]  Nadine Martin,et al.  Spectrogram segmentation by means of statistical features for non-stationary signal interpretation , 2002, IEEE Trans. Signal Process..

[11]  Renate Sitte,et al.  Comparison of techniques for environmental sound recognition , 2003, Pattern Recognit. Lett..

[12]  George Drettakis,et al.  Progressive perceptual audio rendering of complex scenes , 2007, SI3D.

[13]  Dinesh K. Pai,et al.  Precomputed acoustic transfer: output-sensitive, accurate sound generation for geometrically complex vibration sources , 2006, SIGGRAPH 2006.

[14]  S. Musse,et al.  A Perceptive Model for Virtual Agents in Crowds , 2007 .

[15]  Ravish Mehra,et al.  Precomputed wave simulation for real-time sound propagation of dynamic sources in complex scenes , 2010, SIGGRAPH 2010.

[16]  Glenn Reinman,et al.  A Behavior-Authoring Framework for Multiactor Simulations , 2011, IEEE Computer Graphics and Applications.

[17]  Rahul Narain,et al.  Efficient and accurate sound propagation using adaptive rectangular decomposition. , 2009, IEEE transactions on visualization and computer graphics.

[18]  Daniel Thalmann,et al.  Crowd Simulation , 2007, Wiley Encyclopedia of Computer Science and Engineering.

[19]  Carol O'Sullivan,et al.  Metropolis: Multisensory Simulation of a Populated City , 2011, 2011 Third International Conference on Games and Virtual Worlds for Serious Applications.

[20]  Daniel P. W. Ellis,et al.  Ground-truth transcriptions of real music from force-aligned MIDI syntheses , 2003, ISMIR.

[21]  Brian Gygi,et al.  Similarity and categorization of environmental sounds , 2007, Perception & psychophysics.

[22]  Y. Kagawa,et al.  Discrete Huygens' model approach to sound wave propagation , 1998 .

[23]  Norman I. Badler,et al.  Virtual Crowds: Methods, Simulation, and Control , 2008, Virtual Crowds: Methods, Simulation, and Control.

[24]  Stéphane Donikian,et al.  A synthetic-vision based steering approach for crowd simulation , 2010, ACM Transactions on Graphics.

[25]  Daniel Thalmann,et al.  A Sound Propagation Model for Interagents Communication , 2000, Virtual Worlds.

[26]  Tapio Takala,et al.  Sound rendering , 1992, SIGGRAPH.

[27]  Changsheng Xu,et al.  Automatic music classification and summarization , 2005, IEEE Transactions on Speech and Audio Processing.

[28]  Murray H. Loew,et al.  Adjacency detection using quadcodes , 1987, CACM.