Analysis and Synthesis of Auditory Scenes

Audition is the most important human sensory modality in interindividual communication. Consequently, acoustics has always dealt with communication. Yet recently, due to the high amount of computing power available, communication-acoustical systems become increasingly complex and sophisticated. In particular, they become more intelligent and knowledgeable. This trend will be discussed in this chapter by taking two complementary research streams as examples which have been pursued at the Institute of Communication Acoustics at Bochum during the past three decades, namely, (a) analysis of auditory scenes with the goal of arriving at parametric representations and, complementary, (b) synthesis of auditory scenes from parametric representations. The discussion is based on two software systems which have been developed for research purposes, namely, a binaural-analysis system and an auditory-virtual-environment generator — both of which will be roughly explained. It is, however, not the aim of the chapter to introduce scientific or technological details, but rather to bring a trend to the fore which may well coin the profile of communication acoustics in the years to come.

[1]  Ute Jekosch,et al.  Assigning Meaning to Sounds — Semiotics in the Context of Product-Sound Design , 2005 .

[2]  Henrik Møller,et al.  Binaural Technique: Basic Methods for Recording, Synthesis, and Reproduction , 2005 .

[3]  Sebastian Möller,et al.  Assessment and Prediction of Speech Quality in Telecommunications , 2000 .

[4]  Benjamin B. Bauer,et al.  Stereophonic Earphones and Binaural Loudspeakers , 1961 .

[5]  Sebastian Mller,et al.  Quality of Telephone-Based Spoken Dialogue Systems , 2004 .

[6]  Sebastian Möller Quality Assessment in Telecommunications , 2000 .

[7]  Ulrich Heute,et al.  Speech and Audio Coding — Aiming at High Quality and Low Data Rates , 2005 .

[8]  A. Krokstad,et al.  Calculating the acoustical room response by the use of a ray tracing technique , 1968 .

[9]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[10]  Ag Armin Kohlrausch,et al.  Audio—Visual Interaction in the Context of Multi-Media Applications , 2005 .

[11]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[12]  Jont B. Allen,et al.  Multimicrophone signal‐processing technique to remove room reverberation from speech signals , 1977 .

[13]  Pedro Novo,et al.  Auditory Virtual Environments , 2005 .

[14]  M R Schroeder Computers in acoustics: symbiosis of an old science and a new tool. , 1969, The Journal of the Acoustical Society of America.

[15]  Daniel P. W. Ellis,et al.  Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures , 1999, Speech Commun..

[16]  L. Rayleigh,et al.  The theory of sound , 1894 .

[17]  Arild Lacroix Speech Production — Acoustics, Models, and Applications , 2005 .

[18]  John Mourjopoulos,et al.  The Evolution of Digital Audio Technology , 2005 .