3D Spatial Sound Systems Compatible with Human's Active Listening to Realize Rich High-Level kansei Information

In future communications with rich high-level kansei information, such as presence, verisimilitude, realism and naturalness, the role of sound is extremely important to enhance the quality and versatility of communications because sound itself can provide rich semantic and emotional information. Moreover, sound (auditory information) has good synergy effects with pictures (visual information). In this paper, we introduce our recent research results toward capturing and synthesizing comprehensive 3D sound space information as well as a high-definition 3D audio-visual display realizing strict audio-visual synchronization. We believe that these systems are useful to advance universal communications, which require particularly high-quality and versatile communications technologies for all.

[1]  Yukio Iwaya,et al.  High order Ambisonic decoding method for irregular loudspeaker arrays , 2010 .

[2]  W R Thurlow,et al.  Effect of induced head movements on localization of direction of sounds. , 1967, The Journal of the Acoustical Society of America.

[3]  Willard R. Thurlow Erratum: Effect of Induced Head Movements on Localization of Direction of Sounds [J. Acoust. Soc. Am. 42, 480–488 (1967)] , 1967 .

[4]  D. Cabrera,et al.  Improving sound field reproduction in a small room based on higher-order Ambisonics with 157-loudspeaker array , 2010 .

[5]  S. Ise A principle of sound field control based on the Kirchhoff-Helmholtz integral equation and the theory of inverse systems , 1999 .

[6]  Yukio Iwaya,et al.  Implementation of Real-Time Room Auralization Using a Surrounding 157 Loudspeaker Array , 2009 .

[7]  Yukio Iwaya,et al.  Implementation of a high-definition 3D audio-visual display based on higher-order ambisonics using a 157-loudspeaker array combined with a 3D projection display , 2010, 2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content.

[8]  T. Okamoto,et al.  ESTIMATION OF HIGH-RESOLUTION SOUND PROPERTIES FOR REALIZING AN EDITABLE SOUND-SPACE SYSTEM , 2011 .

[9]  A. Berkhout,et al.  Acoustic control by wave field synthesis , 1993 .

[10]  Mark A. Poletti,et al.  Three-Dimensional Surround Sound Systems Based on Spherical Harmonics , 2005 .

[11]  Paul Bertelson,et al.  Temporal ventriloquism: crossmodal interaction on the time dimension. 2. Evidence from sensorimotor synchronization. , 2003, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[12]  B. Katz,et al.  Framework for Real-Time Auralization in Architectural Acoustics , 2008 .

[13]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[14]  W R Thurlow,et al.  Head movements during sound localization. , 1967, The Journal of the Acoustical Society of America.

[15]  Shuichi Sakamoto,et al.  Effects of microphone arrangements on the accuracy of a spherical microphone array (SENZI) in acquiring high-definition 3D sound space information , 2011 .

[16]  Iwaki Toshima,et al.  Sound Localization During Head Movement Using an Acoustical Telepresence Robot: TeleHead , 2009, Adv. Robotics.

[17]  Yukio Iwaya,et al.  Improving sound field reproduction based on higher-order ambisonics in a small room with a 157-loudspeaker array , 2010 .

[18]  R. Baddeley,et al.  Multisensory temporal order judgments: When two locations are better than one , 2003, Perception & psychophysics.

[19]  S. Perrett,et al.  The effect of head rotations on vertical plane sound localization. , 1997, The Journal of the Acoustical Society of America.

[20]  Yukio Iwaya,et al.  Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter , 2012 .

[21]  Gene H. Golub,et al.  Matrix computations , 1983 .

[22]  Yukio Iwaya,et al.  Effects of head movement on front-back error in sound localization , 2003 .

[23]  Yôiti Suzuki,et al.  Sound localization in headphone reproduction by simulating transfer functions from the sound source to the external ear , 1991 .

[24]  Ramani Duraiswami,et al.  Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Michael J. Gerzon Periphony: With-Height Sound Reproduction , 1973 .

[26]  F. J. Anscombe,et al.  On estimating binomial response relations , 1956 .

[27]  Yukio Iwaya,et al.  The Effects of Ambient Sounds on the Quality of 3D Virtual Sound Space , 2009, 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[28]  Yukio Iwaya,et al.  Blind directivity estimation of a sound source in a room using a surrounding microphone array , 2010 .

[29]  F. Asano,et al.  Role of spectral cues in median plane localization. , 1990, The Journal of the Acoustical Society of America.

[30]  Yukio Iwaya,et al.  Estimation of sound source positions using a surrounding microphone array , 2007 .