Paper information - Personal 3D audio system with loudspeakers

Traditional 3D audio systems often have a limited sweet spot within which the user can perceive 3D effects successfully. In this paper, we present a personal 3D audio system with loudspeakers that is not confined to a single fixed sweet spot. A camera tracks the user's head movement, and the crosstalk canceller filters are recomputed accordingly. To the best of the authors' knowledge, our system is the first non-intrusive 3D audio system that adapts to both head position and orientation with six degrees of freedom. The effectiveness of the proposed system is demonstrated with subjective listening tests that compare it against traditional non-adaptive systems.
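
The abstract does not give the filter design, so the following is only a minimal sketch of what "recomputing the crosstalk canceller for a tracked head pose" could look like, not the authors' method. It assumes two loudspeakers, a free-field point-source model in place of measured head-related transfer functions, a head pose reduced to position plus yaw (the paper tracks six degrees of freedom), and a Tikhonov-regularized inverse. All function names, the 0.09 m half head width, and the `beta` parameter are illustrative assumptions.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def ear_positions(head_center, yaw, half_width=0.09):
    """Left and right ear positions (rows) for a head rotated by `yaw`
    radians about the vertical z axis. At yaw = 0 the head faces +y.
    half_width is an assumed half distance between the ears, in metres."""
    center = np.asarray(head_center, dtype=float)
    right = np.array([np.cos(yaw), np.sin(yaw), 0.0])
    return np.stack([center - half_width * right, center + half_width * right])


def acoustic_matrix(ears, speakers, freqs):
    """C[f, i, j]: free-field transfer from speaker j to ear i at freqs[f].
    Each path is modelled as a pure delay with 1/distance attenuation
    (an assumption; a real system would use HRTFs)."""
    dist = np.linalg.norm(ears[:, None, :] - speakers[None, :, :], axis=-1)
    k = 2.0 * np.pi * freqs[:, None, None] / SPEED_OF_SOUND
    return np.exp(-1j * k * dist[None, :, :]) / dist[None, :, :]


def crosstalk_canceller(C, beta=1e-6):
    """Regularized inverse per frequency bin: H = (C^H C + beta I)^-1 C^H."""
    Ch = np.conj(np.swapaxes(C, -1, -2))
    eye = np.eye(C.shape[-1])
    return np.linalg.solve(Ch @ C + beta * eye, Ch)


def filters_for_pose(head_center, yaw, speakers, freqs, beta=1e-6):
    """Recompute the canceller for one tracked head pose."""
    ears = ear_positions(head_center, yaw)
    return crosstalk_canceller(acoustic_matrix(ears, speakers, freqs), beta)


if __name__ == "__main__":
    speakers = np.array([[-0.3, 0.6, 0.0], [0.3, 0.6, 0.0]])
    freqs = np.array([500.0, 1000.0, 2000.0])
    # The tracker would call this again whenever the pose estimate changes.
    H = filters_for_pose([0.1, 0.05, 0.0], 0.3, speakers, freqs)
    print(H.shape)
```

In this sketch the product of the acoustic matrix and the canceller is close to the identity at each frequency, so each ear receives mainly its intended binaural channel. A non-adaptive system would compute the filters once for a nominal pose, which is why the cancellation degrades when the listener moves.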
