A low-cost solution to 3D pinna modeling for HRTF prediction

We propose an infrared (IR) stereo-vision system for estimating the 3D model of the pinna, based on low-cost devices. A commercial IR calibrated stereo camera is used in conjunction with a structured IR light projector, to acquire highly textured snapshots of the pinna. A point cloud is computed for each snapshot by triangulating the stereo correspondences detected in the acquired IR images. A complete 3D model is computed by aligning and merging the point clouds, and then creating a polygonal mesh surface. The nominal accuracy of the proposed system turns to be about 1 mm, which enables an accurate prediction of the Head Related Transfer Function (HRTF) through numerical acoustic simulation.

[1]  Michael M. Kazhdan,et al.  Poisson surface reconstruction , 2006, SGP '06.

[2]  Richard O. Duda,et al.  A structural model for binaural sound synthesis , 1998, IEEE Trans. Speech Audio Process..

[3]  R. Duda,et al.  Approximating the head-related transfer function using simple geometric models of the head and torso. , 2002, The Journal of the Acoustical Society of America.

[4]  A. Bronkhorst Localization of real and virtual sound sources , 1995 .

[5]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Oscar A. Ramos,et al.  Usage of Spectral Distortion for Objective Evalua- tion of Personalized HRTF in the Median Plane , 2015 .

[7]  Kazuya Takeda,et al.  Estimation of HRTFs on the horizontal plane using physical features , 2007 .

[8]  P. Guillon Individualisation des indices spectraux pour la synthèse binaurale : recherche et exploitation des similarités inter-individuelles pour l’adaptation ou la reconstruction de HRTF , 2009 .

[9]  Rajesh M. Hegde,et al.  Fast modelling of pinna spectral notches from HRTFs using linear prediction residual cepstrum , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Simone Spagnol,et al.  On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Rafael C. González,et al.  Digital image processing, 3rd Edition , 2008 .

[12]  Simone Spagnol,et al.  A Head-Related Transfer Function Model for Real-Time Customized 3-D Sound Rendering , 2011, 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems.

[13]  Navarun Gupta,et al.  Modeling of Pinna Related Transfer Functions (PRTF) using Finite Element Method (FEM) , 2013 .

[14]  Nobuhiko Kitawaki,et al.  Common-acoustical-pole and zero modeling of head-related transfer functions , 1999, IEEE Trans. Speech Audio Process..

[15]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[16]  Craig T. Jin,et al.  Creating the Sydney York Morphological and Acoustic Recordings of Ears Database , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[17]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Ramani Duraiswami,et al.  Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation. , 2010, The Journal of the Acoustical Society of America.

[19]  Armando Barreto,et al.  Augmented Hankel Total Least-Squares Decomposition of Head-Related Transfer Functions , 2010 .

[20]  Piotr Majdak,et al.  Fast multipole boundary element method to calculate head-related transfer functions for a wide frequency range. , 2009, The Journal of the Acoustical Society of America.