Large Deformation Diffeomorphic Metric Mapping Provides New Insights into the Link Between Human Ear Morphology and the Head-Related Transfer Functions

This thesis aims to provide a better understanding on the relationship between the human head and ear morphology to a set of individualized filters known as the head related impulse response functions (HRIRs). The HRIR filters contain all the information required by the auditory system to localize sound in 3D space. The analyses presented in this thesis uses the Large Deformation Diffeomorphic Metric Mapping (LDDMM) technique and the Fast-Multipole Boundary Element Method, better known as FM-BEM, to study the effect of morphological changes on the ears to the corresponding acoustic changes. This work relies heavily on the morphological and acoustic data that is available in the SYMARE database, which contains acoustic and morphological information for 62 subjects. The research findings presented in this thesis is composed of four sections. In the first section of this thesis, the LDDMM framework and its applications to analysing ear shapes is explained. Further, important functions describing various operations for deforming shapes are introduced and elaborated. In order to examine the quality of the ear deformations, tools are developed to compare and measure the differences between the acoustic responses obtained from BEM simulations for two ear shapes. This part of the work also involved the development of techniques to compare and measure differences in 3D shapes using the framework of currents. Finally this section introduces the multi-scale approach for mapping ear shapes. The second section of the thesis estimates a template ear, head and torso shape from the shapes available in the SYMARE database. This part of the thesis explains a new procedure for developing the template ear shape. The procedure builds upon the multi-scale approach, which is described and used in the first part of this research investigation. The template ear and head shapes were examined and verified by comparing the features in the template shapes to corresponding features in the population. In addition, the acoustic and morphological differences of the multi-scale template ear shape to a template ear that was estimated at a single-scale

[1]  R. M. Sachs,et al.  Anthropometric manikin for acoustic research. , 1975, The Journal of the Acoustical Society of America.

[2]  Larry S. Davis,et al.  Rendering localized spatial audio in a virtual auditory space , 2004, IEEE Transactions on Multimedia.

[3]  Craig T. Jin,et al.  Generating a morphable model of ears , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4]  Parham Mokhtari,et al.  Comparison of Simulated and Measured HRTFs: FDTD Simulation Using MRI Head Data , 2007 .

[5]  Yufei Tao,et al.  The differential pressure synthesis method for efficient acoustic pressure estimation , 2003 .

[6]  V R Algazi,et al.  Elevation localization and head-related transfer function analysis at low frequencies. , 2001, The Journal of the Acoustical Society of America.

[7]  H. Takemoto,et al.  Mechanism for generating peaks and notches of head-related transfer functions in the median plane. , 2012, The Journal of the Acoustical Society of America.

[8]  Simone Spagnol,et al.  On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  V. Ralph Algazi,et al.  The Use of Head-and-Torso Models for Improved Spatial Sound Synthesis , 2002 .

[10]  R Meddis,et al.  A physical model of sound diffraction and reflections in the human concha. , 1996, The Journal of the Acoustical Society of America.

[11]  Gabriele Gillessen-Kaesbach,et al.  Elements of morphology: Standard terminology for the ear , 2009, American journal of medical genetics. Part A.

[12]  Alain Trouvé,et al.  A Forward Model to Build Unbiased Atlases from Curves and Surfaces , 2008 .

[13]  Naoya Inoue HRTF Modeling using Physical Features , 2005 .

[14]  E. Shaw,et al.  External-ear acoustic models with simple geometry. , 1968, The Journal of the Acoustical Society of America.

[15]  Craig T. Jin,et al.  Large Deformation Diffeomorphic Metric Mapping and Fast-Multipole Boundary Element Method provide new insights for Binaural acoustics , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Marie Chupin,et al.  Analysis of anatomical variability using diffeomorphic iterative centroid in patients with Alzheimer's disease , 2017, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[17]  Ramani Duraiswami,et al.  Fast Multipole Accelerated Boundary Elements for Numerical Computation of the Head Related Transfer Function , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[18]  Piotr Majdak,et al.  Fast multipole boundary element method to calculate head-related transfer functions for a wide frequency range. , 2009, The Journal of the Acoustical Society of America.

[19]  K. S. Park,et al.  A three-dimensional Fourier descriptor for human body representation/reconstruction from serial cross sections. , 1987, Computers and biomedical research, an international journal.

[20]  G. F. Kuhn Model for the interaural time differences in the azimuthal plane , 1977 .

[21]  A. Hunter,et al.  The external ear: More attention to detail may aid syndrome diagnosis and contribute answers to embryological questions , 2005 .

[22]  Scott H. Foster,et al.  A Virtual Display System for Conveying Three-Dimensional Acoustic Information , 1988 .

[23]  L. G. Copley,et al.  Integral Equation Method for Radiation from Vibrating Bodies , 1967 .

[24]  M. Gardner,et al.  Problem of localization in the median plane: effect of pinnae cavity occlusion. , 1973, The Journal of the Acoustical Society of America.

[25]  Ramani Duraiswami,et al.  A broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation. , 2009, The Journal of the Acoustical Society of America.

[26]  G. F. Miller,et al.  The application of integral equation methods to the numerical solution of some exterior boundary-value problems , 1971, Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences.

[27]  J. C. Middlebrooks,et al.  Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. , 2002, The Journal of the Acoustical Society of America.

[28]  Parham Mokhtari,et al.  Pinna sensitivity patterns reveal reflecting and diffracting surfaces that generate the first spectral notch in the front median plane , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  Joan Alexis Glaunès,et al.  Surface Matching via Currents , 2005, IPMI.

[30]  Piotr Majdak,et al.  Numerical calculation of listener-specific head-related transfer functions and sound localization: Microphone model and mesh discretization. , 2015, The Journal of the Acoustical Society of America.

[31]  E. Langendijk,et al.  Contribution of spectral cues to human sound localization. , 1999, The Journal of the Acoustical Society of America.

[32]  Robert A. Butler,et al.  On the relative usefulness of monaural and binaural cues in locating sound in space , 1969 .

[33]  Thomas Vetter,et al.  Face Recognition Based on Fitting a 3D Morphable Model , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  L. Younes,et al.  On the metrics and euler-lagrange equations of computational anatomy. , 2002, Annual review of biomedical engineering.

[35]  Laurent Younes,et al.  Geodesic Interpolating Splines , 2001, EMMCVPR.

[36]  G. Wahba Spline Interpolation and Smoothing on the Sphere , 1981 .

[37]  F. Wightman,et al.  The dominant role of low-frequency interaural time differences in sound localization. , 1992, The Journal of the Acoustical Society of America.

[38]  Weimin Han,et al.  Stiff differential equations , 2011 .

[39]  J. C. Middlebrooks,et al.  Psychophysical customization of directional transfer functions for virtual sound localization. , 2000, The Journal of the Acoustical Society of America.

[40]  F L Wightman,et al.  Headphone simulation of free-field listening. II: Psychophysical validation. , 1989, The Journal of the Acoustical Society of America.

[41]  R. Butler,et al.  Factors that influence the localization of sound in the vertical plane. , 1968, The Journal of the Acoustical Society of America.

[42]  Yu Liu,et al.  Calculation of Individualized Near-Field Head-Related Transfer Function Database Using Boundary Element Method , 2013 .

[43]  S. Carlile,et al.  Enabling Individualized Virtual Auditory Space using Morphological Measurments , 2000 .

[44]  Simone Spagnol,et al.  Structural modeling of pinna-related transfer functions , 2010 .

[45]  J. C. Middlebrooks Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency. , 1999, The Journal of the Acoustical Society of America.

[46]  Ramani Duraiswami,et al.  Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation. , 2010, The Journal of the Acoustical Society of America.

[47]  R. Butler,et al.  Spectral cues utilized in the localization of sound in the median sagittal plane. , 1977, The Journal of the Acoustical Society of America.

[48]  Carl Hetherington,et al.  Parameterizing human pinna shape for the estimation of head-related transfer functions , 2003 .

[49]  Michael I. Miller,et al.  Large Deformation Diffeomorphic Metric Curve Mapping , 2008, International Journal of Computer Vision.

[50]  E. Shaw,et al.  Sound pressure generated in an external-ear replica and real human ears by a nearby point source. , 1968, The Journal of the Acoustical Society of America.

[51]  Bo-sun Xie,et al.  Head-Related Transfer Function and Virtual Auditory Display: E-Book , 2013 .

[52]  Alain Trouvé,et al.  Geodesic Shooting for Computational Anatomy , 2006, Journal of Mathematical Imaging and Vision.

[53]  F L Wightman,et al.  Localization using nonindividualized head-related transfer functions. , 1993, The Journal of the Acoustical Society of America.

[54]  Jens Blauert,et al.  The Technology of Binaural Listening , 2013 .

[55]  Gregory H. Wakefield,et al.  Introduction to Head-Related Transfer Functions (HRTFs): Representations of HRTFs in Time, Frequency, and Space , 2001 .

[56]  Durand R. Begault,et al.  3-D Sound for Virtual Reality and Multimedia Cambridge , 1994 .

[57]  J. C. Middlebrooks,et al.  Individual differences in external-ear transfer functions reduced by scaling in frequency. , 1999, The Journal of the Acoustical Society of America.

[58]  M. A. Blanco,et al.  Evaluation of the rotation matrices in the basis of real spherical harmonics , 1997 .

[59]  R. Häusler,et al.  Sound localization in subjects with impaired hearing. Spatial-discrimination and interaural-discrimination tests. , 1983, Acta oto-laryngologica. Supplementum.

[60]  Alain Trouvé,et al.  Computing Large Deformation Metric Mappings via Geodesic Flows of Diffeomorphisms , 2005, International Journal of Computer Vision.

[61]  Dietmar Saupe,et al.  3D Model Retrieval with Spherical Harmonics and Moments , 2001, DAGM-Symposium.

[62]  B. Shinn-Cunningham Applications of virtual auditory displays , 1998, Proceedings of the 20th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Vol.20 Biomedical Engineering Towards the Year 2000 and Beyond (Cat. No.98CH36286).

[63]  Ping Yan,et al.  Ear Biometrics Using 2D and 3D Images , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[64]  Mark R. Anderson,et al.  Direct comparison of the impact of head tracking, reverberation, and individualized head-related transfer functions on the spatial perception of a virtual speech source. , 2001, Journal of the Audio Engineering Society. Audio Engineering Society.

[65]  Craig T. Jin,et al.  Creating the Sydney York Morphological and Acoustic Recordings of Ears Database , 2014, IEEE Transactions on Multimedia.

[66]  Kazuya Takeda,et al.  Estimation of HRTFs on the horizontal plane using physical features , 2007 .

[67]  Alain Trouvé,et al.  Diffeomorphic matching of distributions: a new approach for unlabelled point-sets and sub-manifolds matching , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[68]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[69]  Olivier Colliot,et al.  Template Estimation for Large Database: A Diffeomorphic Iterative Centroid Method Using Currents , 2013, GSI.

[70]  Yuvi Kahana,et al.  Numerical modelling of the spatial acoustic response of the human pinna , 2006 .

[71]  U. Grenander,et al.  Computational anatomy: an emerging discipline , 1998 .

[72]  Peter Balazs,et al.  Multiple Exponential Sweep Method for Fast Measurement of Head-Related Transfer Functions , 2007 .

[73]  Michael I. Miller,et al.  Diffeomorphic metric surface mapping in subregion of the superior temporal gyrus , 2007, NeuroImage.

[74]  L. Younes,et al.  Statistics on diffeomorphisms via tangent space representations , 2004, NeuroImage.

[75]  Szymon Rusinkiewicz,et al.  Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors , 2003, Symposium on Geometry Processing.

[76]  F. Asano,et al.  Role of spectral cues in median plane localization. , 1990, The Journal of the Acoustical Society of America.

[77]  A D Musicant,et al.  The influence of pinnae-based spectral cues on sound localization. , 1984, The Journal of the Acoustical Society of America.

[78]  J. Reijniers,et al.  Simulated head related transfer function of the phyllostomid bat Phyllostomus discolor. , 2008, The Journal of the Acoustical Society of America.

[79]  Simon Carlile,et al.  The nature and distribution of errors in sound localization by human listeners , 1997, Hearing Research.

[80]  S. Joshi,et al.  Template estimation form unlabeled point set data and surfaces for Computational Anatomy , 2006 .

[81]  Kazuya Takeda,et al.  Evaluation of HRTFs estimated using physical features , 2005 .

[82]  George Chertock,et al.  Sound Radiation from Vibrating Surfaces , 1963 .

[83]  L. Younes Shapes and Diffeomorphisms , 2010 .

[84]  Yufei Tao,et al.  Three-dimensional elliptic Fourier methods for the parameterization of human pinna shape , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[85]  D R Perrott,et al.  Minimum audible angle thresholds for sources varying in both elevation and azimuth. , 1990, The Journal of the Acoustical Society of America.

[86]  J. C. Middlebrooks Narrow-band sound localization related to external ear acoustics. , 1992, The Journal of the Acoustical Society of America.

[87]  Olivier Colliot,et al.  Diffeomorphic Iterative Centroid Methods for Template Estimation on Large Datasets , 2014 .

[88]  J. Hebrank,et al.  Spectral cues used in the localization of sound sources on the median plane. , 1974, The Journal of the Acoustical Society of America.

[89]  Michael I. Miller,et al.  Group Actions, Homeomorphisms, and Matching: A General Framework , 2004, International Journal of Computer Vision.

[90]  Jean-Marc Chassery,et al.  Approximated Centroidal Voronoi Diagrams for Uniform Polygonal Mesh Coarsening , 2004, Comput. Graph. Forum.

[91]  Elizabeth M. Wenzel,et al.  Localization with non-individualized virtual acoustic display cues , 1991, CHI.

[92]  Michael I. Miller,et al.  Landmark matching via large deformation diffeomorphisms , 2000, IEEE Trans. Image Process..

[93]  Philip A. Nelson,et al.  Boundary element simulations of the transfer function of human heads and baffled pinnae using accurate geometric models , 2007 .

[94]  Sean F. Wu,et al.  The Boundary Element Method in Acoustics , 2000 .

[95]  Michael Friis Sørensen,et al.  Head-Related Transfer Functions of Human Subjects , 1995 .

[96]  Larry S. Davis,et al.  HRTF personalization using anthropometric measurements , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[97]  W. G. Gardner,et al.  3-D Audio Using Loudspeakers , 1998 .

[98]  Claire Cury,et al.  Statistical shape analysis of the anatomical variability of the human hippocampus in large populations. , 2015 .

[99]  Rémy Prost,et al.  Generic Remeshing of 3D Triangular Meshes with Metric-Dependent Discrete Voronoi Diagrams , 2008, IEEE Transactions on Visualization and Computer Graphics.

[100]  Andrew W. Fitzgibbon,et al.  What Shape Are Dolphins? Building 3D Morphable Models from 2D Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[101]  Yufei Tao,et al.  A Study on Head Shape Simplification Using Spherical Harmonics for HRTF Computation at Low Frequencies , 2003 .

[102]  Parham Mokhtari,et al.  Acoustic sensitivity to micro-perturbations of KEMAR's pinna surface geometry , 2010 .

[103]  Gregory H. Wakefield,et al.  Spatial frequency response surfaces: an alternative visualization tool for head-related transfer functions (HRTFs) , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[104]  Anthony I. Tew,et al.  Morphoacoustic perturbation analysis: principles and validation , 2012 .

[105]  Volker Schönefeld Spherical Harmonics , 2019, An Introduction to Radio Astronomy.

[106]  Craig T. Jin,et al.  A multiscale LDDMM template algorithm for studying ear shape variations , 2014, 2014 8th International Conference on Signal Processing and Communication Systems (ICSPCS).