Personalization of Head-Related Transfer Function Based on Sparse Principle Component Analysis and Sparse Representation of 3D Anthropometric Parameters

Personalization of head-related transfer functions (HRTFs) is an essential task in the research on virtual hearing. In this paper, a method based on sparse principal component analysis (SPCA) and sparse representation (SR) was proposed to personalize HRTFs. The fundamental assumption is that an equivalent sparse combination can express the same anthropometric parameters. SPCA was first used to reduce the dimensionality of three-dimensional anthropometric parameters, and the reduced physiological parameters were used to reconstruct a physiological parameter database. SR was performed on the reconstructed physiological parameters of all subjects. For each subject, SR was used on the anthropometric parameters that were consistent with those in the reconstruction database. The matching pursuit algorithm was used to obtain the subjects in the database that had the same SR as the subject, and the HRTFs of the matched subject were used as the HRTFs of the new subject. The effect of the proposed method was evaluated by spectral distortion. The results showed that the method did better than others regardless of whether the Chinese pilots’ database or the CIPIC database was used, especially in the 0–8 kHz bandwidth.

[1]  Simone Spagnol,et al.  On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Augusto Sarti,et al.  HRTF personalization based on weighted sparse representation of anthropometric features , 2017, 2017 International Conference on 3D Immersion (IC3D).

[3]  Augusto Sarti,et al.  Analyzing notch patterns of head related transfer functions in CIPIC and SYMARE databases , 2016, 2016 24th European Signal Processing Conference (EUSIPCO).

[4]  Ivan Tashev,et al.  HRTF phase synthesis via sparse representation of anthropometric features , 2014, 2014 Information Theory and Applications Workshop (ITA).

[5]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[6]  Zhenyang Wu,et al.  HRTF personalization based on artificial neural network in individual virtual auditory space , 2008 .

[7]  Gavriel Salvendy,et al.  Individualized head-related transfer functions based on population grouping. , 2008, The Journal of the Acoustical Society of America.

[8]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[9]  Gavriel Salvendy,et al.  Improved method to individualize head-related transfer function using anthropometric measurements , 2008 .

[10]  F L Wightman,et al.  Localization using nonindividualized head-related transfer functions. , 1993, The Journal of the Acoustical Society of America.

[11]  Edgar A. Torres-Gallegos,et al.  Personalization of head-related transfer functions (HRTF) based on automatic photo-anthropometry and inference from a database , 2015 .

[12]  Larry S. Davis,et al.  HRTF personalization using anthropometric measurements , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[13]  Ee-Leng Tan,et al.  On the preprocessing and postprocessing of HRTF individualization based on sparse representation of anthropometric features , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  B F Katz,et al.  Boundary element method calculation of individual head-related transfer function. I. Rigid model calculation. , 2001, The Journal of the Acoustical Society of America.

[15]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[16]  Kazuya Takeda,et al.  Estimation of HRTFs on the horizontal plane using physical features , 2007 .

[17]  John C. Platt,et al.  HRTF magnitude synthesis via sparse representation of anthropometric features , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Yanyan Wang,et al.  Head-Related Transfer Function Database of Chinese Male Pilots , 2016 .

[19]  Federico Avanzini,et al.  Frequency Estimation Of The First Pinna Notch In Head-Related Transfer Functions With A Linear Anthropometric Model , 2015 .

[20]  Bill Gardner,et al.  HRTF Measurements of a KEMAR Dummy-Head Microphone , 1994 .