A study on amplitude variation of bone conducted speech compared to air conducted speech

This paper investigates the amplitude variation of bone conducted (BC) speech compared to air conducted (AC) speech. During vocalization, vibrations travel through the vocal tract wall and skull bone, which can be captured by placing a bone-conductive microphone on the talker's head. Amplitude of this recorded BC speech is influenced by the mechanical properties of bone conduction pathways. This influence has relation with the vocal tract shape that determines the resonances of the vocal tract filter. Referring the vocal tract output as AC speech for simplicity, amplitude variation of BC speech can be described with respect to the location of the formants of AC speech. In this paper, amplitude variation of BC speech of Japanese vowels and long utterances have been investigated by exploiting the locations of first two formants of AC speech. Our observation suggests that when the first formant is very low with higher second formant, the relative amplitude of BC speech is amplified. As opposed to this, relatively higher first formant and lower second formant of AC speech cause reduction of the relative BC amplitude.

[1]  Raymond M. Stanley,et al.  Intelligibility of Bone-conducted Speech at Different Locations Compared to Air-conducted Speech , 2022 .

[2]  William A. Ahroon,et al.  Speech Intelligibility with Acoustic and Contact Microphones , 2005 .

[3]  Tomasz Letowski,et al.  The effect of bone conduction microphone locations on speech intelligibility and sound quality. , 2011, Applied ergonomics.

[4]  T. Shimamura,et al.  A reconstruction filter for bone-conducted speech , 2005, 48th Midwest Symposium on Circuits and Systems, 2005..

[5]  Maranda McBride,et al.  Using the Callsign Acquisition Test (CAT) to compare the speech intelligibility of air versus bone conduction , 2007 .

[6]  Paul Boersma,et al.  Praat: doing phonetics by computer , 2003 .

[7]  Raymond M. Stanley,et al.  Towards a transfer function used to adjust audio for bone‐conduction transducers , 2008 .

[8]  Tetsuya Shimamura,et al.  Low-frequency band noise suppression using bone conducted speech , 2011, Proceedings of 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.

[9]  Stefan Stenfelt,et al.  Bone-Conducted Sound: Physiological and Clinical Aspects , 2005, Otology & neurotology : official publication of the American Otological Society, American Neurotology Society [and] European Academy of Otology and Neurotology.

[10]  Paula P. Henry,et al.  Spatial audio through a bone conduction interface , 2006, International journal of audiology.

[11]  Zicheng Liu,et al.  Direct filtering for air- and bone-conductive microphones , 2004, IEEE 6th Workshop on Multimedia Signal Processing, 2004..

[12]  Stefan Stenfelt,et al.  Hearing one's own voice during phoneme vocalization--transmission by air and bone conduction. , 2010, The Journal of the Acoustical Society of America.

[13]  Georg v. Békésy,et al.  The Structure of the Middle Ear and the Hearing of One's Own Voice by Bone Conduction , 1949 .

[14]  Stefan Stenfelt,et al.  Transmission properties of bone conducted sound: measurements in cadaver heads. , 2005, The Journal of the Acoustical Society of America.

[15]  Kazuaki Yano,et al.  A self-organizing map with twin units capable of describing a nonlinear input-output relation applied to speech code vector mapping , 2007, Inf. Sci..

[16]  A Tjellström,et al.  The mechanical point impedance of the human head, with and without skin penetration. , 1986, The Journal of the Acoustical Society of America.

[17]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .