Guidelines for selecting microphones for human voice production research.

PURPOSE This tutorial addresses fundamental characteristics of microphones (frequency response, frequency range, dynamic range, and directionality), which are important for accurate measurements of voice and speech. METHOD Technical and voice literature was reviewed and analyzed. The following recommendations on desirable microphone characteristics were formulated: The frequency response of microphones should be flat (i.e., variation of less than 2 dB) within the frequency range between the lowest expected fundamental frequency of voice and the highest spectral component of interest. The equivalent noise level of the microphones is recommended to be at least 15 dB lower than the sound level of the softest phonations. The upper limit of the dynamic range of the microphone should be above the sound level of the loudest phonations. Directional microphones should be placed at the distance that corresponds to their maximally flat frequency response, to avoid the proximity effect; otherwise, they will be unsuitable for spectral and level measurements. Numerical values for these recommendations were derived for the microphone distances of 30 cm and 5 cm. CONCLUSIONS The recommendations, while preliminary and in need of further numerical justification, should provide the basis for better accuracy and repeatability of studies on voice and speech production in the future.

[1]  F L Wuyts,et al.  Normative voice range profiles of male and female professional voice users. , 2002, Journal of voice : official journal of the Voice Foundation.

[2]  E. Vilkman,et al.  Effects of voice training and voice hygiene education on acoustic and perceptual speech parameters and self-reported vocal well-being in female teachers , 2008, Logopedics, phoniatrics, vocology.

[3]  P. Dejonckere,et al.  A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques , 2001, European Archives of Oto-Rhino-Laryngology.

[4]  H. Herzel,et al.  Nonlinear phenomena in contemporary vocal music. , 2004, Journal of voice : official journal of the Voice Foundation.

[5]  Paavo Alku,et al.  Monopitched Expression of Emotions in Different Vowels , 2008, Folia Phoniatrica et Logopaedica.

[6]  Gary W. Elko,et al.  A History of Consumer Microphones: The Electret Condenser Microphone Meets Micro-Electro-Mechanical-Systems , 2009 .

[7]  Erkki Vilkman,et al.  Acoustic measures and self-reports of vocal fatigue by female teachers. , 2008, Journal of voice : official journal of the Voice Foundation.

[8]  John Watkinson,et al.  The art of sound reproduction , 1998 .

[9]  Sten Ternström,et al.  Personal computers in the voice laboratory: Part two—audio devices , 2009 .

[10]  D. W. Farnsworth,et al.  Exploration of Pressure Field Around the Human Head During Speech , 1938 .

[11]  H. K. Schutte,et al.  Differences in phonetogram features between male and female subjects with and without vocal training. , 1995, Journal of voice : official journal of the Voice Foundation.

[12]  E. Björkner Musical theater and opera singing--why so different? A study of subglottal pressure, voice source, and formant frequency characteristics. , 2008, Journal of voice : official journal of the Voice Foundation.

[13]  C. Neuschaefer-Rube,et al.  Sound pressure level examinations of the calling and speaking voice in healthy persons and in patients with hyperfunctional dysphonia , 1998 .

[14]  W S Winholtz,et al.  Miniature head-mounted microphone for voice perturbation analysis. , 1997, Journal of speech, language, and hearing research : JSLHR.

[15]  Densil Cabrera,et al.  Recording the Operatic Voice for Acoustic Analysis , 2002 .

[16]  Gerhard M. Sessler,et al.  Self‐Biased Condenser Microphone with High Capacitance , 1962 .

[17]  M. Rothenberg A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. , 1970, The Journal of the Acoustical Society of America.

[18]  W S Winholtz,et al.  Effect of microphone type and placement on voice perturbation measurements. , 1993, Journal of speech and hearing research.

[19]  Nancye C. Roussel,et al.  The clinical utility of the soft phonation index , 2006, Clinical linguistics & phonetics.

[20]  Johan Sundberg,et al.  Relation between perceived voice register and flow glottogram parameters in males. , 2008, The Journal of the Acoustical Society of America.

[21]  Ronald J. Baken,et al.  Clinical measurement of speech and voice , 1987 .

[22]  Dimitar D Deliyski,et al.  Regression Tree Approach to Studying Factors Influencing Acoustic Voice Analysis , 2006, Folia Phoniatrica et Logopaedica.

[23]  Estella Ma,et al.  Reliability of speaking and maximum voice range measures in screening for dysphonia. , 2007, Journal of voice : official journal of the Voice Foundation.

[24]  J. Švec,et al.  Comparison of biomechanical modeling of register transitions and voice instabilities with excised larynx experiments. , 2007, The Journal of the Acoustical Society of America.

[25]  Jan G. Svec,et al.  Basic requirements on microphones for voice recordings , 2009, MAVEBA.

[26]  Kenzo Miura,et al.  Sensitivity change with long-term preservation and practical use of electret condenser microphone , 2006 .

[27]  W Seidner,et al.  Recommendation by the Union of European Phoniatricians (UEP): standardizing voice area measurement/phonetography. , 1983, Folia phoniatrica.

[28]  A. Laukkanen,et al.  Acoustic and perceptual analyses of Brazilian male actors' and nonactors' voices: long-term average spectrum and the "actor's formant". , 2008, Journal of voice : official journal of the Voice Foundation.

[29]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[30]  Dimitar D. Deliyski,et al.  Acoustic model and evaluation of pathological voice production , 1993, EUROSPEECH.

[31]  Josef Merhaut,et al.  Theory of electroacoustics , 1981 .

[32]  H. K. Schutte,et al.  International association of logopedics and phoniatrics (IALP) voice committee discussion of assessment topics , 1992 .

[33]  H. K. Schutte,et al.  A subharmonic vibratory pattern in normal vocal folds. , 1996, Journal of speech and hearing research.

[34]  K Honda,et al.  Acoustic characteristics of the piriform fossa in models and humans. , 1997, The Journal of the Acoustical Society of America.

[35]  J. Laver,et al.  Acoustic waveform perturbations and voice disorders , 1992 .

[36]  C.E. Shannon,et al.  Communication in the Presence of Noise , 1949, Proceedings of the IRE.

[37]  M. D. Fresneda,et al.  Acoustic and Perceptual Indicators of Normal and Pathological Voice , 2003, Folia Phoniatrica et Logopaedica.

[38]  Anne-Maria Laukkanen,et al.  Assessment of Vocal Capacity of Finnish University Students , 2008, Folia Phoniatrica et Logopaedica.

[39]  Gunnar Fant,et al.  Acoustic analysis and synthesis of speech with applications to Swedish , 1959 .

[40]  W. Bigenzahn,et al.  Influence of glottal closure configuration on vocal efficacy in young normal-speaking women. , 2003, Journal of Voice.

[41]  M. A. Palmer,et al.  Effects of environmental noise on computer-derived voice estimates from female speakers. , 2000, Journal of voice : official journal of the Voice Foundation.

[42]  Ingo R Titze,et al.  Comparison of the produced and perceived voice range profiles in untrained and trained classical singers. , 2006, Journal of voice : official journal of the Voice Foundation.