Evaluating iPhone Recordings for Acoustic Voice Assessment

Aims: This study examined the viability of using iPhone recordings for acoustic measurements of voice quality. Methods: Acoustic measures were compared between voice signals simultaneously recorded from 11 normal speakers (6 females and 5 males) through an iPhone (model A1303, Apple, USA) and a comparison recording system. Comparisons were also conducted between the pre- and post-operative voices recorded from 10 voice patients (4 females and 6 males) through the iPhone. Participants aged between 27 and 79 years. Results: Measures from iPhone and comparison signals were found to be highly correlated. Findings of the effects of vowel type on the selected measures were consistent between the two recording systems and congruent with previous findings. Analysis of the patient data revealed that a selection of acoustic measures, such as vowel space area and voice perturbation measures, consistently demonstrated a positive change following phonosurgery. Conclusion: The present findings indicated that the iPhone device tested was useful for tracking voice changes for clinical management. Preliminary findings regarding factors such as gender and type of pathology suggest that intra-subject, instead of norm-referenced, comparisons of acoustic measures would be more useful in monitoring the progression of a voice disorder or tracking the treatment effect.

[1]  M P Karnell,et al.  Comparison of acoustic voice perturbation measures among three independent voice laboratories. , 1991, Journal of speech and hearing research.

[2]  D. Childers,et al.  Acoustic correlates of vocal quality. , 1990, Journal of speech and hearing research.

[3]  M P Gelfer,et al.  Fundamental frequency, intensity, and vowel selection: effects on measures of phonatory stability. , 1995, Journal of speech and hearing research.

[4]  J Kreiman,et al.  Comparison of voice analysis systems for perturbation measurement. , 1993, Journal of speech and hearing research.

[5]  K. Omori,et al.  Singing power ratio: quantitative evaluation of singing voice quality. , 1996, Journal of voice : official journal of the Voice Foundation.

[6]  B. Atal Automatic Speaker Recognition Based on Pitch Contours , 1969 .

[7]  A Löfqvist,et al.  Long-time average spectrum of speech and voice analysis. , 1987, Folia phoniatrica.

[8]  W S Winholtz,et al.  Suitability of minidisc (MD) recordings for voice perturbation analysis. , 1998, Journal of voice : official journal of the Voice Foundation.

[9]  J Jiang,et al.  Effect of tape recording on perturbation measures. , 1998, Journal of speech, language, and hearing research : JSLHR.

[10]  M. Maves,et al.  Acoustic Characteristics of Post-Thyroplasty Patients , 1992, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[11]  G. Fairbanks Voice and articulation drillbook , 1960 .

[12]  Joseph S. Perkell,et al.  Phonatory function associated with hyperfunctionally related vocal fold lesions , 1990 .

[13]  P. Lieberman Perturbations in Vocal Pitch , 1960 .

[14]  Directional perturbation factors for jitter and for shimmer. , 1984, Journal of communication disorders.

[15]  James D Garnett,et al.  Perceptual evaluation of voice quality and its correlation with acoustic measurements. , 2004, Journal of voice : official journal of the Voice Foundation.

[16]  Dimitar D Deliyski,et al.  Influence of data acquisition environment on accuracy of acoustic voice quality measurements. , 2005, Journal of voice : official journal of the Voice Foundation.

[17]  P. Milenkovic,et al.  Least mean square measures of voice perturbation. , 1987, Journal of speech and hearing research.

[18]  H. Hoffman,et al.  Reliability of clinician-based (GRBAS and CAPE-V) and patient-based (V-RQOL and IPVI) documentation of voice disorders. , 2007, Journal of voice : official journal of the Voice Foundation.

[19]  J. Hufnagle,et al.  An investigation of the relationship between speaking fundamental frequency and vocal quality improvement. , 1984, Journal of communication disorders.

[20]  Y. Horii Fundamental frequency perturbation observed in sustained phonation. , 1979, Journal of speech and hearing research.

[21]  M. Ng,et al.  Some aerodynamic and acoustic characteristics of acute laryngitis. , 1997, Journal of voice : official journal of the Voice Foundation.

[22]  Y Horii,et al.  Cigarette smoking and voice fundamental frequency. , 1982, Journal of communication disorders.

[23]  Alexander Stojadinovic,et al.  Clinical versus laboratory ratings of voice using the CAPE-V. , 2011, Journal of voice : official journal of the Voice Foundation.

[24]  J F Deem,et al.  The automatic extraction of pitch perturbation using microcomputers: some methodological considerations. , 1989, Journal of speech and hearing research.

[25]  P. Kuhl,et al.  The effect of reduced vowel working space on speech intelligibility in Mandarin-speaking young adults with cerebral palsy. , 2005, The Journal of the Acoustical Society of America.

[26]  I. Titze,et al.  Comparison of Fo extraction methods for high-precision voice perturbation measurements. , 1993, Journal of speech and hearing research.

[27]  V. Wolfe,et al.  Acoustic correlates of dysphonia: type and severity. , 1997, Journal of communication disorders.

[28]  Shimon Sapir,et al.  Articulatory changes in muscle tension dysphonia: evidence of vowel space expansion following manual circumlaryngeal therapy. , 2009, Journal of communication disorders.

[29]  W S Winholtz,et al.  Effect of microphone type and placement on voice perturbation measurements. , 1993, Journal of speech and hearing research.

[30]  G. de Krom,et al.  Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. , 1995, Journal of speech and hearing research.

[31]  Y. Koike Vowel amplitude modulations in patients with laryngeal diseases. , 1969, The Journal of the Acoustical Society of America.

[32]  Youri Maryn,et al.  Perturbation Measures of Voice: A Comparative Study between Multi-Dimensional Voice Program and Praat , 2009, Folia Phoniatrica et Logopaedica.

[33]  M P Karnell,et al.  Comparison of fundamental frequency and perturbation measurements among three analysis systems. , 1995, Journal of voice : official journal of the Voice Foundation.

[34]  M P Gelfer,et al.  Comparisons of jitter, shimmer, and signal-to-noise ratio from directly digitized versus taped voice samples. , 1995, Journal of voice : official journal of the Voice Foundation.

[35]  R L Beckett,et al.  Pitch perturbation as a function of subjective vocal constriction. , 1969, Folia phoniatrica.

[36]  Janet Slifka,et al.  Towards models of phonation , 2001, J. Phonetics.

[37]  Cecyle Perry Carson,et al.  The effect of noise on computer-aided measures of voice: a comparison of CSpeechSP and the Multi-Dimensional Voice Program software using the CSL 4300B Module and Multi-Speech for Windows. , 2003, Journal of voice : official journal of the Voice Foundation.

[38]  D. Berry,et al.  Fundamental frequency stability in functional dysphonia. , 1993, Acta oto-laryngologica.

[39]  Ingo R. Titze,et al.  Role of the thyroarytenoid muscle in regulation of fundamental frequency , 1989 .

[40]  D. Whalen,et al.  The universality of intrinsic F0 of vowels , 1995 .

[41]  N. Roy Speaking Fundamental Frequency ( SFF ) Changes Following Successful Management of Functional Dysphonia , 2006 .