Bayesian Analysis of Phoneme Confusion Matrices

This paper presents a parametric Bayesian approach to the statistical analysis of phoneme confusion matrices measured for groups of individual listeners in one or more test conditions. Two different bias problems in conventional estimation of mutual information are analyzed and explained theoretically. Evaluations with synthetic datasets indicate that the proposed Bayesian method can give satisfactory estimates of mutual information and response probabilities, even for phoneme confusion tests using a very small number of test items for each phoneme category. The proposed method can reveal overall differences in performance between two test conditions with better power than conventional Wilcoxon significance tests or conventional confidence intervals. The method can also identify sets of confusion-matrix cells that are credibly different between two test conditions, with better power than a similar approximate frequentist method.
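
One of the bias problems mentioned above can be illustrated with a small simulation. The sketch below is not the paper's model. It assumes a single listener, a fixed number of test items per phoneme category, and an independent Dirichlet posterior for each stimulus row with a prior pseudo-count of 0.5, chosen here only for illustration. The function names (`mutual_information`, `posterior_mi_samples`) are hypothetical. It shows that the conventional plug-in estimate of mutual information tends to be positive even when responses are pure guessing, and how posterior samples of mutual information could be drawn instead.

```python
import numpy as np

def mutual_information(joint):
    """Mutual information in bits of a joint table (rows: stimuli, columns: responses)."""
    joint = np.asarray(joint, dtype=float)
    joint = joint / joint.sum()                  # normalise counts or weights to probabilities
    p_stim = joint.sum(axis=1, keepdims=True)    # marginal over responses, shape (K, 1)
    p_resp = joint.sum(axis=0, keepdims=True)    # marginal over stimuli, shape (1, K)
    mask = joint > 0                             # empty cells contribute zero
    ratio = joint[mask] / (p_stim @ p_resp)[mask]
    return float(np.sum(joint[mask] * np.log2(ratio)))

def posterior_mi_samples(counts, prior=0.5, n_samples=2000, seed=None):
    """Draw mutual-information samples from independent Dirichlet posteriors per row.

    Assumption for this sketch: stimulus probabilities are fixed by the test
    design and taken from the row totals; `prior` is an illustrative pseudo-count.
    """
    rng = np.random.default_rng(seed)
    counts = np.asarray(counts, dtype=float)
    weights = counts.sum(axis=1) / counts.sum()  # presentation frequency of each stimulus
    samples = np.empty(n_samples)
    for i in range(n_samples):
        cond = np.vstack([rng.dirichlet(row + prior) for row in counts])
        samples[i] = mutual_information(cond * weights[:, None])
    return samples

# Simulate a listener who guesses at random: the true mutual information is 0 bits.
rng = np.random.default_rng(0)
n_categories, items_per_category = 4, 5
guessing = np.full(n_categories, 1.0 / n_categories)

plugin_estimates = []
for _ in range(200):
    counts = np.vstack([rng.multinomial(items_per_category, guessing)
                        for _ in range(n_categories)])
    plugin_estimates.append(mutual_information(counts))
mean_plugin = float(np.mean(plugin_estimates))
print("mean plug-in MI over 200 simulated tests (true value 0 bits):", mean_plugin)

# Posterior samples for the last simulated confusion matrix.
samples = posterior_mi_samples(counts, seed=1)
print("posterior 95% interval for MI (bits):", np.quantile(samples, [0.025, 0.975]))
```

With few test items per category, the plug-in average stays clearly above zero even though the simulated listener conveys no information. The posterior samples give an interval rather than a single point value, but mutual information is non-negative, so this simple posterior is not free of bias near zero either.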
