Towards increasing speech recognition error rates

[1]  Yochai Konig,et al.  Remap: recursive estimation and maximization of a posteriori probabilities in transition-based speech recognition , 1996 .

[2]  Yochai Konig,et al.  REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition , 1995, NIPS.

[3]  Philip C. Woodland,et al.  Rapid speaker adaptation using model prediction , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Steve Renals,et al.  Efficient search using posterior phone probability estimates , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Steve Renals,et al.  Recent improvements to the ABBOT large vocabulary CSR system , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Hynek Hermansky,et al.  Speech enhancement based on temporal processing , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[7]  Steven Greenberg,et al.  Stochastic perceptual models of speech , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Jordan Cohen,et al.  Vocal tract normalization in speech recognition: Compensating for systematic speaker variability , 1995 .

[9]  Hervé Bourlard,et al.  Neural networks for statistical recognition of continuous speech , 1995, Proc. IEEE.

[10]  Jont B. Allen,et al.  How do humans process and recognize speech? , 1993, IEEE Trans. Speech Audio Process..

[11]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[12]  Aaron E. Rosenberg,et al.  Cepstral channel normalization techniques for HMM-based speaker verification , 1994, ICSLP.

[13]  Jayant M. Naik,et al.  Connected digit recognition using connectionist probability estimators and mixture-Gaussian densities , 1994, ICSLP.

[14]  James R. Glass,et al.  Statistical trajectory models for phonetic recognition , 1994, ICSLP.

[15]  H. Hermansky,et al.  Temporal masking in automatic speech recognition , 1994 .

[16]  Leonardo Neumeyer,et al.  Probabilistic optimum filtering for robust speech recognition , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Richard M. Schwartz,et al.  Adaptation to new microphones using tied-mixture normalization , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Robert A. Jacobs,et al.  Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.

[19]  Xiaodong Sun,et al.  Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states , 1994, IEEE Trans. Speech Audio Process..

[20]  Yoshua Bengio,et al.  An Input Output HMM Architecture , 1994, NIPS.

[21]  Hervé Bourlard,et al.  Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[22]  Yochai Konig,et al.  A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the WERNICKE project , 1993, EUROSPEECH.

[23]  Dieter Geller,et al.  Improvements in connected digit recognition using linear discriminant analysis and mixture densities , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  Ronald A. Cole,et al.  City name recognition over the telephone , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25]  Hideki Kawahara,et al.  A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[26]  Oded Ghitza,et al.  Hidden Markov models with templates as non-stationary states: an application to speech recognition , 1993, Comput. Speech Lang..

[27]  Paul Duchnowski,et al.  A new structure for automatic speech recognition , 1993 .

[28]  Esther Levin.  Hidden control neural architecture modeling of nonlinear time varying systems and its applications , 1993, IEEE Trans. Neural Networks.

[29]  Horacio Franco,et al.  Hybrid neural network/hidden Markov model continuous-speech recognition , 1992, ICSLP.

[30]  Elliot Singer,et al.  A speech recognizer using radial basis function neural networks in an HMM framework , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[31]  Mari Ostendorf,et al.  Context modeling with the stochastic segment model , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[32]  George Zavaliagkos,et al.  Improving State-of-the-Art Continuous Speech Recognition Systems Using the N-Best Paradigm with Neural Networks , 1992, HLT.

[33]  Biing-Hwang Juang,et al.  New discriminative training algorithms based on the generalized probabilistic descent method , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[34]  Ronald A. Cole,et al.  Speaker-independent phonetic classification in continuous English letters , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[35]  R. Kompe,et al.  Global optimization of a neural network-hidden Markov model hybrid , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[36]  Frank Fallside,et al.  A recurrent error propagation network speech recognition system , 1991 .

[37]  Brian Hanson,et al.  Regression features for recognition of speech in quiet and in noise , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[38]  Alex Waibel,et al.  Continuous speech recognition using linked predictive neural networks , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[39]  Biing-Hwang Juang,et al.  A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process..

[40]  Bernard Mérialdo,et al.  A Dynamic Language Model for Speech Recognition , 1991, HLT.

[41]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[42]  H. Bourlard,et al.  Links Between Markov Models and Multilayer Perceptrons , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  James K. Baker,et al.  On the interaction between true source, training, and testing language models , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[44]  H. Gish,et al.  A probabilistic approach to the understanding and training of neural network classifiers , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[45]  Hervé Bourlard,et al.  Continuous speech recognition using multilayer perceptrons with hidden Markov models , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[46]  S. M. Peeling,et al.  The ARM continuous speech recognition system , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[47]  A. Waibel,et al.  Connectionist Viterbi training: a new hybrid method for continuous speech recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[48]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[49]  John Makhoul,et al.  Automatic Detection Of New Words In A Large Vocabulary Continuous Speech Recognition System , 1989, HLT.

[50]  J R Cohen,et al.  Application of an auditory model to speech recognition. , 1989, The Journal of the Acoustical Society of America.

[51]  C. Lefebvre,et al.  A comparison of several acoustic representations for speech recognition with degraded and undegraded speech , 1989, International Conference on Acoustics, Speech, and Signal Processing.

[52]  George R. Doddington.  Phonetically sensitive discriminants for improved speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing.

[53]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[54]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[55]  S. Seneff.  A joint synchrony/mean-rate model of auditory speech processing , 1990 .

[56]  A. Nadas,et al.  Decoder selection based on cross-entropies , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[57]  S. Greenberg.  Representation of Speech in the Auditory Periphery , 1988 .

[58]  H. Hermansky,et al.  An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[59]  C. J. Wellekens,et al.  Explicit time correlation in hidden Markov models for speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[60]  A. Poritz,et al.  On hidden Markov models in isolated word recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[61]  Sadaoki Furui,et al.  Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..

[62]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[63]  V.W. Zue,et al.  The use of speech knowledge in automatic speech recognition , 1985, Proceedings of the IEEE.

[64]  L. A. Chistovich.  Central auditory processing of peripheral vowel spectra. , 1985, The Journal of the Acoustical Society of America.

[65]  Steven F. Boll,et al.  Optimal estimators for spectral restoration of noisy speech , 1984, ICASSP.

[66]  Brian A. Hanson,et al.  The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence of interfering speech , 1984, ICASSP.

[67]  Hynek Hermansky,et al.  Analysis and synthesis of speech based on spectral transform linear predictive method , 1983, ICASSP.

[68]  Shozo Makino,et al.  Recognition of consonant based on the perceptron model , 1983, ICASSP.

[69]  Louis A. Liporace,et al.  Maximum likelihood estimation for multivariate observations of Markov sources , 1982, IEEE Trans. Inf. Theory.

[70]  A. B. Poritz,et al.  Linear predictive hidden Markov models and the speech signal , 1982, ICASSP.

[71]  Mats Blomberg,et al.  Effects of emphasizing transitional or stationary parts of the speech signal in a discrete utterance recognition system , 1982, ICASSP.

[72]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences , 1980 .

[73]  J. Lim.  Spectral root homomorphic deconvolution system , 1979 .

[74]  Charles C. Tappert,et al.  Memory and time improvements in a dynamic programming algorithm for matching speech patterns , 1978 .

[75]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper , 1977 .

[76]  Shuichi Itahashi,et al.  Automatic formant extraction utilizing mel scale and equal loudness contour , 1976, ICASSP.

[77]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[78]  P. Mermelstein,et al.  Distance measures for speech recognition, psychological and instrumental , 1976 .

[79]  Ch Chen,et al.  Pattern recognition and artificial intelligence , 1976 .

[80]  Lalit R. Bahl,et al.  Design of a linguistic statistical decoder for the recognition of continuous speech , 1975, IEEE Trans. Inf. Theory.

[81]  T. M. Cannon,et al.  Blind deconvolution through digital signal processing , 1975, Proceedings of the IEEE.

[82]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[83]  M. Sanders.  Handbook of Sensory Physiology , 1975 .

[84]  S. Furui,et al.  Cepstral analysis technique for automatic speaker verification , 1981 .

[85]  Louis C. W. Pols,et al.  Real-Time Recognition of Spoken Words , 1971, IEEE Transactions on Computers.

[86]  R. Plomp,et al.  Perceptual and physical space of vowel sounds. , 1969, The Journal of the Acoustical Society of America.

[87]  J. C. Stevens,et al.  Brightness and loudness as functions of stimulus duration , 1966 .

[88]  W. R. Webster,et al.  Click-evoked response patterns of single units in the medial geniculate body of the cat. , 1966, Journal of neurophysiology.

[89]  I. Good.  Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables , 1963 .

[90]  S. S. Stevens.  On the psychophysical law. , 1957, Psychological review.

[91]  Harvey B. Fletcher,et al.  Speech and hearing in communication , 1953 .