Connectionist Speech Recognition: A Hybrid Approach

From the Publisher: Connectionist Speech Recognition: A Hybrid Approach describes the theory and implementation of a method to incorporate neural network approaches into state-of-the-art continuous speech recognition systems based on Hidden Markov Models (HMMs) to improve their performance. In this framework, neural networks (and in particular, multilayer perceptrons or MLPs) have been restricted to well-defined subtasks of the whole system, i.e., HMM emission probability estimation and feature extraction. The book describes a successful five year international collaboration between the authors. The lessons learned form a case study that demonstrates how hybrid systems can be developed to combine neural networks with more traditional statistical approaches. The book illustrates both the advantages and limitations of neural networks in the framework of a statistical system. Using standard databases and comparing with some conventional approaches, it is shown that MLP probability estimation can improve recognition performance. Other approaches are discussed, though there is no such unequivocal experimental result for these methods. Connectionist Speech Recognition: A Hybrid Approach is of use to anyone intending to use neural networks for speech recognition or within the framework provided by an existing successful statistical approach. This includes research and development groups working in the field of speech recognition, both with standard and neural network approaches, as well as other pattern recognition and/or neural network researchers. This book is also suitable as a text for advanced courses on neural networks or speech processing.

[1]  K. Davis,et al.  Automatic Recognition of Spoken Digits , 1952 .

[2]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[3]  H. Kesten Accelerated Stochastic Approximation , 1958 .

[4]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .

[5]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[6]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[7]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[8]  Shun-ichi Amari,et al.  A Theory of Adaptive Pattern Classifiers , 1967, IEEE Trans. Electron. Comput..

[9]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[10]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[11]  Jack David Cowan,et al.  A mathematical theory of central nervous activity , 1967 .

[12]  G. Golub Least squares, singular values and matrix approximations , 1968 .

[13]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[14]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[15]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[16]  Enrique H. Ruspini,et al.  Numerical methods for fuzzy clustering , 1970, Inf. Sci..

[17]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[18]  G. Stewart Introduction to matrix computations , 1973 .

[19]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[20]  H. Akaike A new look at the statistical model identification , 1974 .

[21]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[22]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[23]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[24]  James K. Baker,et al.  Stochastic modeling as a means of automatic speech recognition. , 1975 .

[25]  R. Bakis Continuous speech recognition via centisecond acoustic states , 1976 .

[26]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[27]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[28]  J. Bunch,et al.  Updating the singular value decomposition , 1978 .

[29]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[30]  K. R. Rao,et al.  Orthogonal Transforms for Digital Signal Processing , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[31]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[32]  James C. Bezdek,et al.  A Convergence Theorem for the Fuzzy ISODATA Clustering Algorithms , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  S. S. Marcus ERIS-context sensitive coding in speech perception , 1981 .

[34]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  L. Rabiner,et al.  A two-pass pattern-recognition approach to isolated word recognition , 1981, The Bell System Technical Journal.

[36]  R. Engle Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation , 1982 .

[37]  Louis A. Liporace,et al.  Maximum likelihood estimation for multivariate observations of Markov sources , 1982, IEEE Trans. Inf. Theory.

[38]  E. Hirschman,et al.  The Moving Target , 1982 .

[39]  Michael D. Brown,et al.  An algorithm for connected word recognition , 1982, ICASSP.

[40]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[41]  L. R. Rabiner,et al.  An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition , 1983, The Bell System Technical Journal.

[42]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[43]  Gene H. Golub,et al.  Matrix computations , 1983 .

[44]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[46]  Hermann Ney,et al.  The use of a one-stage dynamic programming algorithm for connected word recognition , 1984 .

[47]  B.-H. Juang,et al.  On the hidden Markov model and dynamic time warping for speech recognition — A unified view , 1984, AT&T Bell Laboratories Technical Journal.

[48]  Nelson Morgan,et al.  "Ignorance-based" systems , 1984, ICASSP.

[49]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[50]  Bernard Widrow,et al.  Adaptive Signal Processing , 1985 .

[51]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[52]  Pierre A. Devijver,et al.  Baum's forward-backward algorithm revisited , 1985, Pattern Recognit. Lett..

[53]  John Makhoul,et al.  Context-dependent modeling for acoustic-phonetic recognition of continuous speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[54]  L. R. Rabiner,et al.  Recognition of isolated digits using hidden Markov models with continuous mixture densities , 1985, AT&T Technical Journal.

[55]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[56]  Hirotugu Akaike,et al.  Use of statistical models for time series analysis , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[57]  Stephen E. Levinson,et al.  A unified theory of composite pattern analysis for automatic speech recognition , 1986 .

[58]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[59]  Biing-Hwang Juang,et al.  Maximum likelihood estimation for multivariate mixture observations of markov chains , 1986, IEEE Trans. Inf. Theory.

[60]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[61]  Lalit R. Bahl,et al.  Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[62]  Robert W. Brodersen,et al.  An integrated-circuit-based speech recognition system , 1986, IEEE Trans. Acoust. Speech Signal Process..

[63]  C. J. Wellekens,et al.  Global connected digit recognition using Baum-Welch algorithm , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[64]  Richard P. Lippmann,et al.  Two-stage discriminant analysis for improved isolated-word recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[65]  Peter F. Brown,et al.  The acoustic-modeling problem in automatic speech recognition , 1987 .

[66]  John J. Hopfield,et al.  CONCENTRATION INFORMATION IN TIME: ANALOG NEURAL NETWORKS WITH APPLICATIONS TO SPEECH RECOGNITION PROBLEMS. , 1987 .

[67]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[68]  Andreas Noll,et al.  A data-driven organization of the dynamic programming beam search for continuous speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[69]  Pineda,et al.  Generalization of back-propagation to recurrent neural networks. , 1987, Physical review letters.

[70]  Anthony J. Robinson,et al.  Static and Dynamic Error Propagation Networks with Application to Speech Coding , 1987, NIPS.

[71]  Xavier L. Aubert Supervised segmentation with application to speech recognition , 1987, ECST.

[72]  M. J. D. Powell,et al.  Radial basis functions for multivariable interpolation: a review , 1987 .

[73]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[74]  C. J. Wellekens,et al.  Explicit time correlation in hidden Markov models for speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[75]  Esther Levin,et al.  Accelerated Learning in Layered Neural Networks , 1988, Complex Syst..

[76]  Hermann Ney,et al.  Phoneme modelling using continuous mixture densities , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[77]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[78]  Terrence J. Sejnowski,et al.  NETtalk: a parallel network that learns to read aloud , 1988 .

[79]  Teuvo Kohonen,et al.  The 'neural' phonetic typewriter , 1988, Computer.

[80]  Lalit R. Bahl,et al.  A new algorithm for the estimation of hidden Markov model parameters , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[81]  S. M. Peeling,et al.  Isolated digit recognition experiments using the multi-layer perceptron , 1988, Speech Commun..

[82]  Yann LeCun,et al.  A theoretical framework for back-propagation , 1988 .

[83]  S. Greenberg,et al.  The ear as a speech analyzer , 1988 .

[84]  Alex Waibel,et al.  Phoneme recognition: neural networks vs. hidden Markov models vs. hidden Markov models , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[85]  M. B. Priestley,et al.  Non-linear and non-stationary time series analysis , 1990 .

[86]  Alex Waibel,et al.  Phoneme Recognition: Neural Networks vs , 1988 .

[87]  Raymond L. Watrous Learning Algorithms for Connectionist Networks: Applied Gradient Methods of Nonlinear Optimization , 1988 .

[88]  A. Poritz,et al.  Hidden Markov models: a guided tour , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[89]  Robert A. Jacobs,et al.  Increased rates of convergence through learning rate adaptation , 1987, Neural Networks.

[90]  David S. Broomhead,et al.  Multivariable Functional Interpolation and Adaptive Networks , 1988, Complex Syst..

[91]  B. Merialdo Phonetic recognition using hidden Markov models and maximum mutual information training , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[92]  Alberto L. Sangiovanni-Vincentelli,et al.  Efficient Parallel Learning Algorithms for Neural Networks , 1988, NIPS.

[93]  D Zipser,et al.  Learning the hidden structure of speech. , 1988, The Journal of the Acoustical Society of America.

[94]  L. B. Lmeida Backpropagation in perceptrons with feedback , 1988 .

[95]  Pentti Kanerva,et al.  Sparse Distributed Memory , 1988 .

[96]  Hervé Bourlard,et al.  Speech dynamics and recurrent neural networks , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[97]  J. Doyne Farmer,et al.  Exploiting Chaos to Predict the Future and Reduce Noise , 1989 .

[98]  Richard Lippmann,et al.  Review of Neural Networks for Speech Recognition , 1989, Neural Computation.

[99]  Mitch Weintraub,et al.  SRI's DECIPHER System , 1989, HLT.

[100]  Eduardo D. Sontag,et al.  Backpropagation Can Give Rise to Spurious Local Minima Even for Networks without Hidden Layers , 1989, Complex Syst..

[101]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[102]  D. B. Paul,et al.  The Lincoln robust continuous speech recognizer , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[103]  David Haussler,et al.  What Size Net Gives Valid Generalization? , 1989, Neural Computation.

[104]  H. Hackbarth,et al.  Scaly artificial neural networks for speaker-independent recognition of isolated words , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[105]  Ronald J. Williams,et al.  Experimental Analysis of the Real-time Recurrent Learning Algorithm , 1989 .

[106]  Hervé Bourlard,et al.  A Continuous Speech Recognition System Embedding MLP into HMM , 1989, NIPS.

[107]  Yann LeCun,et al.  Improving the convergence of back-propagation learning with second-order methods , 1989 .

[108]  Shigeru Katagiri,et al.  Shift-invariant, multi-category phoneme recognition using Kohonen's LVQ2 , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[109]  Hervé Bourlard,et al.  Generalization and Parameter Estimation in Feedforward Netws: Some Experiments , 1989, NIPS.

[110]  Hervé Bourlard,et al.  Statistical Inference in Multilayer Perceptrons and Hidden Markov Models with Applications in Continuous Speech Recognition , 1989, NATO Neurocomputing.

[111]  Sontag,et al.  Backpropagation separates when perceptrons do , 1989 .

[112]  M. A. Bush,et al.  How limited training data can allow a neural network to outperform an 'optimal' statistical classifier , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[113]  K. Shikano,et al.  Parallelism, hierarchy, scaling in time-delay neural networks for spotting Japanese phonemes CV-syllables , 1989, International 1989 Joint Conference on Neural Networks.

[114]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[115]  Kurt Hornik,et al.  Neural networks and principal component analysis: Learning from examples without local minima , 1989, Neural Networks.

[116]  Hervé Bourlard,et al.  Speech pattern discrimination and multilayer perceptrons , 1989 .

[117]  Geoffrey E. Hinton Connectionist Learning Procedures , 1989, Artif. Intell..

[118]  S. Makram-Ebeid,et al.  A rationalized error back-propagation learning algorithm , 1989, International 1989 Joint Conference on Neural Networks.

[119]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[120]  Richard Rohwer,et al.  The "Moving Targets" Training Algorithm , 1989, NIPS.

[121]  Mei-Yuh Hwang,et al.  The SPHINX speech recognition system , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[122]  James L. McClelland,et al.  Finite State Automata and Simple Recurrent Networks , 1989, Neural Computation.

[123]  Lotfi A. Zadeh,et al.  Phonological structures for speech recognition , 1989 .

[124]  Hy Murveit,et al.  Linguistic constraints in hidden Markov model based speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[125]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[126]  Yoh-Han Pao,et al.  Adaptive pattern recognition and neural networks , 1989 .

[127]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[128]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory, Third Edition , 1989, Springer Series in Information Sciences.

[129]  Philippe Delsarte,et al.  Low Rank Matrices with a Given Sign Pattern , 1989, SIAM J. Discret. Math..

[130]  Ken-ichi Iso,et al.  Speaker-independent word recognition using a neural prediction model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[131]  Jerome R. Bellegarda,et al.  Tied mixture continuous parameter modeling for speech recognition , 1990, IEEE Trans. Acoust. Speech Signal Process..

[132]  Mitch Weintraub,et al.  The decipher speech recognition system , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[133]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[134]  Raymond L. Watrous,et al.  Connected recognition with a recurrent network , 1990, Speech Commun..

[135]  Thomas Jackson,et al.  Neural Computing - An Introduction , 1990 .

[136]  H. Gish,et al.  A probabilistic approach to the understanding and training of neural network classifiers , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[137]  Naftali Tishby,et al.  A dynamical systems approach to speech processing , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[138]  Steve Renals Speech and neural network dynamics , 1990 .

[139]  Athanasios Kehagias,et al.  Optimal Control for training: The missing link between Hidden Markov Models and Connectionist Networks , 1990 .

[140]  Luís B. Almeida,et al.  Speeding up Backpropagation , 1990 .

[141]  L. B. Almeida A learning rule for asynchronous perceptrons with feedback in a combinatorial environment , 1990 .

[142]  Patrick Kenny,et al.  A linear predictive HMM for vector-valued observations with applications to speech recognition , 1990, IEEE Trans. Acoust. Speech Signal Process..

[143]  Hervé Bourlard,et al.  Continuous speech recognition on the resource management database using connectionist probability estimation , 1990, ICSLP.

[144]  Ronald A. Cole,et al.  Spoken Letter Recognition , 1990, HLT.

[145]  Xuedong Huang,et al.  Semi-continuous hidden Markov models for speech signals , 1990 .

[146]  Jeff A. Bilmes,et al.  The RAP: a ring array processor for layered network calculations , 1990, [1990] Proceedings of the International Conference on Application Specific Array Processors.

[147]  Amro El-Jaroudi,et al.  A new error criterion for posterior probability estimation with neural nets , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[148]  Hisashi Wakita,et al.  Neural predictive hidden Markov model , 1990, ICSLP.

[149]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[150]  Kai-Fu Lee,et al.  Context-independent phonetic hidden Markov models for speaker-independent continuous speech recognition , 1990 .

[151]  Brian Hanson,et al.  Robust speaker-independent word recognition using static, dynamic and acceleration features: experiments with Lombard and noisy speech , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[152]  John S. Bridle,et al.  Alpha-nets: A recurrent 'neural' network architecture with a hidden Markov model interpretation , 1990, Speech Commun..

[153]  Hervé Bourlard,et al.  Continuous speech recognition using multilayer perceptrons with hidden Markov models , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[154]  Alex Waibel,et al.  Large vocabulary recognition using linked predictive neural networks , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[155]  Michael I. Jordan Attractor dynamics and parallelism in a connectionist sequential machine , 1990 .

[156]  Geoffrey E. Hinton,et al.  A time-delay neural network architecture for isolated word recognition , 1990, Neural Networks.

[157]  Frédéric Bimbot,et al.  Speech processing and recognition using integrated neurocomputing techniques (Esprit Basic Research Action 3228: SPRINT) , 1990, Neurocomputing.

[158]  Sadaoki Furui,et al.  Line spectrum pair frequency - based distance measures for speech recognition , 1990, ICSLP.

[159]  H. Bourlard,et al.  Links Between Markov Models and Multilayer Perceptrons , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[160]  A. Waibel,et al.  Connectionist Viterbi training: a new hybrid method for continuous speech recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[161]  Douglas B. Paul,et al.  The Lincoln tied-mixture HMM continuous speech recognizer , 1990, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[162]  Alex Waibel,et al.  Integrating time alignment and neural networks for high performance continuous speech recognition , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[163]  Hynek Hermansky,et al.  Continuous speech recognition using PLP analysis with multilayer perceptrons , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[164]  Richard M. Schwartz,et al.  Continuous speech recognition using segmental neural nets , 1991 .

[165]  Alex Waibel,et al.  Continuous speech recognition using linked predictive neural networks , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[166]  Hervé Bourlard,et al.  Probability estimation by feed-forward networks in continuous speech recognition , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[167]  Shigeki Sagayama,et al.  Phoneme recognition by phoneme filter neural networks , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[168]  R.A. Cole,et al.  Speaker-independent name retrieval from spellings using a database of 50000 names , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[169]  Ken-ichi Iso,et al.  Large vocabulary speech recognition using neural prediction model , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[170]  James K. Baker,et al.  On the interaction between true source, training, and testing language models , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[171]  Frank Fallside,et al.  A recurrent error propagation network speech recognition system , 1991 .

[172]  Li Deng,et al.  Neural-network architecture for linear and nonlinear predictive hidden Markov models: application to speech recognition , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[173]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[174]  J. S. Bridle,et al.  An Alphanet approach to optimising input transformations for continuous speech recognition , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[175]  Christopher L. Scofield,et al.  Neural networks and speech processing , 1991, The Kluwer international series in engineering and computer science.

[176]  Sadaoki Furui Recent advances in speech recognition , 1991, EUROSPEECH.

[177]  Shigeru Katagiri,et al.  LVQ-based shift-tolerant phoneme recognition , 1991, IEEE Trans. Signal Process..

[178]  B. Townshend,et al.  Nonlinear prediction of speech , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[179]  Naftali Z. Tisby On the application of mixture AR hidden Markov models to text independent speaker recognition , 1991, IEEE Trans. Signal Process..

[180]  Steve Renals,et al.  Connectionist probability estimation in the DECIPHER speech recognition system , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[181]  Alexander H. Waibel,et al.  Integrated phoneme and function word architecture of hidden control neural networks for continuous speech recognition , 1992, Speech Commun..

[182]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[183]  Philip C. Woodland,et al.  Hidden Markov models using vector linear prediction and discriminative output distributions , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[184]  Jeff A. Bilmes,et al.  The Ring Array Processor: A Multiprocessing Peripheral for Connection Applications , 1992, J. Parallel Distributed Comput..

[185]  Horacio Franco,et al.  Hybrid neural network/hidden Markov model continuous-speech recognition , 1992, ICSLP.

[186]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[187]  John Wawrzynek,et al.  SPERT: a VLIW/SIMD microprocessor for artificial neural network computations , 1992, [1992] Proceedings of the International Conference on Application Specific Array Processors.

[188]  Li Deng,et al.  A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal , 1992, Signal Process..

[189]  Hervé Bourlard,et al.  Factoring Networks by a Statistical Method , 1992, Neural Computation.

[190]  Yoshua Bengio,et al.  Global optimization of a neural network-hidden Markov model hybrid , 1992, IEEE Trans. Neural Networks.

[191]  Douglas B. Paul,et al.  The Lincoln large-vocabulary stack-decoder HMM CSR , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[192]  Esther Levin Hidden control neural architecture modeling of nonlinear time varying systems and its applications , 1993, IEEE Trans. Neural Networks.

[193]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[194]  Marco Saerens,et al.  Linear and nonlinear prediction for speech recognition with hidden Markov models , 1993, EUROSPEECH.

[195]  Anastasios N. Venetsanopoulos,et al.  Artificial neural networks - learning algorithms, performance evaluation, and applications , 1992, The Kluwer international series in engineering and computer science.

[196]  Marco Saerens,et al.  A continuous-time dynamic formulation of Viterbi algorithm for one-Gaussian-per-state hidden Markov models , 1993, Speech Commun..