Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition

From the Publisher: This book takes an empirical approach to language processing, based on applying statistical and other machine-learning algorithms to large corpora.Methodology boxes are included in each chapter. Each chapter is built around one or more worked examples to demonstrate the main idea of the chapter. Covers the fundamental algorithms of various fields, whether originally proposed for spoken or written language to demonstrate how the same algorithm can be used for speech recognition and word-sense disambiguation. Emphasis on web and other practical applications. Emphasis on scientific evaluation. Useful as a reference for professionals in any of the areas of speech and language processing.

[1]  R. Dodge The apperception of the spoken sentence: A study in the psychology of language. , 2022 .

[2]  T. Bayes,et al.  Facsimiles of two papers by Bayes , 1941 .

[3]  D. Howes On the Relation between the Intelligibility and Frequency of Occurrence of English Words , 1957 .

[4]  C. D. Forgie,et al.  Automatic Recognition of Spoken Digits , 1958 .

[5]  P. Denes,et al.  The design and operation of the mechanical speech recognizer at University College London , 1959 .

[6]  D. B. Fry,et al.  Theoretical aspects of mechanical speech recognition , 1959 .

[7]  W. W. Bledsoe,et al.  Pattern recognition and reading by machine , 1959, IRE-AIEE-ACM '59 (Eastern).

[8]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[9]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[10]  Joseph Weizenbaum,et al.  ELIZA—a computer program for the study of natural language communication between man and machine , 1966, CACM.

[11]  F. Mosteller,et al.  Inference and Disputed Authorship: The Federalist , 1966 .

[12]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[13]  A. Oppenheim,et al.  Nonlinear filtering of multiplied and convolved signals , 1968 .

[14]  E. Schegloff Sequencing in Conversational Openings , 1968 .

[15]  T. K. Vintsyuk Speech discrimination by dynamic programming , 1968 .

[16]  F. Jelinek Fast sequential decoding algorithm using a stack , 1969 .

[17]  J. Hintikka Semantics for Propositional Attitudes , 1969 .

[18]  V. Yngve On getting a word in edgewise , 1970 .

[19]  R. M. Warren Perceptual Restoration of Missing Speech Sounds , 1970, Science.

[20]  N. G. Zagoruyko,et al.  Automatic recognition of 200 words , 1970 .

[21]  Hiroaki Sakoe,et al.  A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .

[22]  Kenneth Mark Colby,et al.  Artificial Paranoia , 1975, Artif. Intell..

[23]  Richard Fikes,et al.  STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[24]  WILLIAM MARSLEN-WILSON,et al.  Linguistic Structure and Speech Shadowing at Very Short Latencies , 1973, Nature.

[25]  R. Cole Listening for mispronunciations: A measure of what we hear during speech , 1973 .

[26]  R. Reddy Eyes and Ears for Computers , 1973 .

[27]  E. Schegloff,et al.  A simplest systematics for the organization of turn-taking for conversation , 1974 .

[28]  Lalit R. Bahl,et al.  Design of a linguistic statistical decoder for the recognition of continuous speech , 1975, IEEE Trans. Inf. Theory.

[29]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[30]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[31]  L. Mondshein,et al.  The CASPERS linguistic analysis system , 1975 .

[32]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[33]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[34]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[35]  E. Schegloff,et al.  The preference for self-correction in the organization of repair in conversation , 1977 .

[36]  Barbara J. Grosz,et al.  The representation and use of focus in dialogue understanding. , 1977 .

[37]  Daniel G. Bobrow,et al.  GUS, A Frame-Driven Dialog System , 1986, Artif. Intell..

[38]  William D Marslen-Wilson,et al.  Processing interactions and lexical access during word recognition in continuous speech , 1978, Cognitive Psychology.

[39]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[40]  D. Massaro,et al.  Integration of featural information in speech perception. , 1978, Psychological review.

[41]  H. Grice Further Notes on Logic and Conversation , 1978 .

[42]  Frank Burton,et al.  Order in Court , 1979, The Routledge Handbook of Forensic Linguistics.

[43]  C. Raymond Perrault,et al.  Analyzing Intention in Utterances , 1986, Artif. Intell..

[44]  C. Raymond Perrault,et al.  A Plan-Based Analysis of Indirect Speech Act , 1980, CL.

[45]  F Grosjean,et al.  Spoken word recognition processes and the gating paradigm , 1980, Perception & psychophysics.

[46]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  A. Samuel Phonemic restoration: insights from a new methodology. , 1981, Journal of experimental psychology. General.

[48]  E. Schegloff Discourse as an interactional achievement : Some uses of "Uh huh" and other things that come between sentences , 1982 .

[49]  John D. Gould,et al.  Composing letters with a simulated listening typewriter , 1982, CHI '82.

[50]  Alexander I. Rudnicky,et al.  What's new in speech perception? The research and ideas of William Chandler Bagley, 1874-1946. , 1983, Psychological review.

[51]  D W Massaro,et al.  American Psychological Association, Inc. Evaluation and Integration of Visual and Auditory Information in Speech Perception , 2022 .

[52]  A. Nadas,et al.  A decision theorectic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood , 1983 .

[53]  Clayton Lewis,et al.  Designing for usability—key principles and what designers think , 1983, CHI '83.

[54]  Dennis R. Wixon,et al.  Building a user-derived interface , 1984, CACM.

[55]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[56]  Gail Jefferson,et al.  Notes on a systematic deployment of the acknowledgement tokens “Yeah”; and “Mm Hm”; , 1984 .

[57]  L. Tyler The structure of the initial cohort: Evidence from gating , 1984, Perception & Psychophysics.

[58]  Rachel Reichman,et al.  Getting computers to talk like you and me , 1985 .

[59]  A. Salasoo,et al.  Interaction of Knowledge Sources in Spoken Word Identification. , 1985, Journal of memory and language.

[60]  S.E. Levinson,et al.  Structural methods in automatic speech recognition , 1985, Proceedings of the IEEE.

[61]  John Makhoul,et al.  Context-dependent modeling for acoustic-phonetic recognition of continuous speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[62]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[63]  Diane J. Litman,et al.  Plan recognition and discourse analysis: an integrated approach for understanding dialogues , 1986 .

[64]  Jeffrey L. Elman,et al.  Interactive processes in speech perception: the TRACE model , 1986 .

[65]  Lalit R. Bahl,et al.  Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[66]  Anne Cutler,et al.  The predominance of strong initial syllables in the English vocabulary , 1987 .

[67]  J. Pierrehumbert The phonology and phonetics of English intonation , 1987 .

[68]  James F. Allen,et al.  A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[69]  C M Connine,et al.  Interactive use of lexical information in speech perception. , 1987, Journal of experimental psychology. Human perception and performance.

[70]  Emanuel A. Schegloff,et al.  Presequences and indirection: Applying speech act theory to ordinary conversation , 1988 .

[71]  P. Mermelstein,et al.  Fast search strategy in a large vocabulary word recognizer , 1988 .

[72]  Colin Potts,et al.  Design of Everyday Things , 1988 .

[73]  Patti Price,et al.  The DARPA 1000-word resource management database for continuous speech recognition , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[74]  Anne Cutler,et al.  The role of strong syllables in segmentation for lexical access , 1988 .

[75]  Alex Waibel,et al.  Prosody and speech recognition , 1988 .

[76]  Wayne H. Ward,et al.  Modelling Non-verbal Sounds for Speech Recognition , 1989, HLT.

[77]  Victor Zue,et al.  Preliminary Evaluation of the Voyager Spoken Language System , 1989, HLT.

[78]  Stephen Cox,et al.  Some statistical issues in the comparison of speech recognition algorithms , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[79]  Herbert H. Clark,et al.  Contributing to Discourse , 1989, Cogn. Sci..

[80]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[81]  Steve Young,et al.  Token passing: a simple conceptual model for connected speech recognition systems , 1989 .

[82]  Lotfi A. Zadeh,et al.  Phonological structures for speech recognition , 1989 .

[83]  Candace L. Sidner,et al.  Models of Plans to Support Communication: An Initial Report , 1990, AAAI.

[84]  George R. Doddington,et al.  The ATIS Spoken Language Systems Pilot Corpus , 1990, HLT.

[85]  J. Pierrehumbert,et al.  The Meaning of Intonational Contours in the Interpretation of Discourse , 1990 .

[86]  Li Deng,et al.  Large vocabulary word recognition using context-dependent allophonic hidden Markov models☆ , 1990 .

[87]  Marilyn A. Walker,et al.  Mixed Initiative in Dialogue: An Investigation into Discourse Segmentation , 1990, ACL.

[88]  Dennis H. Klatt,et al.  Review of the ARPA speech understanding project , 1990 .

[89]  R. Schwartz,et al.  The N-best algorithms: an efficient and exact procedure for finding the N most likely sentence hypotheses , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[90]  Douglas B. Paul,et al.  Algorithms for an Optimal A* Search and Linearizing the Search in the Stack Decoder* , 1991, HLT.

[91]  Sandra Carberry,et al.  Plan Recognition in Natural Language Dialogue , 1990 .

[92]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[93]  Hector J. Levesque,et al.  On Acting Together , 1990, AAAI.

[94]  Renato De Mori,et al.  A Cache-Based Natural Language Model for Speech Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[95]  R. Schwartz,et al.  A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[96]  Cynthia Connine,et al.  Effects of sentence context and lexical knowledge in speech processing , 1991 .

[97]  Chin-Hui Lee,et al.  Stochastic Representation of Conceptual Structure in the ATIS Task , 1991, HLT.

[98]  Nigel Gilbert,et al.  Simulating speech systems , 1991 .

[99]  David B. Pisoni,et al.  Similarity neighborhoods of spoken words , 1991 .

[100]  Joakim Nivre,et al.  On the Semantics and Pragmatics of Linguistic Feedback , 1992, J. Semant..

[101]  Jakob Nielsen,et al.  The usability engineering life cycle , 1992, Computer.

[102]  Victor Zue,et al.  Statistical and linguistic analyses of F0 in read and spontaneous speech , 1992, ICSLP.

[103]  Julia Hirschberg,et al.  Some intonational characteristics of discourse structure , 1992, ICSLP.

[104]  Johanna D. Moore,et al.  A Problem for RST: The Need for Multi-Level Discourse Analysis , 1992, CL.

[105]  Victor Zue,et al.  Experiments in Evaluating Interactive Spoken Language Systems , 1992, HLT.

[106]  Vassilios Digalakis,et al.  Segment-based stochastic models of spectral dynamics for continuous speech recognition , 1992 .

[107]  Stephanie Seneff,et al.  TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.

[108]  Elizabeth Shriberg,et al.  Human-Machine Problem Solving Using Spoken Language Systems (SLS): Factors Affecting Performance and User Satisfaction , 1992, HLT.

[109]  Michael Picheny,et al.  A fast match for continuous speech recognition using allophonic models , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[110]  Barry Arons,et al.  VoiceNotes: a speech interface for a hand-held voice notetaker , 1993, INTERCHI.

[111]  Elmar Nöth,et al.  Prosody takes over: a prosodically guided dialog system , 1993, EUROSPEECH.

[112]  Mitch Weintraub,et al.  Large-vocabulary dictation using SRI's DECIPHER speech recognition system: progressive search techniques , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[113]  Rebecca J. Passonneau,et al.  Intention-Based Segmentation: Human Reliability and Correlation with Linguistic Cues , 1993, ACL.

[114]  H. H. Clark,et al.  On the Course of Answering Questions , 1993 .

[115]  Hervé Bourlard,et al.  Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[116]  Biing-Hwang Juang,et al.  Minimum error rate training based on N-best string models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[117]  J C Junqua,et al.  The Lombard reflex and its role on human listeners and automatic speech recognizers. , 1993, The Journal of the Acoustical Society of America.

[118]  Lynette Hirschman,et al.  The cost of errors in a spoken language system , 1993, EUROSPEECH.

[119]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[120]  Wayne H. Ward,et al.  CMLPs robust spoken language understanding system , 1993, EUROSPEECH.

[121]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[122]  Julia Hirschberg,et al.  Empirical Studies on the Disambiguation of Cue Phrases , 1993, Comput. Linguistics.

[123]  Sharon L. Oviatt,et al.  A Simulation-Based Research Strategy for Designing Complex NL Systems , 1993, HLT.

[124]  Monika Woszczyna,et al.  Inferring linguistic structure in spoken language , 1994, ICSLP.

[125]  Hermann Ney,et al.  Improvements in beam search for 10000-word continuous-speech recognition , 1994, IEEE Trans. Speech Audio Process..

[126]  Joanne L. Miller On the internal structure of phonetic categories: a progress report , 1994, Cognition.

[127]  S. J. Young,et al.  Tree-based state tying for high accuracy acoustic modelling , 1994 .

[128]  Andreas Stolcke,et al.  Multiple-pronunciation lexical modeling in a speaker independent speech understanding system , 1994, ICSLP.

[129]  Richard M. Schwartz,et al.  Hidden Understanding Models of Natural Language , 1994, ACL.

[130]  Alexander H. Waibel,et al.  Towards better language models for spontaneous speech , 1994, ICSLP.

[131]  Harry Bunt,et al.  Context and Dialogue Control , 1994 .

[132]  Ronald A. Cole,et al.  Towards automatic collection of the US census , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[133]  Wayne H. Ward,et al.  Recent Improvements in the CMU Spoken Language Understanding System , 1994, HLT.

[134]  David R. Traum,et al.  Discourse Obligations in Dialogue Processing , 1994, ACL.

[135]  Masaaki Nagata,et al.  First steps towards statistical modeling of dialogue to predict the speech act type of the next utterance , 1994, Speech Communication.

[136]  C Kamm,et al.  User interfaces for voice applications. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[137]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[138]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[139]  Jj Odell,et al.  The Use of Context in Large Vocabulary Speech Recognition , 1995 .

[140]  P.C. Woodland,et al.  The 1994 HTK large vocabulary speech recognition system , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[141]  Daniel Jurafsky,et al.  Building multiple pronunciation models for novel words using exploratory computational phonology , 1995, EUROSPEECH.

[142]  Steve Young,et al.  The HTK book , 1995 .

[143]  P R Cohen,et al.  The role of voice input for human-machine communication. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[144]  Hermann Ney,et al.  Large vocabulary continuous speech recognition using word graphs , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[145]  Gina-Anne Levow,et al.  Designing SpeechActs: issues in speech user interfaces , 1995, CHI '95.

[146]  Mosur Ravishankar,et al.  Efficient Algorithms for Speech Recognition. , 1996 .

[147]  P. Kidwell,et al.  The trouble with computers: Usefulness, usability and productivity , 1996, IEEE Annals of the History of Computing.

[148]  Richard Sproat,et al.  Compilation of Weighted Finite-State Transducers from Decision Trees , 1996, ACL.

[149]  Norbert Reithinger,et al.  Predicting dialogue acts for a speech-to-speech translation system , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[150]  Yves Normandin Maximum Mutual Information Estimation of Hidden Markov Models , 1996 .

[151]  Julia Hirschberg,et al.  A Prosodic Analysis of Discourse Segments in Direction-Giving Monologues , 1996, ACL.

[152]  Cecilia E. Ford,et al.  Interaction and grammar: Interactional units in conversation: syntactic, intonational, and pragmatic resources for the management of turns , 1996 .

[153]  Andreas Stolcke,et al.  Automatic linguistic segmentation of conversational speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[154]  Mari Ostendorf,et al.  From HMM's to segment models: a unified view of stochastic modeling for speech recognition , 1996, IEEE Trans. Speech Audio Process..

[155]  Elmar Nöth,et al.  Dialog act classification with the help of prosody , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[156]  Kate Hunicke-Smith,et al.  Effect of Speaking Style on LVCSR Performance , 1996 .

[157]  Morena Danieli,et al.  Metrics for Evaluating Dialogue Strategies in a Spoken Language System , 1996, ArXiv.

[158]  Barbara A. Fox,et al.  Practices in the Construction of Turns: The "TCU" Revisited , 1996 .

[159]  Rukmini Iyer,et al.  Modeling Conversational Speech for Speech Recognition , 1996, EMNLP.

[160]  Kenji Kita,et al.  Automatic acquisition of probabilistic dialogue models , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[161]  Richard M. Schwartz,et al.  A Fully Statistical Approach to Natural Language Interfaces , 1996, ACL.

[162]  D. Massaro Perceiving talking faces: from speech perception to a behavioral principle , 1999 .

[163]  A. Stolcke,et al.  Automatic detection of discourse structure for speech recognition and understanding , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[164]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[165]  Ronald A. Cole,et al.  Experiments with a spoken dialogue system for taking the US census , 1997, Speech Commun..

[166]  Gwyneth Doherty-Sneddon,et al.  The Reliability of a Dialogue Structure Coding Scheme , 1997, CL.

[167]  Marilyn A. Walker,et al.  Standards for Dialogue Coding in Natural Language Processing , 1997 .

[168]  Norbert Reithinger,et al.  Dialogue act classification using language models , 1997, EUROSPEECH.

[169]  Kate Knill,et al.  Hidden Markov Models in Speech and Language Processing , 1997 .

[170]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[171]  Ronnie W. Smith,et al.  Effects of Variable Initiative on Linguistic Behavior in Human-Computer Spoken Natural Language Dialogue , 1997, Comput. Linguistics.

[172]  Elmar Nöth,et al.  Integrated dialog act segmentation and classification using prosodic features and language models , 1997, EUROSPEECH.

[173]  Jennifer Chu-Carroll,et al.  Tracking Initiative in Collaborative Dialogue Interactions , 1997, ACL.

[174]  Hermann Ney,et al.  A word graph algorithm for large vocabulary continuous speech recognition , 1994, Comput. Speech Lang..

[175]  Jennifer Chu-Carroll,et al.  Collaborative Response Generation in Planning Dialogues , 1998, Comput. Linguistics.

[176]  Karen E. Lochbaum,et al.  A Collaborative Planning Model of Intentional Structure , 1998, CL.

[177]  Geoffrey Zweig,et al.  Speech Recognition with Dynamic Bayesian Networks , 1998, AAAI/IAAI.

[178]  Ken Samuel,et al.  Dialogue Act Tagging with Transformation-Based Learning , 1998, ACL.

[179]  Andreas Stolcke,et al.  Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.

[180]  William D. Raymond,et al.  Reduction of English function words in switchboard , 1998, ICSLP.

[181]  Dan Jurafsky,et al.  Dialog Act Modeling for Conversational Speech , 1998 .

[182]  P Taylor,et al.  Intonation and dialogue context as constraints for speech recognition , 1998 .

[183]  Sharon L. Oviatt,et al.  The efficiency of multimodal interaction: a case study , 1998, ICSLP.

[184]  Marilyn A. Walker,et al.  Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email , 1998, COLING-ACL.

[185]  Daniel Jurafsky,et al.  Towards better integration of semantic predictors in statistical language modeling , 1998, ICSLP.

[186]  Sharon L. Oviatt,et al.  Predicting hyperarticulate speech during human-computer error resolution , 1998, Speech Commun..

[187]  Jennifer Chu-Carroll,et al.  A Statistical Model for Discourse Act Recognition in Dialogue Interactions , 1998 .

[188]  Marilyn A. Walker,et al.  Automatic Detection of Poor Speech Recognition at the Dialogue Level , 1999, ACL.

[189]  Partha Niyogi,et al.  Distinctive feature detection using support vector machines , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[190]  Emiel Krahmer,et al.  Error spotting in human-machine interaction , 1999 .

[191]  Harriet J. Nock,et al.  Pronunciation modeling by sharing gaussian densities across phonetic models , 1999, EUROSPEECH.

[192]  Jerome R. Bellegarda,et al.  Speech recognition experiments using multi-span statistical language models , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[193]  Mark G. Core,et al.  The Report of The Third Workshop of the Discourse Resource Initiative, Chiba University and Kazusa Academia Hall , 1999 .

[194]  Richard M. Schwartz,et al.  Single-tree method for grammar-directed search , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[195]  Eric Fosler-Lussier,et al.  Multi-level decision trees for static and dynamic pronunciation models , 1999, EUROSPEECH.

[196]  James F. Allen,et al.  Speech repains, intonational phrases, and discourse markers: modeling speakers’ utterances in spoken dialogue , 1999, CL.

[197]  Lou Boves,et al.  Incorporating confidence measures in the Dutch train timetable information system developed in the ARISE project , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[198]  Bob Carpenter,et al.  Vector-based Natural Language Call Routing , 1999, Comput. Linguistics.

[199]  木村 和夫 Pragmatics , 1997, Language Teaching.

[200]  Alexander I. Rudnicky,et al.  Task-based dialog management using an agenda , 2000 .

[201]  Jens Allwood,et al.  An activity-based approach to pragmatics , 2000, Abduction, Belief and Context in Dialogue.

[202]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[203]  Julia Hirschberg,et al.  Predicting Automatic Speech Recognition Performance Using Prosodic Cues , 2000, ANLP.

[204]  Chao Huang,et al.  Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition , 2000, INTERSPEECH.

[205]  Julia Hirschberg,et al.  Corrections in spoken dialogue systems , 2000, INTERSPEECH.

[206]  Marilyn A. Walker,et al.  Towards developing general models of usability with PARADISE , 2000, Natural Language Engineering.

[207]  Marilyn A. Walker,et al.  Learning to Predict Problematic Situations in a Spoken Dialogue System: Experiments with How May I Help You? , 2000, ANLP.

[208]  Roberto Pieraccini,et al.  A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[209]  Harry Bunt,et al.  The ABC of Computational Pragmatics , 2000, Abduction, Belief and Context in Dialogue.

[210]  Gunnar Evermann,et al.  Large vocabulary decoding and confidence estimation using word posterior probabilities , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[211]  Scott Miller,et al.  A Novel Use of Statistical Parsing to Extract Information from Text , 2000, ANLP.

[212]  Andreas Stolcke,et al.  Finding consensus in speech recognition: word error minimization and other applications of confusion networks , 2000, Comput. Speech Lang..

[213]  Alex Waibel,et al.  Adaptation Methods For Non-Native Speech , 2001 .

[214]  Rubén San-Segundo-Hernández,et al.  Designing Confirmation Mechanisms and Error Recover Techniques in a Railway Information System for Spanish , 2001, SIGDIAL Workshop.

[215]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[216]  James F. Allen,et al.  An architecture for more realistic conversational systems , 2001, IUI '01.

[217]  Philip C. Woodland Speaker adaptation for continuous density HMMs: a review , 2001 .

[218]  George R. Doddington,et al.  Speaker recognition based on idiolectal differences between speakers , 2001, INTERSPEECH.

[219]  Xiuyang Yu,et al.  What kind of pronunciation variation is hard for triphones to model? , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[220]  Julia Hirschberg,et al.  Identifying User Corrections Automatically in Spoken Dialogue Systems , 2001, NAACL.

[221]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[222]  Daniel Povey,et al.  New features in the CU-HTK system for transcription of conversational telephone speech , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[223]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[224]  Daniel Povey,et al.  Large scale discriminative training of hidden Markov models for speech recognition , 2002, Comput. Speech Lang..

[225]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[226]  Shankar Kumar,et al.  Risk based lattice cutting for segmental minimum Bayes-risk decoding , 2002, INTERSPEECH.

[227]  Marilyn A. Walker,et al.  Spoken language generation , 2002, Comput. Speech Lang..

[228]  Thomas Hain,et al.  IMPLICIT PRONUNCIATION MODELLING IN ASR , 2002 .

[229]  S. Singh,et al.  Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System , 2011, J. Artif. Intell. Res..

[230]  Gina-Anne Levow Characterizing and Recognizing Spoken Corrections in Human-Computer Dialogue , 1998, COLING.