论文信息 - FeasPar - a feature structure parser learning to parse spontaneous speech

FeasPar - a feature structure parser learning to parse spontaneous speech

Traditionally, automatic natural language parsing and translation have been performed with various symbolic approaches. Many of these have the advantage of a highly speciic output formalism, allowing ne-grained parse analyses and, therefore, very precise translations. Within the last decade, statistical and connectionist techniques have been proposed to learn the parsing task in order to avoid the tedious manual modeling of grammar and malformation. How to learn a detailed output representation and how to learn to parse robustly even ill-formed input, has until now remained an open question. This thesis provides an answer to this question by presenting a connectionist parser that needs a small corpus and a minimum of hand modeling, that learns, and that is robust towards spontaneous speech and speech recognizer eeects. The parser delivers feature structure parses, and has a performance comparable to a good hand modeled uniication based parser. The connectionist parser FeasPar consists of several neural networks and a Consistency Checking Search. The number of, architecture of, and other parameters of the neural networks are automatically derived from the training data. The search nds the combination of the neural net outputs that produces the most probable consistent analysis. To demonstrate learnability and robustness, FeasPar is trained with transcribed sentences from the English Spontaneous Scheduling Task and evaluated for network, overall parse, and translation performance, with transcribed and speech data. The latter contains speech recognition errors. FeasPar requires only minor human eeort and performs better or comparable to a good symbolic parser developed with a 2 year, human expert eeort. A key result is obtained by using speech data to evaluate the JANUS speech-to-speech translation system with diierent parsers. With FeasPar, acceptable translation performance is 60.5 %, versus 60.8 % with a GLR* parser. FeasPar requires two weeks of human labor to prepare the lexicon and 600 sentences of training data, whereas the GLR* parser required signiicant human expert grammar modeling. Presented in this thesis are the Chunk'n'Label Principle, showing how to divide the entire parsing tasks into several small tasks performed by neural networks , as well as the FeasPar architecture, and various methods for network performance improvement. Further, a knowledge analysis and two methods for improving the overall parsing performance are presented. Several evaluations and comparisons with a GLR* parser, producing exactly the same output formalism , illustrate FeasPar's advantages. Ausgabeformalismus produzieren, zeigen deutlich die Vorteile von FeasPar.

Finn Dag Buø

[1] Risto Miikkulainen,et al. Natural Language Processing With Modular PDP Networks and Distributed Lexicon , 1991, Cogn. Sci..

[2] Anders Krogh,et al. Introduction to the theory of neural computation , 1994, The advanced book program.

[3] Masami Suzuki,et al. A Spoken Language Translation System: SL-TRANS2 , 1992, COLING.

[4] Ajay Jain,et al. A Connectionist Parser Aimed at Spoken Language , 1989, IWPT.

[5] Alex Waibel,et al. Robust connectionist parsing of spoken language , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[6] Frederick Jelinek,et al. Basic Methods of Probabilistic Context Free Grammars , 1992 .

[7] Martin Kay,et al. Text-Translation Alignment , 1993, Comput. Linguistics.

[8] Stephen Pulman. Basic Parsing Techniques , 1992 .

[9] Emmon W. Bach,et al. Universals in Linguistic Theory , 1970 .

[10] Wayne H. Ward,et al. CMLPs robust spoken language understanding system , 1993, EUROSPEECH.

[11] R. Miikkulainen,et al. A modular neural network architecture for sequential paraphrasing of script-based stories , 1989, International 1989 Joint Conference on Neural Networks.

[12] Alexander H. Waibel,et al. Concept-based speech translation , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[13] Monika Woszczyna,et al. Integrating different learning approaches into a multilingual spoken language translation system , 1995, Learning for Natural Language Processing.

[14] John D. Lafferty,et al. Decision Tree Parsing using a Hidden Derivation Model , 1994, HLT.

[15] Geoffrey E. Hinton,et al. Learning and relearning in Boltzmann machines , 1986 .

[16] Jordan B. Pollack,et al. Massively Parallel Parsing: A Strongly Interactive Model of Natural Language Interpretation , 1988, Cogn. Sci..

[17] Ajay Naresh Jain,et al. Parsec: a connectionist learning architecture for parsing spoken language , 1992 .

[18] Stephanie Seneff,et al. TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.

[19] Jaime G. Carbonell,et al. An Efficient Interlingua Translation System for Multi-lingual Document Production , 1991, MTSUMMIT.

[20] Ajay N. Jain,et al. Generalization Performance in PARSEC - A Structured Connectionist Parsing Architecture , 1991, NIPS.

[21] Richard M. Schwartz,et al. Statistical Language Processing Using Hidden Understanding Models , 1994, HLT.

[22] Ajay N. Jain. A connectionist architecture for sequential symbolic domains , 1989 .

[23] Hans Uszkoreit. Linear precedence in discontinuous constituents : complex fronting in German , 1986 .

[24] Jaime G. Carbonell,et al. The Universal Parser Architecture for Knowledge-based Machine Translation , 1987, IJCAI.

[25] H. Uszkoreit. Constraints on order , 1986 .

[26] Ajay N. Jain,et al. Parsing Complex Sentences with Structured Connectionist Networks , 1991, Neural Computation.

[27] Masaru Tomita,et al. An Efficient Augmented-Context-Free Parsing Algorithm , 1987, Comput. Linguistics.

[28] David Stallard,et al. Syntactic/Semantic Coupling in the BBN DELPHI System , 1992, HLT.

[29] Wolfgang Minker. An English Version of the LIMSI L'ATIS System , 1995 .

[30] J. Elman. Distributed Representations, Simple Recurrent Networks, And Grammatical Structure , 1991 .

[31] Helmut Schnelle,et al. A Connectionist Parser for Context-Free Phrase Structure Grammars , 1990, ÖGAI.

[32] A. Lavie,et al. Glr* { an Eecient Noise-skipping Parsing Algorithm for Context Free Grammars , 1993 .

[33] David Stallard,et al. Fragment Processing in the DELPHI System , 1992, HLT.

[34] Robert L. Mercer,et al. Aligning Sentences in Parallel Corpora , 1991, ACL.

[35] James F. Allen. Natural language understanding , 1987, Bejnamin/Cummings series in computer science.

[36] Sandiway Fong,et al. On the Applicability of Neural Network and Machine Learning Methodologies to Natural Language Processing , 1998 .

[37] Hans Uszkoreit,et al. Word Order and Constituent Structure in German , 1987, CSLI Lecture Notes.

[38] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.

[39] Geoffrey K. Pullum,et al. Generalized Phrase Structure Grammar , 1985 .

[40] J. Cleary,et al. \self-organized Language Modeling for Speech Recognition". In , 1997 .

[41] Alexander H. Waibel,et al. Incremental Parsing by Modular Recurrent Connectionist Networks , 1989, NIPS.

[42] Klaas Sikkel,et al. A Parallel Bottom-up Tomita Parser , 1992, KONVENS.

[43] Fuliang Weng,et al. Handling Syntactic Extra-Grammaticality , 1993, IWPT.

[44] Monika Woszczyna,et al. JANUS: Speech-to-Speech Translation Using Connectionist and Non-Connectionist Techniques , 1991, NIPS.

[45] Masaru Tomita,et al. Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems , 1985 .

[46] Giorgio Satta,et al. Computation of Probabilities for an Island-Driven Parser , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[47] Wolfgang Minker,et al. A spoken language system for information retrieval , 1994, ICSLP.

[48] Geoffrey E. Hinton,et al. Distributed Representations , 1986, The Philosophy of Artificial Intelligence.

[49] Wayne H. Ward. Understanding spontaneous speech: the Phoenix system , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[50] Robert L. Mercer,et al. Dividing and Conquering Long Sentences in a Translation System , 1992, HLT.

[51] George Berg,et al. Learning Recursive Phrase Structure: Combining the Strengths of PDP and X-Bar Syntax , 1991 .

[52] Alex Waibel,et al. Testing generality in JANUS: a multi-lingual speech translation system , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[53] Alex Waibel,et al. JANUS: a speech-to-speech translation system using connectionist and symbolic processing strategies , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[54] Keiko Horiguchi,et al. Towards Spontaneous Speech Translation , 1994 .

[55] M. Baltin,et al. The Mental representation of grammatical relations , 1985 .

[56] Maike Paritong. Constituent Coordination in HPSG , 1992, KONVENS.

[57] L. Prechelt,et al. Transportable natural language interfaces for taxonomic knowledge representation systems , 1993, Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications.

[58] Christoph Schommer,et al. PAPADEUS - Parallel Parsing of Ambiguous Sentences , 1993 .

[59] Stefan Wermter,et al. Learning Fault-Tolerant Speech Parsing with SCREEN , 1994, AAAI.