An Efficient Natural Language Processing System Specially Designed for the Chinese Language

This paper presents an efficient natural language processing system specially designed for the Chinese language. The core of the system is a bottom-up chart parser with head-driven operation; i.e., phrases are built up by starting with their heads and adjoining constituents to the left or right of the heads, instead of proceeding strictly from left to right. In this way many unnecessary searching actions can be eliminated. The system also includes several techniques for efficiency: a direction-selective chart to simplify the control of the head-driven operation; a heuristic scheduling policy and a bidirectional look-ahead approach to eliminate further unnecessary searching actions; and an improved raise-bind mechanism combined with check rules to treat the difficult problems of movement transformations and empty categories and to simplify the design of grammar rules. The design is based on careful consideration of some special syntactic phenomena of the Chinese language, such as head-final and head-initial structures and empty categories. A prototype of the system has been implemented and extensive experiments have been performed. The test results show a significant improvement in efficiency when processing many very complicated Chinese sentences. The paper discusses the various approaches, the overall system design, and the experimental results in detail.
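The head-driven, bottom-up idea described above can be illustrated with a minimal sketch. This is not the system presented in the paper: the toy grammar, the romanized lexicon, and the names `RULES`, `LEXICON`, and `parse` are all assumptions made for illustration, and the direction-selective chart, scheduling heuristics, look-ahead, and raise-bind mechanism are not modeled. The sketch only shows a phrase being seeded from its head and then grown by adjoining complete constituents on either side.

```python
from collections import deque

# Hypothetical toy grammar: (parent, children, index of the head child).
RULES = [
    ("S", ("NP", "VP"), 1),   # head-final: VP is the head, NP adjoins on its left
    ("VP", ("V", "NP"), 0),   # head-initial: V is the head, NP adjoins on its right
    ("NP", ("N",), 0),
]
# Hypothetical romanized lexicon: wo = "I", xihuan = "like", shu = "book".
LEXICON = {"wo": "N", "xihuan": "V", "shu": "N"}


def parse(words, goal="S"):
    """Return True if the whole word sequence can be analysed as `goal`."""
    complete = set()  # (category, start, end)
    partial = set()   # (rule, lo, hi, start, end): children[lo:hi] cover start..end
    agenda = deque(("C", (LEXICON[w], i, i + 1)) for i, w in enumerate(words))

    def propose(rule, lo, hi, start, end):
        parent, children, _ = RULES[rule]
        if lo == 0 and hi == len(children):
            agenda.append(("C", (parent, start, end)))   # all children found
        else:
            agenda.append(("P", (rule, lo, hi, start, end)))

    def adjoin(edge, constituent):
        rule, lo, hi, p_start, p_end = edge
        cat, start, end = constituent
        children = RULES[rule][1]
        if lo > 0 and children[lo - 1] == cat and end == p_start:
            propose(rule, lo - 1, hi, start, p_end)      # grow to the left
        if hi < len(children) and children[hi] == cat and start == p_end:
            propose(rule, lo, hi + 1, p_start, end)      # grow to the right

    while agenda:
        kind, item = agenda.popleft()
        if kind == "C":
            if item in complete:
                continue
            complete.add(item)
            cat, start, end = item
            # Seed a new phrase only from a rule whose head is this category.
            for rule, (_, children, head) in enumerate(RULES):
                if children[head] == cat:
                    propose(rule, head, head + 1, start, end)
            for edge in list(partial):
                adjoin(edge, item)
        else:
            if item in partial:
                continue
            partial.add(item)
            for constituent in list(complete):
                adjoin(item, constituent)

    return (goal, 0, len(words)) in complete
```

For example, `parse(["wo", "xihuan", "shu"])` succeeds under this toy grammar, while `parse(["xihuan", "wo"])` does not, because no NP is available to the left of the VP head. Because a rule is only activated once its head has been found, rules whose heads never appear in the input are never tried, which is the source of the saved search effort in this simplified setting.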
