Multi-parser architecture for query processing

Natural language queries provide a natural means for common people to interact with computers and access to on-line information. Due to the complexity of natural language, the traditional way of using a single grammar for a single language parser leads to an inefficient, fragile, and often very large language processing system. Multi-Parser Architecture (MPA) intends to alleviate these problems, and the modularized MPA also has the advantage of easier portability to new domains and distributed computing. In this paper, we investigate the effect of using different types of parsers on different types of query data in MPA. Three data sets and two types of sub-parsers have been examined. Results show that partitioning grammars leads to superior speed performance for the Earley parser across the three data sets. The GLR parser is faster than the Earley parser for the partitioned grammar, but the GLR parser may have excessive memory usage for the un-partitioned grammars.

[1]  Tobias Ruland,et al.  Making the most of multiplicity: a multi-parser multi-strategy architecture for the robust processing of spoken language , 1998, ICSLP.

[2]  Fuliang Weng,et al.  Handling Syntactic Extra-Grammaticality , 1993, IWPT.

[3]  Helen M. Meng,et al.  Grammar partitioning and parser composition for natural language understanding , 2000, INTERSPEECH.

[4]  Helen M. Meng,et al.  Semi-automatic acquisition of domain-specific semantic structures , 1999, EUROSPEECH.

[5]  A. J. Korenjak A practical method for constructing LR (k) processors , 1969, CACM.

[6]  Steven Abney,et al.  Parsing By Chunks , 1991 .

[7]  Finn Dag Buø,et al.  JANUS 93: towards spontaneous speech translation , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Helen Meng,et al.  Parsing a Lattice with Multiple Grammars , 2000, IWPT.

[9]  Andreas Stolcke,et al.  Partitioning Grammars and Composing Parsers , 1995, IWPT.

[10]  Masaru Tomita,et al.  Efficient parsing for natural language , 1985 .

[11]  Robert C. Moore,et al.  Improved Left-corner Chart Parsing for Large Context-free Grammars , 2000, IWPT.

[12]  Robert C. Berwick,et al.  Principle-Based Parsing: Computation and Psycholinguistics , 1991 .

[13]  Masaru Tonlita,et al.  An Efficient Word Lattice Parsing Algorithm for Continuous Speech Recognition , 1986 .

[14]  François Andry,et al.  Interleaving Syntax and Semantics in an Effecient Bottom-Up Parser , 1994, ACL.

[15]  Jan W. Amtrup Parallel Parsing: Different Distribution Schemata for Charts , 1995, IWPT.