Lattice Parsing to Integrate Speech Recognition and Rule-Based Machine Translation

In this paper, we present a novel approach to integrate speech recognition and rule-based machine translation by lattice parsing. The presented approach is hybrid in two senses. First, it combines structural and statistical methods for language modeling task. Second, it employs a chart parser which utilizes manually created syntax rules in addition to scores obtained after statistical processing during speech recognition. The employed chart parser is a unification-based active chart parser. It can parse word graphs by using a mixed strategy instead of being bottom-up or top-down only. The results are reported based on word error rate on the NIST HUB-1 word-lattices. The presented approach is implemented and compared with other syntactic language modeling techniques.

[1]  Hermann Ney,et al.  Dynamic programming parsing for context-free grammars in continuous speech recognition , 1991, IEEE Trans. Signal Process..

[2]  Mark Johnson,et al.  Language modeling using efficient best-first bottom-up parsing , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[3]  Lin-Shan Lee,et al.  An Augmented Chart Data Structure with Efficient Word Lattice Parsing Scheme In Speech Recognition Applications , 1990, COLING.

[4]  Lin-Shan Lee,et al.  A best-first language processing model integrating the unification grammar and Markov language model for speech recognition applications , 1993, IEEE Trans. Speech Audio Process..

[5]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[6]  M. Baltin,et al.  The Mental representation of grammatical relations , 1985 .

[7]  Smaranda Muresan,et al.  Generalizing Word Lattice Translation , 2008, ACL.

[8]  M. Tomita,et al.  An efficient word lattice parsing algorithm for continuous speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Ciprian Chelba,et al.  Exploiting Syntactic Structure for Natural Language Modeling , 2000, ArXiv.

[10]  Brian Roark,et al.  Probabilistic Top-Down Parsing and Language Modeling , 2001, CL.

[11]  Eugene Charniak,et al.  Immediate-Head Parsing for Language Models , 2001, ACL.

[12]  Mark Johnson,et al.  Attention Shifting for Parsing Speech , 2004, ACL.

[13]  Jonathan G. Fiscus,et al.  1993 Benchmark Tests for the ARPA Spoken Language Program , 1994, HLT.

[14]  Peng Xu,et al.  A Study on Richer Syntactic Dependencies for Structured Language Modeling , 2002, ACL.

[15]  Annedore Paeseler Modification of Earley's algorithm for speech recognition , 1988 .

[16]  Yen-Lu Chow Salim RoJor SPEECH UNDERSTANDING USING A UNIFICATION GRAMMAR , 1989 .

[17]  Mehryar Mohri,et al.  A Rational Design for a Weighted Finite-State Transducer Library , 1997, Workshop on Implementing Automata.

[18]  Martin Rajman,et al.  Lattice Parsing for Speech Recognition , 1999 .

[19]  Hans Ingo Weber Time Synchronous Chart Parsing of Speech Integrating Unification Grammars with Statistics , 1994 .

[20]  Andreas Stolcke,et al.  An Efficient Probabilistic Context-Free Parsing Algorithm that Computes Prefix Probabilities , 1994, CL.

[21]  Martin Kay,et al.  Algorithm schemata and data structures in syntactic processing , 1986 .

[22]  Mark Johnson,et al.  Best-first Word-lattice Parsing: Techniques for integrated syntactic language modeling , 2005 .