On Extracting Multiword NP Terminology for MT

This paper addresses knowledge acquisition for machine translation (MT) and describes a new hybrid methodology for the automatic extraction of multiword nominal terminology. The approach merges statistical techniques into a strongly lexicalized Constraint Grammar paradigm. It is designed to produce intelligent output while remaining computationally attractive.
