MACAON: An NLP Tool Suite for Processing Word Lattices

MACAON is a tool suite for standard NLP tasks, developed for French. It is designed to process both human-produced text and the highly ambiguous word lattices produced by NLP tools. MACAON consists of several native modules for common tasks such as tokenization, part-of-speech tagging, and syntactic parsing, all of which communicate with each other through XML files. Exchange protocols with external tools can also be defined easily. MACAON is a fast, modular, and open tool, distributed under the GNU Public License.
