Language-UNL Coedition to Share Post-Editing among the Languages of a Multilingual Document (original French title: La coédition langue UNL pour partager la révision entre langues d'un document multilingue)

Given the growing demand for multilingual communication, it is increasingly necessary to create and maintain multilingual documents, for international companies as well as for Internet users. The main problem, however, remains that the cost of translating and revising a multilingual document grows linearly with the number of languages. To solve it, we propose to produce these multilingual documents by machine translation (MT), to share the revision work among the languages, and to revise incrementally, on demand and cooperatively. Our solution is based on a "pivot" MT system and takes up the idea of "coedition" used in some multilingual generation systems. For reasons developed in detail, UNL (Universal Networking Language) seems to be the best pivot language for such a system. In our approach, the user can not only edit the text directly but also "coedit" the graph through the text. To this end, a heuristic automatically builds a fine-grained correspondence between the text and the UNL graph, using only resources that are available free of charge for many languages (segmenters, lemmatizers, dictionaries). For each text fragment thus linked to the graph, a menu can be built in which each item consists of an annotation in the text and an action on the graph. The modified graph can then be deconverted into several languages, all of which benefit from the corrections made. A mockup demonstrates a scenario in which the user alternates between (monolingual) reading and coedition.
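The text-to-graph correspondence and the per-fragment menus can be pictured with a minimal sketch. It is not the heuristic of this work: the `Node` class, the `LEMMAS` and `DICTIONARY` tables, the Universal Word strings, and every function name are hypothetical stand-ins, assuming only that a segmenter, a lemmatizer, and a word-to-UW dictionary are available. Words are linked to graph nodes by a plain dictionary lookup, and each menu item pairs an annotation shown in the text with an action on the graph.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a toy UNL-like graph: a Universal Word plus attributes."""
    uw: str
    attrs: set = field(default_factory=set)


# Hypothetical toy resources standing in for a real lemmatizer and a
# word-to-UW dictionary; the entries are illustrative only.
LEMMAS = {"cats": "cat", "ate": "eat", "mice": "mouse"}
DICTIONARY = {
    "cat": "cat(icl>animal)",
    "eat": "eat(icl>do)",
    "mouse": "mouse(icl>animal)",
}


def segment(text):
    """Naive segmenter: lowercase, drop a final period, split on spaces."""
    return text.lower().rstrip(".").split()


def align(text, nodes):
    """Link word positions to node indices via a lemma -> UW lookup."""
    free = {}  # UW -> indices of nodes not yet linked to a word
    for index, node in enumerate(nodes):
        free.setdefault(node.uw, []).append(index)
    links = {}
    for pos, word in enumerate(segment(text)):
        uw = DICTIONARY.get(LEMMAS.get(word, word))
        if uw in free and free[uw]:
            links[pos] = free[uw].pop(0)
    return links


def menu_for(pos, text, links, nodes):
    """Build menu items: (annotation shown in the text, action on the graph)."""
    if pos not in links:
        return []  # this word is not linked to any node
    word = segment(text)[pos]
    node_index = links[pos]
    items = []
    for label, attr in (("plural", "@pl"), ("definite", "@def")):
        op = "remove" if attr in nodes[node_index].attrs else "add"
        items.append((f"{word}: {op} {label}", (node_index, op, attr)))
    return items


def apply_action(nodes, action):
    """Apply one graph action chosen from a menu."""
    node_index, op, attr = action
    if op == "add":
        nodes[node_index].attrs.add(attr)
    else:
        nodes[node_index].attrs.discard(attr)


# Usage: the graph lacks the plural mark on "cat"; the reader fixes it
# through the text, and the corrected graph could then be deconverted.
text = "The cats ate mice."
graph = [
    Node("eat(icl>do)", {"@entry", "@past"}),
    Node("cat(icl>animal)", {"@def"}),
    Node("mouse(icl>animal)", {"@pl"}),
]
links = align(text, graph)
menu = menu_for(1, text, links, graph)
apply_action(graph, menu[0][1])
```

Representing each action as plain data (node index, operation, attribute) rather than as a callback keeps the correction independent of the language in which it was made, which is the property that lets every target language benefit once the graph is deconverted again.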
