A Scalable Architecture for Bilingual Lexicography

SABLE is a Scalable Architecture for Bilingual LExicography. It is designed to produce clean broad-coverage translation lexicons from raw, unaligned parallel texts. Its black-box functionality makes it suitable for naive users. The architecture has been implemented for different language pairs, and has been tested on very large and noisy input. SABLE does not rely on language-specific resources such as part-of-speech taggers, but it can take advantage of them when they are available. Comments University of Pennsylvania Department of Computer and Information Science Technical Report No. MSCIS-97-01. This technical report is available at ScholarlyCommons: http://repository.upenn.edu/cis_reports/209 A Scalable Architecture for Bilingual Lexicography

[1]  I. Dan Melamed Porting SIMR to New Language Pairs , 1996 .

[2]  I. Dan Melamed,et al.  Automatic Evaluation and Uniform Filter Cascades for Inducing N-Best Translation Lexicons , 1995, VLC@ACL.

[3]  Philip Resnik,et al.  Semi-Automatic Acquisition of Domain-Specific Translation Lexicons , 1997, ANLP.

[4]  I. Dan Melamed Automatic Construction of Clean Broad-Coverage Translation Lexicons , 1996, AMTA.

[5]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[6]  I. Dan Melamed A Word-to-Word Model of Translational Equivalence , 1997, ACL.

[7]  Kenneth Ward Church,et al.  Identifying Word Correspondences in Parallel Texts , 1991, HLT.

[8]  Kenneth Ward Church Char_align: A Program for Aligning Parallel Texts at the Character Level , 1993, ACL.

[9]  I. Dan Melamed A Portable Algorithm for Mapping Bitext Correspondence , 1997, ACL.

[10]  Elliott Macklovitch Peut-on vérifier automatiquement la cohérence terminologique ? , 1996 .

[11]  Dania Egedi,et al.  A Freely Available Wide Coverage Morphological Analyzer for English , 1992, COLING.

[12]  Dania Egedi,et al.  A freely available wide coverage morphological analyzer for English , 1992, COLING 1992.

[13]  Michel Simard,et al.  Using cognates to align sentences in bilingual corpora , 1993, TMI.

[14]  Kenneth Ward Church,et al.  Identifying word correspondence in parallel texts , 1991 .

[15]  I. Dan Melamed,et al.  A Geometric Approach to Mapping Bitext Correspondence , 1996, EMNLP.