Lexical Markup Framework (LMF) for NLP Multilingual Resources

Optimizing the production, maintenance and extension of lexical resources is one of the crucial aspects impacting Natural Language Processing (NLP). A second aspect involves optimizing the process leading to their integration in applications. In this respect, we believe that a consensual specification for multilingual lexicons can be a useful aid for the various NLP actors. Within ISO, one purpose of LMF (ISO-24613) is to define a standard for lexicons that covers multilingual data.
