Lexique 2: A new French lexical database

In this article, we present a new lexical database for French: Lexique. In addition to classical word information such as gender, number, and grammatical category, Lexique includes a series of interesting new characteristics. First, word frequencies are based on two cues: a contemporary corpus of texts and the number of Web pages containing the word. Second, the database is split into a graphemic table with all the relevant frequencies, a table structured around lemmas (particularly interesting for the study of the inflectional family), and a table about surface frequency cues. Third, Lexique is distributed under a GNU-like license, allowing people to contribute to it. Finally, a metasearch engine, Open Lexique, has been developed so that new databases can be added very easily to the existing ones. Lexique can either be downloaded or interrogated freely from http://www.lexique.org.
