Comparing Lexical Relationships Observed within Japanese Collocation Data and Japanese Word Association Norms

While large-scale corpora and various corpus query tools have long been recognized as essential language resources, the value of word association norms as language resources has been largely overlooked. This paper conducts some initial comparisons of the lexical relationships observed within Japanese collocation data extracted from a large corpus using the Japanese language version of the Sketch Engine (SkE) tool (Srdanovic et al., 2008) and the relationships found within Japanese word association sets taken from the large-scale Japanese Word Association Database (JWAD) under ongoing construction by Joyce (2005, 2007). The comparison results indicate that while some relationships are common to both linguistic resources, many lexical relationships are only observed in one resource. These findings suggest that both resources are necessary in order to more adequately cover the diverse range of lexical relationships. Finally, the paper reflects briefly on the implementation of association-based word-search strategies into electronic dictionaries proposed by Zock and Bilac (2004) and Zock (2006).

[1]  Frederic Lyman Wells,et al.  A Study of Association in Insanity , 1911 .

[2]  Graeme Hirst,et al.  Ontology and the Lexicon , 2004, Handbook on Ontologies.

[3]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[4]  Joshua B. Tenenbaum,et al.  The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth , 2001, Cogn. Sci..

[5]  Michael Zock,et al.  Word Lookup on the Basis of Associations : from an Idea to a Roadmap , 2004 .

[6]  M. Moiron Proceedings of the 11th EURALEX International Congress , 2004 .

[7]  J. Deese The structure of associations in language and thought , 1966 .

[8]  R. Bailey,et al.  The computer and literary studies , 1974 .

[9]  Adam Kilgarriff,et al.  A Web Corpus and Word Sketches for Japanese , 2008 .

[10]  Adam Kilgarriff,et al.  WORD SKETCH: Extraction and Display of Signicant Collocations for Lexicography , 2000 .

[11]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[12]  Adam Kilgarriff,et al.  Lexical profiling software and its lexicographic applications: a case study , 2002 .

[13]  Terry Joyce Constructing a Large-Scale Database of Japanese Word Associations , 2005, Glottometrics.

[14]  Adam Kilgarriff,et al.  The Sketch Engine , 2004 .

[15]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[16]  Christopher R. Johnson,et al.  Background to Framenet , 2003 .

[17]  Heiner Stuckenschmidt,et al.  Handbook on Ontologies , 2004, Künstliche Intell..

[18]  Liu Yuan On Computer and Literary Studies , 2004 .

[19]  Pavel Smrz,et al.  Word Association Norms as a Unique Supplement of Traditional Language Resources , 2004, LREC.

[20]  Thomas A. Schreiber,et al.  The University of South Florida free association, rhyme, and word fragment norms , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[21]  Christopher R. Johnson,et al.  Lexicographic Relevance: Selecting Information From Corpus Evidence , 2003 .