A Character-based Approach to Distributional Semantic Models: Exploiting Kanji Characters for Constructing JapaneseWord Vectors

Many Japanese words are made of kanji characters, which themselves represent meanings. However traditional word-based distributional semantic models (DSMs) do not benefit from the useful semantic information of kanji characters. In this paper, we propose a method for exploiting the semantic information of kanji characters for constructing Japanese word vectors in DSMs. In the proposed method, the semantic representations of kanji characters (i.e, kanji vectors) are constructed first using the techniques of DSMs, and then word vectors are computed by combining the vectors of constituent kanji characters using vector composition methods. The evaluation experiment using a synonym identification task demonstrates that the kanji-based DSM achieves the best performance when a kanji-kanji matrix is weighted by positive pointwise mutual information and word vectors are composed by weighted multiplication. Comparison between kanji-based DSMs and word-based DSMs reveals that our kanji-based DSMs generally outperform latent semantic analysis, and also surpasses the best score word-based DSM for infrequent words comprising only frequent kanji characters. These findings clearly indicate that kanji-based DSMs are beneficial in improvement of quality of Japanese word vectors.

[1]  T. Dijkstra,et al.  The Time-Course of Lexical Activation in Japanese Morphographic Word Recognition: Evidence for a Character-Driven Processing Model , 2014, Quarterly journal of experimental psychology.

[2]  Jan Snajder,et al.  Derivational Smoothing for Syntactic Distributional Semantics , 2013, ACL.

[3]  Marco Marelli,et al.  Compositional-ly Derived Representations of Morphologically Complex Words in Distributional Semantics , 2013, ACL.

[4]  Yves Peirsman,et al.  Size matters: tight and loose context definitions in English word space models , 2008 .

[5]  Ioannis Korkontzelos,et al.  Estimating Linear Models for Compositional Distributional Semantics , 2010, COLING.

[6]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[7]  Akira Utsumi,et al.  A semantic space approach to the computational semantics of noun compounds , 2013, Natural Language Engineering.

[8]  J. Bullinaria,et al.  Extracting semantic representations from word co-occurrence statistics: A computational study , 2007, Behavior research methods.

[9]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[10]  Stephen Clark,et al.  Vector Space Models of Lexical Meaning , 2015 .

[11]  Akira Utsumi,et al.  Computational Exploration of Metaphor Comprehension Processes Using a Semantic Space Model , 2011, Cogn. Sci..

[12]  Yasushi Ogawa,et al.  A new character-based indexing method using frequency data for Japanese documents , 1995, SIGIR '95.

[13]  J. Quesada Creating Your Own LSA Spaces , 2007 .

[14]  Gabriel Recchia,et al.  More data trumps smarter algorithms: Comparing pointwise mutual information with latent semantic analysis , 2009, Behavior research methods.

[15]  Ogawa Yasushi,et al.  A new character-based indexing method using frequency data for Japanese documents , 1995, SIGIR 1995.

[16]  Akira Utsumi Exploring the Relationship between Semantic Spaces and Semantic Relations , 2010, LREC.

[17]  Marco Baroni,et al.  Frege in Space: A Program for Composition Distributional Semantics , 2014, LILT.

[18]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[19]  Dominic Widdows,et al.  Geometry and Meaning , 2004, Computational Linguistics.

[20]  Taiji Ueno,et al.  Contribution of sublexical information to word meaning: An objective approach using latent semantic analysis and corpus analysis on predicates , 2014, CogSci.

[21]  Mirella Lapata,et al.  Composition in Distributional Models of Semantics , 2010, Cogn. Sci..

[22]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[23]  W. Kintsch,et al.  High-Dimensional Semantic Space Accounts of Priming. , 2006 .