Automatic Domain Adaptation for Word Sense Disambiguation Based on Comparison of Multiple Classifiers

Domain adaptation (DA), which involves adapting a classifier developed from source to target data, has been studied intensively in recent years. However, when DA for word sense disambiguation (WSD) was carried out, the optimal DA method varied according to the properties of the source and target data. This paper proposes automatic DA based on comparing the degrees of confidence of multiple classifiers for each instance. We compared three classifiers for three DA methods, where 1) a classifier was trained with a small amount of target data that was randomly selected and manually labeled but without source data, 2) a classifier was trained with source data and a small amount of target data that was randomly selected and manually labeled, and 3) a classifier was trained with selected source data that were sufficiently similar to the target data and a small amount of target data that was randomly selected and manually labeled. We used the method whose degree of confidence was the highest for each instance when Japanese WSD was carried out. The average accuracy of WSD when the DA methods that were determined automatically were used was significantly higher than when the original methods were used collectively.

[1]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[2]  Deepak S. Turaga,et al.  Cross domain distribution adaptation via kernel mapping , 2009, KDD.

[3]  Hitoshi Isahara,et al.  The RWC text databases , 1998, LREC.

[4]  Eneko Agirre,et al.  Supervised Domain Adaption for WSD , 2009, EACL.

[5]  Daumé,et al.  Frustratingly Easy Semi-Supervised Domain Adaptation , 2010 .

[6]  Hwee Tou Ng,et al.  Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation , 2006, ACL.

[7]  Manabu Okumura,et al.  Automatic Determination of a Domain Adaptation Method for Word Sense Disambiguation Using Decision Tree Learning , 2011, IJCNLP.

[8]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[9]  Eugene Charniak,et al.  Automatic Domain Adaptation for Parsing , 2010, NAACL.

[10]  Pushpak Bhattacharyya,et al.  Domain-Specific Word Sense Disambiguation combining corpus based and wordnet based parameters , 2009 .

[11]  Walter Daelemans,et al.  Using Domain Similarity for Performance Estimation , 2010, ACL 2010.

[12]  Eneko Agirre,et al.  On Robustness and Domain Adaptation using SVD for Word Sense Disambiguation , 2008, COLING.

[13]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[14]  ChengXiang Zhai,et al.  Instance Weighting for Domain Adaptation in NLP , 2007, ACL.

[15]  Manabu Okumura,et al.  Automatic Selection of Domain Adaptation Method for WSD using Decision Tree Learning , 2012, Journal of Natural Language Processing.

[16]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[17]  Gokhan Tur,et al.  Co-adaptation: Adaptive co-training for semi-supervised learning , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[19]  Kikuo Maekawa,et al.  Balanced corpus of contemporary written Japanese , 2013, Language Resources and Evaluation.

[20]  Hwee Tou Ng,et al.  Domain Adaptation with Active Learning for Word Sense Disambiguation , 2007, ACL.