Conference of the European Chapter of the Association for Computational Linguistics

In this paper, we introduce a WordNet-based measure of semantic relatedness by combining the structure and content of WordNet with co-occurrence information derived from raw text. We use the co-occurrence information along with the WordNet definitions to build gloss vectors corresponding to each concept in WordNet. Numeric scores of relatedness are assigned to a pair of concepts by measuring the cosine of the angle between their respective gloss vectors. We show that this measure compares favorably to other measures with respect to human judgments of semantic relatedness, and that it performs well when used in a word sense disambiguation algorithm that relies on semantic relatedness. This measure is flexible in that it can make comparisons between any two concepts without regard to their part of speech. In addition, it can be adapted to different domains, since any plain text corpus can be used to derive the co-occurrence information.
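The idea of scoring two concepts by the cosine between their gloss vectors can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the implementation described in the paper: the toy corpus, the hand-written glosses, the concept labels, the window size, and the choice to sum word co-occurrence vectors over the gloss words are all assumptions made here for illustration. No real WordNet data is used.

```python
import math
from collections import Counter, defaultdict


def cooccurrence_vectors(sentences, window=2):
    """Map each word to a Counter of the words seen within `window` positions of it."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def gloss_vector(gloss, word_vectors):
    """Build a concept vector by summing the co-occurrence vectors of its gloss words.

    Summation is an assumed aggregation; gloss words unseen in the corpus are skipped.
    """
    total = Counter()
    for word in gloss.lower().split():
        total.update(word_vectors.get(word, Counter()))
    return total


def cosine(u, v):
    """Cosine of the angle between two sparse vectors stored as Counters."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical raw text standing in for "any plain text corpus".
corpus = [
    "the bank approved the loan and held the money",
    "the river bank was covered with water and mud",
    "she deposited money at the bank for a loan",
    "water flowed along the river past the mud",
]

# Hypothetical glosses standing in for WordNet definitions.
glosses = {
    "bank_financial": "institution that accepts money and makes a loan",
    "bank_river": "sloping land beside a river or water",
}

word_vectors = cooccurrence_vectors(corpus)
concept_vectors = {name: gloss_vector(g, word_vectors) for name, g in glosses.items()}

score = cosine(concept_vectors["bank_financial"], concept_vectors["bank_river"])
print(f"relatedness(bank_financial, bank_river) = {score:.3f}")
```

Because the vectors come from co-occurrence counts rather than from a part-of-speech-specific hierarchy, the same `cosine` call applies to any pair of concepts, and swapping in a different `corpus` changes the scores without touching the rest of the code.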
