Directional Distributional Similarity for Lexical Expansion

Distributional word similarity is most commonly perceived as a symmetric relation. Yet, one of its major applications is lexical expansion, which is generally asymmetric. This paper investigates the nature of directional (asymmetric) similarity measures, which aim to quantify distributional feature inclusion. We identify desired properties of such measures, specify a particular one based on averaged precision, and demonstrate the empirical benefit of directional measures for expansion.

[1]  Ido Dagan,et al.  Contextual Preferences , 2008, ACL.

[2]  Dan Roth,et al.  Semantic and Logical Inference Model for Textual Entailment , 2007, ACL-PASCAL@ACL.

[3]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[4]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[5]  Takenobu Tokunaga,et al.  Combining multiple evidence from different types of thesaurus for query expansion , 1999, SIGIR '99.

[6]  Dekang Lin,et al.  Dependency-Based Evaluation of Minipar , 2003 .

[7]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[8]  Ido Dagan,et al.  Semantic Inference at the Lexical-Syntactic Level , 2007, AAAI.

[9]  Ido Dagan,et al.  Contextual word similarity and estimation from sparse data , 1995, Comput. Speech Lang..

[10]  Donald Hindle,et al.  Noun Classification From Predicate-Argument Structures , 1990, ACL.

[11]  David J. Weir,et al.  Characterising Measures of Lexical Distributional Similarity , 2004, COLING.

[12]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[13]  Lillian Lee,et al.  Measures of Distributional Similarity , 1999, ACL.

[14]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[15]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[16]  Andrew McCallum,et al.  Text Classification by Bootstrapping with Keywords, EM and Shrinkage , 1999 .

[17]  Michael E. Lesk,et al.  Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone , 1986, SIGDOC '86.

[18]  Zellig S. Harris,et al.  Mathematical structures of language , 1968, Interscience tracts in pure and applied mathematics.

[19]  Patrick Pantel,et al.  Discovering word senses from text , 2002, KDD.

[20]  Sanda M. Harabagiu,et al.  FALCON: Boosting Knowledge for Answer Engines , 2000, TREC.

[21]  David J. Weir,et al.  A General Framework for Distributional Similarity , 2003, EMNLP.

[22]  Patrick Pantel,et al.  LEDIR: An Unsupervised Algorithm for Learning Directionality of Inference Rules , 2007, EMNLP.

[23]  Ellen M. Voorhees,et al.  The seventh text REtrieval conference (TREC-7) , 1999 .

[24]  Ido Dagan,et al.  Learning Entailment Rules for Unary Templates , 2008, COLING.

[25]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[26]  Philip S. Yu,et al.  Text Classification by Labeling Words , 2004, AAAI.

[27]  Lillian Lee,et al.  Similarity-Based Approaches to Natural Language Processing , 1997, ArXiv.

[28]  Carlo Strapparava,et al.  Direct Word Sense Matching for Lexical Substitution , 2006, ACL.

[29]  Gerda Ruge,et al.  Experiments on Linguistically-Based Term Associations , 1992, Inf. Process. Manag..

[30]  Ido Dagan,et al.  Evaluating the Inferential Utility of Lexical-Semantic Resources , 2009, EACL.

[31]  Ido Dagan,et al.  Lexical Reference: a Semantic Matching Subtask , 2006, EMNLP.

[32]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[33]  Ido Dagan,et al.  Integrating Pattern-Based and Distributional Similarity Methods for Lexical Entailment Acquisition , 2006, ACL.

[34]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[35]  Ido Dagan,et al.  Instance-based Evaluation of Entailment Rule Acquisition , 2007, ACL.

[36]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[37]  Ido Dagan,et al.  The Distributional Inclusion Hypotheses and Lexical Entailment , 2005, ACL.

[38]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[39]  Ido Dagan,et al.  Feature Vector Quality and Distributional Similarity , 2004, COLING.

[40]  Yorick Wilks,et al.  Book Reviews: Electric Words: Dictionaries, Computers, and Meanings , 1996, CL.

[41]  Patrick Pantel,et al.  Automatically Labeling Semantic Classes , 2004, NAACL.

[42]  Youngjoong Ko,et al.  Learning with Unlabeled Data for Text Categorization Using a Bootstrapping and a Feature Projection Technique , 2004, ACL.

[43]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[44]  Ralph Grishman,et al.  Discovery Procedures for Sublanguage Selectional Patterns: Initial Experiments , 1986, Comput. Linguistics.