Semi-compositional Method for Synonym Extraction of Multi-Word Terms

Automatic extraction of synonyms and semantically related words is a challenging task that is useful in many NLP applications, such as question answering, search query expansion, and text summarization. While several studies have addressed the task of word synonym extraction, only a few have tackled the problem of acquiring synonyms of multi-word terms (MWT) from specialized corpora. To extract pairs of synonymous multi-word terms, we propose in this paper an unsupervised semi-compositional method that makes use of distributional semantics and exploits the compositional property shared by most MWT. We show that our method significantly outperforms the state of the art.
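To make the general idea concrete, the following toy sketch combines distributional similarity with compositional substitution: each component word of an MWT is replaced by its distributionally closest words, and a variant is kept only if it is itself a known term. This is an illustration under assumptions, not the method evaluated in the paper: the function names (`context_vectors`, `cosine`, `candidate_synonyms`), the raw co-occurrence counts, the window size, and the tiny corpus are all hypothetical choices made for the example.

```python
import math
from collections import Counter


def context_vectors(sentences, window=2):
    """Build raw co-occurrence count vectors for every word (assumed weighting: plain counts)."""
    vectors = {}
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            ctx = vectors.setdefault(word, Counter())
            for j in range(lo, hi):
                if j != i:
                    ctx[tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def candidate_synonyms(term, vectors, known_terms, top_k=3):
    """Substitute one component word at a time by its nearest neighbours
    and keep only the variants that are attested terms."""
    words = term.split()
    candidates = set()
    for pos, word in enumerate(words):
        if word not in vectors:
            continue
        neighbours = sorted(
            (w for w in vectors if w != word),
            key=lambda w: cosine(vectors[word], vectors[w]),
            reverse=True,
        )[:top_k]
        for neighbour in neighbours:
            variant = " ".join(words[:pos] + [neighbour] + words[pos + 1:])
            if variant != term and variant in known_terms:
                candidates.add(variant)
    return sorted(candidates)


# Hypothetical two-sentence corpus and term list, for illustration only.
sentences = [
    ["the", "wind", "turbine", "produces", "electricity"],
    ["the", "wind", "generator", "produces", "electricity"],
]
known_terms = {"wind turbine", "wind generator"}
vectors = context_vectors(sentences)
print(candidate_synonyms("wind turbine", vectors, known_terms))
```

In this toy setting, "turbine" and "generator" occur in identical contexts, so substituting one for the other yields the attested term "wind generator". A realistic system would use a specialized corpus, an association measure rather than raw counts, and a term extractor to supply the list of known terms.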
