ParaLS: Lexical Substitution via Pretrained Paraphraser

Lexical substitution (LS) aims at finding appropriate substitutes for a target word in a sentence. Recently, LS methods based on pretrained language models have made remarkable progress, generating potential substitutes for a target word through analysis of its contextual surroundings. However, these methods tend to overlook the preservation of the sentence’s meaning when generating the substitutes. This study explores how to generate the substitute candidates from a paraphraser, as the generated paraphrases from a paraphraser contain variations in word choice and preserve the sentence’s meaning. Since we cannot directly generate the substitutes via commonly used decoding strategies, we propose two simple decoding strategies that focus on the variations of the target word during decoding. Experimental results show that our methods outperform state-of-the-art LS methods based on pre-trained language models on three benchmarks.

[1]  Zhecheng An,et al.  Improving Contextual Representation with Gloss Regularized Pre-training , 2022, NAACL-HLT.

[2]  Jungo Kasai,et al.  NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics , 2021, NAACL.

[3]  Qinghong Han,et al.  ConRPG: Paraphrase Generation using Contexts as Regularizer , 2021, EMNLP.

[4]  I. McKillop,et al.  LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution , 2021, ACL.

[5]  Weizhe Yuan,et al.  BARTScore: Evaluating Generated Text as Text Generation , 2021, NeurIPS.

[6]  Percy Liang,et al.  Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality , 2021, NAACL.

[7]  Yik-Cheung Tam,et al.  Cluster-based beam search for pointer-generator chatbot grounded by knowledge , 2020, Comput. Speech Lang..

[8]  Yang Shi,et al.  Chinese Lexical Simplification , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[9]  Liqun Chen,et al.  Contextualized Perturbation for Textual Adversarial Attack , 2020, NAACL.

[10]  Alexander Panchenko,et al.  A Comparative Study of Lexical Substitution Approaches based on Neural Language Models , 2020, ArXiv.

[11]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[12]  Thibault Sellam,et al.  BLEURT: Learning Robust Metrics for Text Generation , 2020, ACL.

[13]  Matt Post,et al.  Large-Scale, Diverse, Paraphrastic Bitexts via Sampling and Clustering , 2019, CoNLL.

[14]  Xindong Wu,et al.  Unsupervised Statistical Text Simplification , 2019, IEEE Transactions on Knowledge and Data Engineering.

[15]  Ming Zhou,et al.  BERT-based Lexical Substitution , 2019, ACL.

[16]  Yiming Yang,et al.  XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.

[17]  Chris Callison-Burch,et al.  Simplification Using Paraphrases and Context-Based Lexical Substitution , 2018, NAACL.

[18]  Ashwin K. Vijayakumar,et al.  Diverse Beam Search for Improved Description of Complex Scenes , 2018, AAAI.

[19]  Kevin Gimpel,et al.  Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations , 2017, ArXiv.

[20]  Christian Biemann,et al.  Language Transfer Learning for Supervised Lexical Substitution , 2016, ACL.

[21]  Chris Callison-Burch,et al.  Simple PPDB: A Paraphrase Database for Simplification , 2016, ACL.

[22]  Kyunghyun Cho,et al.  Noisy Parallel Approximate Decoding for Conditional Recurrent Language Model , 2016, ArXiv.

[23]  Lucia Specia,et al.  Unsupervised Lexical Simplification for Non-Native Speakers , 2016, AAAI.

[24]  Chris Callison-Burch,et al.  PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification , 2015, ACL.

[25]  Omer Levy,et al.  A Simple Word Embedding Model for Lexical Substitution , 2015, VS@HLT-NAACL.

[26]  Chris Callison-Burch,et al.  PPDB: The Paraphrase Database , 2013, NAACL.

[27]  Deniz Yuret,et al.  KU: Word Sense Disambiguation by Substitution , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[28]  Rada Mihalcea,et al.  UNT: SubFinder: Combining Knowledge Sources for Automatic Lexical Substitution , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[29]  Diana McCarthy,et al.  Lexical Substitution as a Task for WSD Evaluation , 2002, SENSEVAL.

[30]  Nils J. Nilsson,et al.  A Formal Basis for the Heuristic Determination of Minimum Cost Paths , 1968, IEEE Trans. Syst. Sci. Cybern..

[31]  E. Daskalaki,et al.  CILex: An Investigation of Context Information for Lexical Substitution Methods , 2022, COLING.

[32]  Tommaso Pasini,et al.  ALaSca: an Automated approach for Large-Scale Lexical Substitution , 2021, IJCAI.

[33]  Rocco Tripodi,et al.  GeneSis: A Generative Approach to Substitutes in Context , 2021, EMNLP.

[34]  Jipeng Qiang,et al.  LSBert: Lexical Simplification Based on BERT , 2021, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[35]  Makoto Onizuka,et al.  Edit Distance Based Curriculum Learning for Paraphrase Generation , 2021, ACL.

[36]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[37]  Ido Dagan,et al.  Modeling Word Meaning in Context with Substitute Vectors , 2015, NAACL.

[38]  Stefan Thater,et al.  What Substitutes Tell Us - Analysis of an “All-Words” Lexical Substitution Corpus , 2014, EACL.

[39]  Diana McCarthy,et al.  SemEval-2007 Task 10: English Lexical Substitution Task , 2007, *SEMEVAL.