Emotions are Subtle: Learning Sentiment-Based Text Representations Using Contrastive Learning

Contrastive learning techniques have been widely used in computer vision, where representations are typically learned from augmented views of the data. In this paper, we extend contrastive learning embeddings to sentiment analysis and demonstrate that fine-tuning on these embeddings outperforms fine-tuning on BERT-based embeddings on the DynaSent benchmark. We also evaluate how our fine-tuned models perform on cross-domain benchmark datasets. Additionally, we explore upsampling techniques that produce a more balanced class distribution, yielding further improvements on our benchmark tasks.
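To make the contrastive objective concrete, the following is a minimal sketch of a SimCSE-style InfoNCE loss with in-batch negatives, written in plain NumPy for illustration. The function name, temperature value, and use of NumPy (rather than a deep-learning framework) are assumptions for exposition, not the paper's actual implementation.

```python
import numpy as np

def info_nce_loss(z1, z2, temperature=0.05):
    """SimCSE-style contrastive loss with in-batch negatives.

    z1[i] and z2[i] are two embeddings of the same sentence (a positive
    pair); every other pairing in the batch acts as a negative.
    """
    # L2-normalize rows so dot products become cosine similarities
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    sim = z1 @ z2.T / temperature  # (batch, batch) similarity matrix
    # Cross-entropy where the diagonal (matching pair) is the target class
    log_probs = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))
```

In practice the loss would be computed on encoder outputs (e.g. [CLS] vectors from a BERT-family model) and backpropagated through the encoder; this sketch only shows the objective itself.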
