A Review of Text Style Transfer Using Deep Learning

Style is an integral component of a sentence indicated by the choice of words a person makes. Different people have different ways of expressing themselves, however, they adjust their speaking and writing style to a social context, an audience, an interlocutor or the formality of an occasion. Text style transfer is defined as a task of adapting and/or changing the stylistic manner in which a sentence is written, while preserving the meaning of the original sentence. A systematic review of text style transfer methodologies using deep learning is presented in this paper. We point out the technological advances in deep neural networks that have been the driving force behind current successes in the fields of natural language understanding and generation. The review is structured around two key stages in the text style transfer process, namely, representation learning and sentence generation in a new style. The discussion highlights the commonalities and differences between proposed solutions as well as challenges and opportunities that are expected to direct and foster further research in the field. Impact Statement—Motivated by recent advancements in the field, we have carried out a systematic review of state-of-the-art research to highlight the trends, commonalities and differences across style transfer methodologies using deep learning. The discussion is organized around key stages of the process, namely, representation learning of style and content of a given sentence, and generation of the sentence in a new style. A comprehensive view of methodologies, available datasets and evaluation metrics is compiled to foster further research in the field.

[1]  Joshua B. Tenenbaum,et al.  Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[2]  Kilian Q. Weinberger,et al.  BERTScore: Evaluating Text Generation with BERT , 2019, ICLR.

[3]  Yann Dauphin,et al.  Convolutional Sequence to Sequence Learning , 2017, ICML.

[4]  Oladimeji Farri,et al.  Neural Paraphrase Generation with Stacked Residual LSTM Networks , 2016, COLING.

[5]  Chenghua Lin,et al.  DGST: a Dual-Generator Network for Text Style Transfer , 2020, Conference on Empirical Methods in Natural Language Processing.

[6]  Percy Liang,et al.  ExpBERT: Representation Engineering with Natural Language Explanations , 2020, ACL.

[7]  Shakir Mohamed,et al.  Variational Inference with Normalizing Flows , 2015, ICML.

[8]  Mehdi Rezagholizadeh,et al.  TextKD-GAN: Text Generation Using Knowledge Distillation and Generative Adversarial Networks , 2019, Canadian Conference on AI.

[9]  Akhilesh Sudhakar,et al.  “Transforming” Delete, Retrieve, Generate Approach for Controlled Text Style Transfer , 2019, EMNLP.

[10]  Wei Xu,et al.  Data-driven Approaches for Paraphrasing across Language Variations , 2014 .

[11]  Shlomo Argamon,et al.  Automatically Categorizing Written Texts by Author Gender , 2002, Lit. Linguistic Comput..

[12]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[13]  J. Pennebaker,et al.  Psychological aspects of natural language. use: our words, our selves. , 2003, Annual review of psychology.

[14]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[15]  Alon Lavie,et al.  Meteor Universal: Language Specific Translation Evaluation for Any Target Language , 2014, WMT@ACL.

[16]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[17]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[18]  Kewei Tu,et al.  Language Style Transfer from Sentences with Arbitrary Unknown Styles , 2018, ArXiv.

[19]  Joel R. Tetreault,et al.  Dear Sir or Madam, May I Introduce the GYAFC Dataset: Corpus, Benchmarks and Metrics for Formality Style Transfer , 2018, NAACL.

[20]  Alexander M. Rush,et al.  Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.

[21]  Erik Cambria,et al.  Popularity prediction on vacation rental websites , 2020, Neurocomputing.

[22]  Natalie S. Glance,et al.  Star Quality: Aggregating Reviews to Rank Products and Merchants , 2010, ICWSM.

[23]  Alexander M. Rush,et al.  Structured Attention Networks , 2017, ICLR.

[24]  Maosong Sun,et al.  Text Style Transfer via Learning Style Instance Supported Latent Space , 2020, IJCAI.

[25]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[26]  Keiron O'Shea,et al.  An Introduction to Convolutional Neural Networks , 2015, ArXiv.

[27]  Walter Daelemans,et al.  The Effects of Age, Gender and Region on Non-standard Linguistic Variation in Online Social Networks , 2016, ArXiv.

[28]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[29]  Lucia Specia,et al.  Personalized Machine Translation: Preserving Original Author Traits , 2016, EACL.

[30]  Tie-Yan Liu,et al.  Adversarial Neural Machine Translation , 2017, ACML.

[31]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[32]  R. Plutchik Emotions : a general psychoevolutionary theory , 1984 .

[33]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[34]  Dongyan Zhao,et al.  Style Transfer in Text: Exploration and Evaluation , 2017, AAAI.

[35]  David Yarowsky,et al.  Classifying latent user attributes in twitter , 2010, SMUC '10.

[36]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[37]  William B. Dolan,et al.  Collecting Highly Parallel Data for Paraphrase Evaluation , 2011, ACL.

[38]  Eric P. Xing,et al.  Toward Controlled Generation of Text , 2017, ICML.

[39]  Ankush Gupta,et al.  A Deep Generative Framework for Paraphrase Generation , 2017, AAAI.

[40]  Hal Daumé,et al.  Answer-based Adversarial Training for Generating Clarification Questions , 2019, NAACL.

[41]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[42]  Carol Van Ess-Dykema,et al.  The Form is the Substance: Classification of Genres in Text , 2001, HTLKM@ACL.

[43]  Bingran Liu,et al.  Neural Question Generation based on Seq2Seq , 2020, Proceedings of the 2020 5th International Conference on Mathematics and Artificial Intelligence.

[44]  E. Cambria,et al.  Predicting political sentiments of voters from Twitter in multi-party contexts , 2020, Appl. Soft Comput..

[45]  Bolei Zhou,et al.  GAN Dissection: Visualizing and Understanding Generative Adversarial Networks , 2018, ICLR.

[46]  P. Ekman An argument for basic emotions , 1992 .

[47]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[48]  Regina Barzilay,et al.  Style Transfer from Non-Parallel Text by Cross-Alignment , 2017, NIPS.

[49]  Chris Callison-Burch,et al.  Optimizing Statistical Machine Translation for Text Simplification , 2016, TACL.

[50]  Douglas Biber,et al.  Variation across speech and writing: Methodology , 1988 .

[51]  Yiming Yang,et al.  Politeness Transfer: A Tag and Generate Approach , 2020, ACL.

[52]  William Labov,et al.  Field methods of the Project on Linguistic Change and Variation , 1981 .

[53]  Shlomo Argamon,et al.  The Rest of the Story: Finding Meaning in Stylistic Variation , 2010, The Structure of Style.

[54]  Furu Wei,et al.  Formality Style Transfer with Hybrid Textual Annotations , 2019, ArXiv.

[55]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[56]  Diana Inkpen,et al.  Automatic classification of documents by formality , 2010, Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010).

[57]  Nigel Collier,et al.  Twitter Emotion Analysis in Earthquake Situations , 2013, Int. J. Comput. Linguistics Appl..

[58]  Samy Bengio,et al.  Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Ye Zhang,et al.  SHAPED: Shared-Private Encoder-Decoder for Text Style Adaptation , 2018, NAACL.

[60]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[61]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[62]  Ashish Sardana,et al.  Multilogue-Net: A Context-Aware RNN for Multi-modal Emotion Detection and Sentiment Analysis in Conversation , 2020, CHALLENGEHML.

[63]  Fei Xia,et al.  Email Formality in the Workplace: A Case Study on the Enron Corpus , 2011 .

[64]  Tanya Goyal,et al.  Frustrated, Polite, or Formal: Quantifying Feelings and Tone in Email , 2018, PEOPLES@NAACL-HTL.

[65]  Ralf Krestel,et al.  Offensive Language Detection Explained , 2020, TRAC.

[66]  Martine De Cock,et al.  Private Text Classification with Convolutional Neural Networks , 2021, PRIVATENLP.

[67]  Erik Cambria,et al.  Targeted Aspect-Based Sentiment Analysis via Embedding Commonsense Knowledge into an Attentive LSTM , 2018, AAAI.

[68]  Xiaojun Wan,et al.  Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation , 2019, NeurIPS.

[69]  Anat Rachel Shimoni,et al.  Gender, genre, and writing style in formal written texts , 2003 .

[70]  Min Yang,et al.  Generative Adversarial Network for Abstractive Text Summarization , 2017, AAAI.

[71]  Afroz Ahamad,et al.  Generating Text through Adversarial Training Using Skip-Thought Vectors , 2018, NAACL.

[72]  Richard Socher,et al.  Pointer Sentinel Mixture Models , 2016, ICLR.

[73]  Geoffrey E. Hinton,et al.  The "wake-sleep" algorithm for unsupervised neural networks. , 1995, Science.

[74]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[75]  L. Coppock,et al.  Politeness strategies in conversation closings , 2005 .

[76]  P. Chilton Politeness, Politics and Diplomacy , 1990 .

[77]  Samy Bengio,et al.  Content preserving text generation with attribute controls , 2018, NeurIPS.

[78]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[79]  James T. Kwok,et al.  Generalizing from a Few Examples , 2019, ACM Comput. Surv..

[80]  Jianfeng Gao,et al.  A Persona-Based Neural Conversation Model , 2016, ACL.

[81]  Christine M. Pearson,et al.  Tit for Tat? The Spiraling Effect of Incivility in the Workplace , 1999 .

[82]  George R. Doddington,et al.  Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics , 2002 .

[83]  S. Dupont,et al.  A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis , 2020, CHALLENGEHML.

[84]  Hang Li,et al.  Paraphrase Generation with Deep Reinforcement Learning , 2017, EMNLP.

[85]  Matiss Rikters,et al.  Fragmented and Valuable: Following Sentiment Changes in Food Tweets , 2021, ArXiv.

[86]  Harsh Jhamtani,et al.  Shakespearizing Modern Language Using Copy-Enriched Sequence to Sequence Models , 2017, Proceedings of the Workshop on Stylistic Variation.

[87]  Shlomo Argamon,et al.  Effects of Age and Gender on Blogging , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[88]  Joelle Pineau,et al.  A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[89]  Dong Nguyen,et al.  "How Old Do You Think I Am?" A Study of Language and Age in Twitter , 2013, ICWSM.

[90]  David Bamman,et al.  Gender identity and lexical variation in social media , 2012, 1210.4567.

[91]  Jean-Marc Dewaele,et al.  Formality of Language: definition, measurement and behavioral determinants , 1999 .

[92]  Percy Liang,et al.  Delete, Retrieve, Generate: a Simple Approach to Sentiment and Style Transfer , 2018, NAACL.

[93]  A. Bell Language style as audience design , 1984, Language in Society.

[94]  Takeru Miyato,et al.  cGANs with Projection Discriminator , 2018, ICLR.

[95]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[96]  Xuanjing Huang,et al.  Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation , 2019, ACL.

[97]  Scott F. Kiesling,et al.  Language Style as Identity Construction: A Footing and Framing Approach , 1998 .

[98]  Jiaxin Pei,et al.  Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders , 2020, ACL.

[99]  Xiaolin Hu,et al.  Convolutional Neural Networks With Gated Recurrent Connections , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[100]  Lyle H. Ungar,et al.  Discovering User Attribute Stylistic Differences via Paraphrasing , 2016, AAAI.

[101]  Preslav Nakov,et al.  Predicting the Type and Target of Offensive Posts in Social Media , 2019, NAACL.

[102]  P. Eckert,et al.  Style and Sociolinguistic Variation. , 2002 .

[103]  David Y. W. Lee,et al.  Genres, Registers, Text Types, Domains and Styles: Clarifying the Concepts and Navigating a Path through the BNC Jungle , 2001 .

[104]  Navdeep Jaitly,et al.  Pointer Networks , 2015, NIPS.

[105]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[106]  Kyung-Ah Sohn,et al.  How Positive Are You: Text Style Transfer using Adaptive Style Embedding , 2020, COLING.

[107]  Behrouz Minaei-Bidgoli,et al.  Applying Transformers and Aspect-based Sentiment Analysis approaches on Sarcasm Detection , 2020, FIGLANG.

[108]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[109]  Yulia Tsvetkov,et al.  Style Transfer Through Multilingual and Feedback-Based Back-Translation , 2018, ArXiv.

[110]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[111]  Emiel Krahmer,et al.  Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation , 2017, J. Artif. Intell. Res..

[112]  Yulia Tsvetkov,et al.  Style Transfer Through Back-Translation , 2018, ACL.

[113]  Cícero Nogueira dos Santos,et al.  Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer , 2018, ACL.

[114]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[115]  Julian J. McAuley,et al.  Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering , 2016, WWW.

[116]  Thomas Wolf,et al.  DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter , 2019, ArXiv.

[117]  Alec Radford,et al.  Improving Language Understanding by Generative Pre-Training , 2018 .

[118]  Guillaume Lample,et al.  Multiple-Attribute Text Rewriting , 2018, ICLR.

[119]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[120]  Yulia Tsvetkov,et al.  RtGender: A Corpus for Studying Differential Responses to Gender , 2018, LREC.

[121]  Elke A. Rundensteiner,et al.  EMOTEX: Detecting Emotions in Twitter Messages , 2014 .

[122]  William Labov,et al.  Some principles of linguistic methodology , 1972, Language in Society.

[123]  J. Russell A circumplex model of affect. , 1980 .

[124]  Jure Leskovec,et al.  A computational approach to politeness with application to social factors , 2013, ACL.

[125]  Xu Sun,et al.  Learning Sentiment Memories for Sentiment Modification without Parallel Data , 2018, EMNLP.

[126]  Shibamouli Lahiri,et al.  Informality Judgment at Sentence Level and Experiments with Formality Score , 2011, CICLing.

[127]  Zhe Gan,et al.  Adversarial Text Generation via Feature-Mover's Distance , 2018, NeurIPS.

[128]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[129]  Zhe Gan,et al.  StyleNet: Generating Attractive Visual Captions with Styles , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[130]  George Kurian,et al.  Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[131]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[132]  Ralph Grishman,et al.  Paraphrasing for Style , 2012, COLING.

[133]  Bing Liu,et al.  Lifelong machine learning: a paradigm for continuous learning , 2017, Frontiers of Computer Science.

[134]  Asif Ekbal,et al.  How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble [Application Notes] , 2020, IEEE Comput. Intell. Mag..

[135]  Joosung Lee Stable Style Transformer: Delete and Generate Approach with Encoder-Decoder for Text Style Transfer , 2020, INLG.

[136]  Yiming Yang,et al.  Introducing the Enron Corpus , 2004, CEAS.

[137]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[138]  Zhoujun Li,et al.  Harnessing Pre-Trained Neural Networks with Rules for Formality Style Transfer , 2019, EMNLP.

[139]  K. Vohs,et al.  How Emotion Shapes Behavior: Feedback, Anticipation, and Reflection, Rather Than Direct Causation , 2007, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[140]  Wenjie Li,et al.  Joint Copying and Restricted Generation for Paraphrase , 2016, AAAI.

[141]  Lili Mou,et al.  Disentangled Representation Learning for Non-Parallel Text Style Transfer , 2018, ACL.

[142]  Sanja Fidler,et al.  Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[143]  Alexander J. Smola,et al.  Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS) , 2014, KDD.

[144]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[145]  Naren Ramakrishnan,et al.  Neural Abstractive Text Summarization with Sequence-to-Sequence Models , 2018, Trans. Data Sci..

[146]  Ye Zhang,et al.  A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification , 2015, IJCNLP.

[147]  John Pavlopoulos,et al.  ConvAI at SemEval-2019 Task 6: Offensive Language Identification and Categorization with Perspective and BERT , 2019, *SEMEVAL.

[148]  Erik Cambria,et al.  The Hourglass Model Revisited , 2020, IEEE Intelligent Systems.

[149]  Kevin Knight,et al.  Obfuscating Gender in Social Media Writing , 2016, NLP+CSS@EMNLP.

[150]  Feiyu Xu,et al.  Cycle-Consistent Adversarial Autoencoders for Unsupervised Text Style Transfer , 2020, COLING.

[151]  Xinyan Xiao,et al.  Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer , 2020, ACL.

[152]  Zhou Yu,et al.  Structured Content Preservation for Unsupervised Text Style Transfer , 2018, ArXiv.

[153]  Penelope Brown,et al.  Politeness: Some Universals in Language Usage , 1989 .

[154]  Yixin Cao,et al.  Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen , 2020, ACL.

[155]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[156]  Dayiheng Liu,et al.  Revision in Continuous Space: Fine-Grained Control of Text Style Transfer , 2019, ArXiv.

[157]  Jianfeng Gao,et al.  Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.

[158]  Benjamin Van Durme,et al.  Annotated Gigaword , 2012, AKBC-WEKEX@NAACL-HLT.

[159]  Richard Socher,et al.  A Deep Reinforced Model for Abstractive Summarization , 2017, ICLR.

[160]  Carlo Strapparava,et al.  Learning to identify emotions in text , 2008, SAC '08.

[161]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[162]  Quoc V. Le,et al.  Massive Exploration of Neural Machine Translation Architectures , 2017, EMNLP.