Paper information - Back to the Future - Sequential Alignment of Text Representations

Back to the Future - Sequential Alignment of Text Representations

Language evolves over time in many ways relevant to natural language processing tasks. For example, recent occurrences of tokens 'BERT' and 'ELMO' in publications refer to neural network architectures rather than persons. This type of temporal signal is typically overlooked, but is important if one aims to deploy a machine learning model over an extended period of time. In particular, language evolution causes data drift between time-steps in sequential decision-making tasks. Examples of such tasks include prediction of paper acceptance for yearly conferences (regular intervals) or author stance prediction for rumours on Twitter (irregular intervals). Inspired by successes in computer vision, we tackle data drift by sequentially aligning learned representations. We evaluate on three challenging tasks varying in terms of time-scales, linguistic units, and domains. These tasks show our method outperforming several strong baselines, including using all available data. We argue that, due to its low computational expense, sequential alignment is a practical solution to dealing with language evolution.
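The abstract does not spell out how representations are aligned across time-steps. As a rough illustration only, the sketch below assumes a PCA-based subspace alignment, a standard unsupervised domain adaptation technique, and chains it from one time-step to the next. The function names `pca_basis` and `sequential_align`, the subspace dimension `d`, and the random data are hypothetical and not taken from the paper.

```python
import numpy as np


def pca_basis(X, d):
    """Return the top-d principal directions of X as columns, shape (D, d)."""
    Xc = X - X.mean(axis=0)
    # Rows of Vt are principal directions, ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Vt[:d].T


def sequential_align(steps, d):
    """Chain subspace alignments over time-ordered feature matrices.

    steps: list of (n_t, D) arrays, oldest first (assumed input layout).
    Returns all samples expressed in the d-dimensional subspace of the
    most recent time-step, stacked oldest first.
    """
    bases = [pca_basis(X, d) for X in steps]
    # Project the first time-step onto its own subspace.
    pooled = (steps[0] - steps[0].mean(axis=0)) @ bases[0]
    for t in range(1, len(steps)):
        # d x d map from the previous step's basis to the current step's basis.
        M = bases[t - 1].T @ bases[t]
        current = (steps[t] - steps[t].mean(axis=0)) @ bases[t]
        # Move everything seen so far into the current coordinates, then append.
        pooled = np.vstack([pooled @ M, current])
    return pooled


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Three hypothetical time-steps of 8-dimensional text representations.
    steps = [rng.normal(size=(20, 8)) for _ in range(3)]
    aligned = sequential_align(steps, d=3)
    print(aligned.shape)
```

Under these assumptions, a classifier could be trained on the rows from earlier time-steps and applied to the rows from the latest one. Each step costs one SVD and one small matrix product, which fits the abstract's point about low computational expense.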

Isabelle Augenstein | Johannes Bjerva | Wouter Kouw
