Embedding Lexical Features via Tensor Decomposition for Small Sample Humor Recognition

We propose a novel tensor embedding method that can effectively extract lexical features for humor recognition. Specifically, we use word-word co-occurrence to encode the contextual content of documents, and then decompose the tensor to get corresponding vector representations. We show that this simple method can capture features of lexical humor effectively for continuous humor recognition. In particular, we achieve a distance of 0.887 on a global humor ranking task, comparable to the top performing systems from SemEval 2017 Task 6B (Potash et al., 2017) but without the need for any external training corpus. In addition, we further show that this approach is also beneficial for small sample humor recognition tasks through a semi-supervised label propagation procedure, which achieves about 0.7 accuracy on the 16000 One-Liners (Mihalcea and Strapparava, 2005) and Pun of the Day (Yang et al., 2015) humour classification datasets using only 10% of known labels.

[1]  Dafna Shahaf,et al.  Inside Jokes: Identifying Humorous Cartoon Captions , 2015, KDD.

[2]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[3]  E. Papalexakis Unsupervised Content-Based Identification of Fake News Articles with Tensor Decomposition Ensembles , 2018 .

[4]  Diane J. Litman,et al.  Humor: Prosody Analysis and Automatic Recognition for F*R*I*E*N*D*S* , 2006, EMNLP.

[5]  Anna Rumshisky,et al.  HumorHawk at SemEval-2017 Task 6: Mixing Meaning and Sound for Humor Recognition , 2017, SemEval@ACL.

[6]  Anna Rumshisky,et al.  SemEval-2017 Task 6: #HashtagWars: Learning a Sense of Humor , 2017, *SEMEVAL.

[7]  Matteo Pagliardini,et al.  Unsupervised Learning of Sentence Embeddings Using Compositional n-Gram Features , 2017, NAACL.

[8]  Xiaojuan Ma,et al.  Recognizing Humour using Word Associations and Humour Anchor Extraction , 2018, COLING.

[9]  Pascale Fung,et al.  A Long Short-Term Memory Framework for Predicting Humor in Dialogues , 2016, NAACL.

[10]  Dragomir R. Radev,et al.  Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest , 2015, LREC.

[11]  Diyi Yang,et al.  Humor Recognition and Humor Anchor Extraction , 2015, EMNLP.

[12]  Carlo Strapparava,et al.  Making Computers Laugh: Investigations in Automatic Humor Recognition , 2005, HLT.

[13]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[14]  Ted Pedersen,et al.  Duluth at SemEval-2017 Task 6: Language Models in Humor Detection , 2017, SemEval@ACL.

[15]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[16]  Davide Buscaldi,et al.  From humor recognition to irony detection: The figurative language of social media , 2012, Data Knowl. Eng..

[17]  Xiaojuan Ma,et al.  Effects of Semantic Relatedness between Setups and Punchlines in Twitter Hashtag Games , 2016, PEOPLES@COLING.

[18]  Nikos D. Sidiropoulos,et al.  Tensor Decomposition for Signal Processing and Machine Learning , 2016, IEEE Transactions on Signal Processing.