New Modality: Emoji Challenges in Prediction, Anticipation, and Retrieval

Over the past decade, emoji have emerged as a new and widespread form of digital communication, spanning diverse social networks and spoken languages. We propose treating these ideograms as a new modality in their own right, distinct in their semantic structure from both the text in which they are often embedded as well as the images which they resemble. As a new modality, emoji present rich novel possibilities for representation and interaction. In this paper, we explore the challenges that arise naturally from considering the emoji modality through the lens of multimedia research, specifically the ways in which emoji can be related to other common modalities such as text and images. To do so, we first present a large-scale data set of real-world emoji usage collected from Twitter. This data set contains examples of both text-emoji and image-emoji relationships within tweets. We present baseline results on the challenge of predicting emoji from both text and images, using state-of-the-art neural networks. Further, we offer a first consideration into the problem of how to account for new, unseen emoji—a relevant issue as the emoji vocabulary continues to expand on a yearly basis. Finally, we present results for multimedia retrieval using emoji as queries.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Marília Prada,et al.  A frown emoji can be worth a thousand words: Perceptions of emoji use in text messages exchanged between romantic partners , 2017, Telematics Informatics.

[3]  Loren G. Terveen,et al.  Understanding Emoji Ambiguity in Context: The Role of Text in Emoji-Related Miscommunication , 2017, ICWSM.

[4]  Yue Gao,et al.  View-Based 3D Object Retrieval: Challenges and Approaches , 2014, IEEE MultiMedia.

[5]  Ning Wang,et al.  Untangling Emoji Popularity Through Semantic Embeddings , 2017, ICWSM.

[6]  Dennis Koelma,et al.  The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection , 2016, ICMR.

[7]  Monica A. Riordan The communicative role of non-face emojis: Affect and disambiguation , 2017, Comput. Hum. Behav..

[8]  Xuanzhe Liu,et al.  Through a Gender Lens: Learning Usage Patterns of Emojis from Large-Scale Android Users , 2017, WWW.

[9]  Cees Snoek,et al.  Image2Emoji: Zero-shot Emoji Prediction for Visual Media , 2015, ACM Multimedia.

[10]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[11]  Amit P. Sheth,et al.  EmojiNet: Building a Machine Readable Sense Inventory for Emoji , 2016, SocInfo.

[12]  Patrizia Paggio,et al.  Investigating Redundancy in Emoji Use: Study on a Twitter Based Corpus , 2017, WASSA@EMNLP.

[13]  HENNING POHL,et al.  Beyond Just Text , 2017, ACM Trans. Comput. Hum. Interact..

[14]  VirtanenTuomas,et al.  Detection and Classification of Acoustic Scenes and Events , 2018 .

[15]  Samy Bengio,et al.  Zero-Shot Learning by Convex Combination of Semantic Embeddings , 2013, ICLR.

[16]  Jose Berengueres,et al.  Sentiment Perception of Readers and Writers in Emoji use , 2017, ArXiv.

[17]  Horacio Saggion,et al.  What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis , 2016, LREC.

[18]  Cordelia Schmid,et al.  Label-Embedding for Image Classification , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Cees Snoek,et al.  Query-by-Emoji Video Search , 2015, ACM Multimedia.

[20]  Yiannis Kompatsiaris,et al.  Sensing Trending Topics in Twitter , 2013, IEEE Transactions on Multimedia.

[21]  Dong Liu,et al.  Event-Driven Semantic Concept Discovery by Exploiting Weakly Tagged Internet Images , 2014, ICMR.

[22]  Teruko Mitamura,et al.  Zero-Example Event Search using MultiModal Pseudo Relevance Feedback , 2014, ICMR.

[23]  Torben Wallbaum,et al.  Face2Emoji: Using Facial Emotional Expressions to Filter Emojis , 2017, CHI Extended Abstracts.

[24]  Serkan Ayvaz,et al.  The Effects of Emoji in Sentiment Analysis , 2017 .

[25]  Loren G. Terveen,et al.  "Blissfully Happy" or "Ready toFight": Varying Interpretations of Emoji , 2016, ICWSM.

[26]  Orion Montoya,et al.  Varying Linguistic Purposes of Emoji in (Twitter) Context , 2017, ACL.

[27]  Kennedy Njenga,et al.  Social Media Information Security Threats: Anthropomorphic Emoji Analysis on Social Engineering , 2018 .

[28]  Fei Chen,et al.  A Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference , 2010, IEEE Transactions on Multimedia.

[29]  Darja Fiser,et al.  A Global Analysis of Emoji Usage , 2016, WAC@ACL.

[30]  David A. Shamma,et al.  The New Data and New Challenges in Multimedia Research , 2015, ArXiv.

[31]  Xuanzhe Liu,et al.  Through a Gender Lens: Learning Usage Patterns of Emojis from Large-Scale Android Users , 2017, WWW.

[32]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[33]  Marcel Worring,et al.  The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[34]  Stacy Marsella,et al.  Predicting Speaker Head Nods and the Effects of Affective Information , 2010, IEEE Transactions on Multimedia.

[35]  Abdulmotaleb El-Saddik,et al.  Language-independent data set annotation for machine learning-based sentiment analysis , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[36]  Horacio Saggion,et al.  How Cosmopolitan Are Emojis?: Exploring Emojis Usage and Meaning over Different Languages with Distributional Semantics , 2016, ACM Multimedia.

[37]  Xiang Li,et al.  Joint Emoji Classification and Embedding Learning , 2017, APWeb/WAIM.

[38]  Petra Kralj Novak,et al.  Sentiment of Emojis , 2015, PloS one.

[39]  Neha Kumar,et al.  Goodbye Text, Hello Emoji: Mobile Communication on WeChat in China , 2017, CHI.

[40]  David R. Flatla,et al.  Oh that's what you meant!: reducing emoji misunderstanding , 2016, MobileHCI Adjunct.

[41]  Ning Wang,et al.  Learning from the ubiquitous language: an empirical analysis of emoji usage of smartphone users , 2016, UbiComp.

[42]  Amit P. Sheth,et al.  A semantics-based measure of emoji similarity , 2017, WI.

[43]  K. R. Venugopal,et al.  Consumer insight mining: Aspect based Twitter opinion mining of mobile phone reviews , 2017, Appl. Soft Comput..

[44]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[45]  Horacio Saggion,et al.  Are Emojis Predictable? , 2017, EACL.

[46]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Fabienne Cap,et al.  Did you ever read about Frogs drinking Coffee? Investigating the Compositionality of Multi-Emoji Expressions , 2017, WASSA@EMNLP.