EmojiNet: Building a Machine Readable Sense Inventory for Emoji

Emoji are a contemporary and extremely popular way to enhance electronic communication. Without rigid semantics attached to them, emoji symbols take on different meanings based on the context of a message. Thus, much as in the word sense disambiguation task in natural language processing, machines need to disambiguate the meaning, or 'sense', of an emoji. As a first step toward this goal, this paper presents EmojiNet, the first machine-readable sense inventory for emoji. EmojiNet is a resource that enables systems to link emoji with their context-specific meanings. It is constructed automatically by integrating multiple emoji resources with BabelNet, the most comprehensive multilingual sense inventory available to date. The paper discusses EmojiNet's construction, evaluates the automatic resource creation process, and presents a use case in which EmojiNet disambiguates emoji usage in tweets. EmojiNet is available online at http://emojinet.knoesis.org.
