De l'usage réel des emojis à une prédiction de leurs catégories (From Emoji Usage to Emoji-Category Prediction)

From Emoji Usage to Emoji-Category Prediction Emoji usage drastically increased recently, they are becoming some of the most common ways to convey emotions and sentiments in social messaging applications. Several research works proposed to automatically recommend them to avoid having users scrolling down a library of thousands emojis. In order to improve emoji recommendation, we present a method to automatically extract emoji categories from their usage in tweets, following the assumption that emojis are part of written natual language, as words are. Thereby, emotion categories of face emojis were obtained directly from text in a fully reproductible way. These resources and methodology have multiple usages, including enhanced emoji understanding or enhanced emoji recommendation. MOTS-CLÉS : emoji, recommandation, plongements lexicaux, ressource, regroupement.

[1]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[2]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3]  Patrice Bellot,et al.  Prédiction automatique d'emojis sentimentaux , 2017, CORIA.

[4]  Ryan Kelly,et al.  Characterising the inventive appropriation of emoji as relationally meaningful in mediated close personal relationships , 2015 .

[5]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[6]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[7]  Horacio Saggion,et al.  Are Emojis Predictable? , 2017, EACL.

[8]  Horacio Saggion,et al.  What does this Emoji Mean? A Vector Space Skip-Gram Model for Twitter Emojis , 2016, LREC.

[9]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[10]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[11]  Ning Wang,et al.  Untangling Emoji Popularity Through Semantic Embeddings , 2017, ICWSM.

[12]  Julia Hirschberg,et al.  V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.

[13]  Amit P. Sheth,et al.  EmojiNet: An Open Service and API for Emoji Sense Discovery , 2017, ICWSM.

[14]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[15]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[16]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[17]  Caroline Kelly Do you know what I mean g :( : A linguistic study of the understanding ofemoticons and emojis in text messages , 2015 .