Semantic Maps of Twitter Conversations

Twitter is an irreplaceable source of data for opinion mining, emergency communications, or fact sharing, whose readability is severely limited by the sheer volume of tweets published every day. A method to represent and synthesize the information content of conversations on Twitter in form of semantic maps, from which the main topics and the main orientations of tweeters may easily be read, is proposed hereafter. After a preliminary grouping of tweets in conversations, relevant keywords and Named Entities are extracted, disambiguated and clustered. Annotations are made using extensive knowledge bases and state-of-the-art techniques from Natural Language Processing and Machine Learning. The results are in form of coloured graphs, to be easily interpretable. Several experiments confirm the high understandability and the good adherence to tackled topics of the mapped conversations.