A Hybrid Approach for Drug Abuse Events Extraction from Twitter

Since their emergence, social media have become a reliable source of social events which attracted the interest of research community to extract them for many business requirements. However, unlike formal sources like news articles, social data exploitation for events extraction is much harder regarding the complex character of social text. Many approaches, ranging from linguistic techniques to learning algorithms, were proposed to succeed this task. Nevertheless, achieved results are weak regarding the complexity and completeness of the task.In this paper, we focus on private events extraction from Twitter by tracking digital drug abusers. We propose a hybrid approach in which we combine strengths of linguistic rules and learning techniques looking for better performance. In fact, we use linguistic rules to build an automatically annotated training set and extract a set of features as well, to be used in a learning process in order to improve obtained results. The proposed approach outperforms the baseline by 24,8% thanks to combination of techniques.

[1]  Khaled Shaalan,et al.  A hybrid approach to Arabic named entity recognition , 2014, J. Inf. Sci..

[2]  Tom M. Mitchell,et al.  Weakly Supervised Extraction of Computer Security Events from Twitter , 2015, WWW.

[3]  Pierre Zweigenbaum,et al.  A Hybrid Approach for the Extraction of Semantic Relations from MEDLINE Abstracts , 2011, CICLing.

[4]  Liangyu Chen,et al.  An Unsupervised Framework of Exploring Events on Twitter: Filtering, Extraction and Categorization , 2015, AAAI.

[5]  Salma Jamoussi,et al.  A hybrid method for extracting relations between Arabic named entities , 2014, J. King Saud Univ. Comput. Inf. Sci..

[6]  Jakub Piskorski,et al.  Real-time News Event Extraction for Global Monitoring Systems , 2008 .

[7]  Oren Etzioni,et al.  Open domain event extraction from twitter , 2012, KDD.

[8]  Mihai Surdeanu,et al.  A Domain-independent Rule-based Framework for Event Extraction , 2015, ACL.

[9]  Claire Cardie,et al.  Major Life Event Extraction from Twitter based on Congratulations/Condolences Speech Acts , 2014, EMNLP.

[10]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[11]  Liangyu Chen,et al.  A Simple Bayesian Modelling Approach to Event Extraction from Twitter , 2014, ACL.

[12]  Christopher D. Manning,et al.  Stanford typed dependencies manual , 2010 .

[13]  Lamjed Ben Said,et al.  Lexicon-Based System for Drug Abuse Entity Extraction from Twitter , 2015, BDAS.

[14]  Bruno Grilhères,et al.  Combinaison d'approches pour l'extraction automatique d'événements (Automatic events extraction by combining multiple approaches) [in French] , 2012, JEP-TALN-RECITAL.

[15]  Hsinchun Chen,et al.  Identifying adverse drug events from patient social media: A case study for diabetes , 2015, IEEE Intelligent Systems.

[16]  Georgios Paliouras,et al.  Towards Open Domain Event Extraction from Twitter: REVEALing Entity Relations , 2015, DeRiVE@ESWC.

[17]  Mihai Surdeanu,et al.  Description of the Odin Event Extraction Framework and Rule Language , 2015, ArXiv.

[18]  Vasudeva Varma,et al.  Structured Information Extraction from Natural Disaster Events on Twitter , 2014, Web-KR '14.

[19]  Thierry Poibeau,et al.  Multi-source, Multilingual Information Extraction and Summarization , 2012, Theory and Applications of Natural Language Processing.