Social media analysis for product safety using text mining and sentiment analysis

The growing incidents of counterfeiting and associated economic and health consequences necessitate the development of active surveillance systems capable of producing timely and reliable information for all stake holders in the anti-counterfeiting fight. User generated content from social media platforms can provide early clues about product allergies, adverse events and product counterfeiting. This paper reports a work in progress with contributions including: the development of a framework for gathering and analyzing the views and experiences of users of drug and cosmetic products using machine learning, text mining and sentiment analysis; the application of the proposed framework on Facebook comments and data from Twitter for brand analysis, and the description of how to develop a product safety lexicon and training data for modeling a machine learning classifier for drug and cosmetic product sentiment prediction. The initial brand and product comparison results signify the usefulness of text mining and sentiment analysis on social media data while the use of machine learning classifier for predicting the sentiment orientation provides a useful tool for users, product manufacturers, regulatory and enforcement agencies to monitor brand or product sentiment trends in order to act in the event of sudden or significant rise in negative sentiment.

[1]  Bin Tang,et al.  Document Representation and Dimension Reduction for Text Clustering , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[2]  Blesson Varghese,et al.  The royal birth of 2013: Analysing and visualising public sentiment in the UK using Twitter , 2013, 2013 IEEE International Conference on Big Data.

[3]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[4]  Meera Narvekar,et al.  A review of techniques for sentiment analysis Of Twitter data , 2014, 2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT).

[5]  Thiago Pardo,et al.  NILC_USP: A Hybrid System for Sentiment Analysis in Twitter Messages , 2013, *SEMEVAL.

[6]  Jalel Akaichi,et al.  Text mining facebook status updates for sentiment classification , 2013, 2013 17th International Conference on System Theory, Control and Computing (ICSTCC).

[7]  Richard Colbaugh,et al.  Estimating the sentiment of social media content for security informatics applications , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[8]  Ryen W. White,et al.  Web-scale pharmacovigilance: listening to signals from the crowd , 2013, J. Am. Medical Informatics Assoc..

[9]  Freimut Bodendorf,et al.  Mining Patient Experiences on Web 2.0 - A Case Study in the Pharmaceutical Industry , 2012, 2012 Annual SRII Global Conference.

[10]  Pierre Margot,et al.  Understanding and fighting the medicine counterfeit market. , 2014, Journal of pharmaceutical and biomedical analysis.

[11]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[12]  Dong Li,et al.  Sentiment Orientation Classification of Webpage Online Commentary Based on Intuitionistic Fuzzy Reasoning , 2013 .

[13]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[14]  Kevin Makice TWITTER API : UP AND RUNNING , 2009 .

[15]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[16]  Marc Cheong,et al.  A microblogging-based approach to terrorism informatics: Exploration and chronicling civilian sentiment and response to terrorism events via Twitter , 2011, Inf. Syst. Frontiers.

[17]  Marti A. Hearst Untangling Text Data Mining , 1999, ACL.

[18]  Ann O'Brien,et al.  National Security and Social Media Monitoring: A Presentation of the EMOTIVE and Related Systems , 2013, 2013 European Intelligence and Security Informatics Conference.

[19]  Patty Kostkova,et al.  Early Warning and Outbreak Detection Using Social Networking Websites: The Potential of Twitter , 2009, eHealth.

[20]  Mohamed M. Mostafa,et al.  More than words: Social networks' text mining for consumer brand sentiments , 2013, Expert Syst. Appl..

[21]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[22]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[23]  Bruce R. Schatz,et al.  Social Visualization of Health Messages , 2009, 2009 42nd Hawaii International Conference on System Sciences.

[24]  Nathan Danneman,et al.  Social Media Mining with R , 2014 .