Twitter for marijuana infodemiology

Today online social networks seem to be good tools to quickly monitor what is going on with the population, since they provide environments where users can freely share large amounts of information related to their own lives. Due to well known limitations of surveys, this novel kind of data can be used to get additional real time insights from people to understand their actual behavior related to drug use. The aim of this work is to make use of text messages (tweets) and relationships between Chilean Twitter users to predict marijuana use among them. To do this we collected Twitter accounts using a location-based criteria, and built a set of features based on tweets they made and ego centric network metrics. To get tweet-based features, tweets were filtered using marijuana-related keywords and a set of 1000 tweets were manually labeled to train algorithms capable of predicting marijuana use in tweets. In addition, a sentiment classifier of tweets was developed using the TASS corpus. Then, we made a survey to get real marijuana use labels related to accounts and these labels were used to train supervised machine learning algorithms. The marijuana use per user classifier had precision, recall and F-measure results close to 0.7, implying significant predictive power of the selected variables. We obtained a model capable of predicting marijuana use of Twitter users and estimating their opinion about marijuana. This information can be used as an efficient (fast and low cost) tool for marijuana surveillance, and support decision making about drug policies.

[1]  Juan D. Velásquez,et al.  Biometric information fusion for web user navigation and preferences analysis: An overview , 2017, Inf. Fusion.

[2]  Melissa J. Krauss,et al.  Twitter chatter about marijuana. , 2015, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[3]  José Carlos González,et al.  TASS - Workshop on Sentiment Analysis at SEPLN , 2013, Proces. del Leng. Natural.

[4]  Mark Dredze,et al.  Could behavioral medicine lead the web data revolution? , 2014, JAMA.

[5]  Yukie Ikedaa,et al.  Knowledge Based and Intelligent Information and Engineering Systems An Evacuation Route Planning for Safety Route Guidance System after Natural Disaster Using Multi-Objective Genetic Algorithm , 2016 .

[6]  Amit Sheth,et al.  "Those edibles hit hard": Exploration of Twitter data on cannabis edibles in the U.S. , 2016, Drug and alcohol dependence.

[7]  Baoxin Li,et al.  Finding needles of interested tweets in the haystack of Twitter network , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[8]  A. Amialchuk,et al.  The Social Contagion Effect of Marijuana Use among Adolescents , 2011, PloS one.

[9]  Melissa J. Krauss,et al.  A content analysis of tweets about high-potency marijuana. , 2016, Drug and alcohol dependence.

[10]  D. Henry,et al.  Interplay of Network Position and Peer Substance Use in Early Adolescent Cigarette, Alcohol, and Marijuana Use , 2010 .

[11]  Juan D. Velásquez,et al.  Design and Implementation of a Methodology for Identifying Website Keyobjects , 2009, KES.

[12]  Nicholas Genes,et al.  Leveraging Social Networks for Toxicovigilance , 2013, Journal of Medical Toxicology.

[13]  Melissa J. Krauss,et al.  Young Adults' Exposure to Alcohol- and Marijuana-Related Content on Twitter. , 2016, Journal of studies on alcohol and drugs.

[14]  Hongying Dai,et al.  Mining social media data for opinion polarities about electronic cigarettes , 2016, Tobacco Control.

[15]  Jorge A. Balazs,et al.  Opinion Mining and Information Fusion: A survey , 2016, Inf. Fusion.

[16]  J. Bauermeister,et al.  Online Network Influences on Emerging Adults’ Alcohol and Drug Use , 2013, Journal of youth and adolescence.

[17]  Gunther Eysenbach,et al.  Infodemiology and infoveillance tracking online health information and cyberbehavior for public health. , 2011, American journal of preventive medicine.

[18]  Krishnaprasad Thirunarayan,et al.  “When ‘Bad’ is ‘Good’”: Identifying Personal Communication and Sentiment in Drug-Related Tweets , 2016, JMIR public health and surveillance.

[19]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[20]  Francois R. Lamy,et al.  "Time for dabs": Analyzing Twitter data on marijuana concentrates across the U.S. , 2015, Drug and alcohol dependence.

[21]  Priya Anand,et al.  Focused web crawlers and its approaches , 2015, 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE).

[22]  Dong Nguyen,et al.  "How Old Do You Think I Am?" A Study of Language and Age in Twitter , 2013, ICWSM.

[23]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[24]  José Carlos González Cristóbal,et al.  TASS - Workshop on Sentiment Analysis at SEPLN , 2013 .

[25]  Marc A Zimmerman,et al.  Permissive norms and young adults' alcohol and marijuana use: the role of online communities. , 2012, Journal of studies on alcohol and drugs.

[26]  Ian Portelli,et al.  Drug Use in the Twittersphere: A Qualitative Contextual Analysis of Tweets About Prescription Drugs , 2015, Journal of addictive diseases.

[27]  Karl E. Bauman,et al.  The Peer Context of Adolescent Substance Use: Findings from Social Network Analysis , 2006 .