Inferring Fine-grained Details on User Activities and Home Location from Social Media: Detecting Drinking-While-Tweeting Patterns in Communities

Nearly all previous work on geo-locating latent states and activities from social media confounds general discussions about activities, self-reports of users participating in those activities at times in the past or future, and self-reports made at the immediate time and place the activity occurs. Activities, such as alcohol consumption, may occur at different places and types of places, and it is important not only to detect the local regions where these activities occur, but also to analyze the degree of participation in them by local residents. In this paper, we develop new machine learning based methods for fine-grained localization of activities and home locations from Twitter data. We apply these methods to discover and compare alcohol consumption patterns in a large urban area, New York City, and a more suburban and rural area, Monroe County. We find positive correlations between the rate of alcohol consumption reported among a community's Twitter users and the density of alcohol outlets, demonstrating that the degree of correlation varies significantly between urban and suburban areas. While our experiments are focused on alcohol use, our methods for locating homes and distinguishing temporally-specific self-reports are applicable to a broad range of behaviors and latent states.

[1]  J. Brownstein,et al.  Digital disease detection--harnessing the Web for public health surveillance. , 2009, The New England journal of medicine.

[2]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[3]  Michael Livingston,et al.  Alcohol outlet density and assault: a spatial analysis. , 2008, Addiction.

[4]  Alcohol-attributable deaths and years of potential life lost--United States, 2001. , 2004, MMWR. Morbidity and mortality weekly report.

[5]  Ryen W. White,et al.  Cyberchondria: Studies of the escalation of medical concerns in Web search , 2009, TOIS.

[6]  Malcolm R. Parks,et al.  Display of health risk behaviors on MySpace by adolescents: prevalence and associations. , 2009, Archives of pediatrics & adolescent medicine.

[7]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[8]  Virgílio A. F. Almeida,et al.  We know where you live: privacy characterization of foursquare behavior , 2012, UbiComp.

[9]  Jennifer Cook Middleton,et al.  The effectiveness of limiting alcohol outlet density as a means of reducing excessive alcohol consumption and alcohol-related harms. , 2009, American journal of preventive medicine.

[10]  Virgílio A. F. Almeida,et al.  Beware of What You Share: Inferring Home Location in Social Networks , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[11]  K. Kypri,et al.  Alcohol outlet density and university student drinking: a national study. , 2008, Addiction.

[12]  D. Mackinnon,et al.  Alcohol outlet density and motor vehicle crashes in Los Angeles County cities. , 1994, Journal of studies on alcohol.

[13]  Benyuan Liu,et al.  Twitter Improves Seasonal Influenza Prediction , 2018, HEALTHINF.

[14]  Timothy B. Patrick,et al.  Social Media, Big Data, and Public Health Informatics: Ruminating Behavior of Depression Revealed through Twitter , 2015, 2015 48th Hawaii International Conference on System Sciences.

[15]  Michael J. Paul,et al.  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic , 2013, PloS one.

[16]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[17]  Scott A. Golder,et al.  Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures , 2011 .

[18]  Emmanuel Kuntsche,et al.  Why do young people drink? A review of drinking motives. , 2005, Clinical psychology review.

[19]  John Krumm,et al.  Inference Attacks on Location Tracks , 2007, Pervasive.

[20]  M. Moreno,et al.  Alcohol References on Undergraduate Males’ Facebook Profiles , 2011, American journal of men's health.

[21]  Aron Culotta,et al.  Using matched samples to estimate the effects of exercise on mental health from twitter , 2015, AAAI 2015.

[22]  H. Wechsler,et al.  The relationship of alcohol outlet density to heavy and frequent drinking and drinking-related problems among college students at eight universities. , 2003, Health & place.

[23]  D. Cohen,et al.  Alcohol availability and homicide in New Orleans: conceptual considerations for small area analysis of the effect of alcohol outlet density. , 1999, Journal of studies on alcohol.

[24]  T. Farley,et al.  Alcohol outlet density and alcohol consumption in Los Angeles county and southern Louisiana. , 2008, Geospatial health.

[25]  M. Livingston A longitudinal analysis of alcohol outlet density and assault. , 2008, Alcoholism, clinical and experimental research.

[26]  Henry A. Kautz,et al.  Predicting Disease Transmission from Geo-Tagged Micro-Blog Data , 2012, AAAI.

[27]  A. Brennan,et al.  The impact of spatial and temporal availability of alcohol on its consumption and related harms: A critical review in the context of UK licensing policies , 2014, Drug and alcohol review.

[28]  M. Stock,et al.  Adolescent alcohol-related risk cognitions: the roles of social norms and social networking sites. , 2011, Psychology of addictive behaviors : journal of the Society of Psychologists in Addictive Behaviors.

[29]  Mark Dredze,et al.  Separating Fact from Fear: Tracking Flu Infections on Twitter , 2013, NAACL.

[30]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[31]  Mark Mingyi Young,et al.  Twitter Me: Using Micro-blogging to Motivate Teenagers to Exercise , 2010, DESRIST.

[32]  Wenpu Xing,et al.  Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[33]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[34]  Hui Xiong,et al.  Enhancing Security and Privacy in Traffic-Monitoring Systems , 2006, IEEE Pervasive Computing.

[35]  Henry A. Kautz,et al.  Modeling Spread of Disease from Social Interactions , 2012, ICWSM.

[36]  CulottaAron Lightweight methods to estimate influenza rates and alcohol sales volume from Twitter messages , 2013 .

[37]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[38]  Lars Backstrom,et al.  Structural diversity in social contagion , 2012, Proceedings of the National Academy of Sciences.

[39]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[40]  John Krumm,et al.  Placer: semantic place labels from diary data , 2013, UbiComp.

[41]  Liqi Zhu,et al.  Alcohol outlet density and violence: a geospatial analysis. , 2004, Alcohol and alcoholism.

[42]  Wen-Jing Hsu,et al.  Predictability of individuals' mobility with high-resolution positioning data , 2012, UbiComp.

[43]  Bradley P. Carlin,et al.  Neighborhood Level Spatial Analysis of the Relationship Between Alcohol Outlet Density and Criminal Violence , 2005, Environmental and Ecological Statistics.

[44]  Henry A. Kautz,et al.  Towards Understanding Global Spread of Disease from Everyday Interpersonal Interactions , 2013, IJCAI.

[45]  Michael Livingston,et al.  A longitudinal analysis of alcohol outlet density and domestic violence. , 2011, Addiction.

[46]  L. Atwood,et al.  Evaluating the Believability and Effectiveness of the Social Norms Message "Most Students Drink 0 to 4 Drinks When They Party" , 2006, Health communication.

[47]  Mir M. Ali,et al.  Social network effects in alcohol consumption among adolescents. , 2010, Addictive behaviors.

[48]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[49]  Cecilia Mascolo,et al.  Socio-Spatial Properties of Online Location-Based Social Networks , 2011, ICWSM.

[50]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[51]  John Krumm,et al.  Far Out: Predicting Long-Term Human Mobility , 2012, AAAI.

[52]  J. Murabito,et al.  The Spread of Alcohol Consumption Behavior in a Large Social Network , 2010, Annals of Internal Medicine.

[53]  Jeffrey Nichols,et al.  Where Is This Tweet From? Inferring Home Locations of Twitter Users , 2012, ICWSM.

[55]  Shanthi Ameratunga,et al.  Predictors of drinking patterns in adolescence: a latent class analysis. , 2014, Drug and alcohol dependence.

[56]  Daphne Koller,et al.  Hierarchically Classifying Documents Using Very Few Words , 1997, ICML.

[57]  A. Mokdad,et al.  Binge drinking among US adults. , 2003, JAMA.

[58]  Aron Culotta,et al.  Lightweight methods to estimate influenza rates and alcohol sales volume from Twitter messages , 2012, Language Resources and Evaluation.

[59]  Gavin Smith,et al.  A refined limit on the predictability of human mobility , 2014, 2014 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[60]  Henry A. Kautz,et al.  Modeling the impact of lifestyle on health at scale , 2013, WSDM.

[61]  William DeJong,et al.  The contextual role of alcohol outlet density in college drinking. , 2008, Journal of studies on alcohol and drugs.

[62]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[63]  Hiroyuki Ohsaki,et al.  Recognizing Depression from Twitter Activity , 2015, CHI.

[64]  Henry A. Kautz,et al.  nEmesis: Which Restaurants Should You Avoid Today? , 2013, HCOMP.

[65]  Munmun De Choudhury,et al.  Characterizing Smoking and Drinking Abstinence from Social Media , 2015, HT.

[66]  WENBIN LIANG,et al.  Revealing the link between licensed outlets and violence: counting venues versus measuring alcohol availability. , 2011, Drug and alcohol review.

[67]  Matthew Rowe,et al.  Towards tracking and analysing regional alcohol consumption patterns in the UK through the use of social media , 2014, WebSci '14.

[68]  P. Gruenewald,et al.  Community alcohol outlet density and underage drinking. , 2010, Addiction.