Social Media Big Data Acquisition and Analysis for Qualitative GIScience: Challenges and Opportunities

Qualitative geographic information systems (GIS) have come a long way since critical GIS scholars first called for them in the 1990s. The emergence of the geoweb, together with big data sources of qualitative information, has made it possible to implement qualitative GIS in practice. Academic researchers are now grappling with how best to engage with and use qualitative spatial data. Our focus is on qualitative data from social media sources. We review the process of collecting such data and analyzing patterns in it using methods from GIScience as well as newer techniques from computational linguistics, and we examine these methods through the lens of critical qualitative GIScience. We also reflect critically on the ethics of working with social qualitative data. Qualitative GIS has reached a critical juncture: the data, methods, and tools now available allow researchers to ask questions that could not previously be posed. In this article we aim to provide guidance and clarity for researchers engaging with geo-social and spatial qualitative data.
