Multi-model Investigative Exploration of Social Media Data with BOUTIQUE: A Case Study in Public Health

We present our experience with a data science problem in Public Health, where researchers use social media (Twitter) to determine whether the public shows awareness of HIV prevention measures offered by Public Health campaigns. To help the researcher, we develop a investigative exploration system called BOUTIQUE that allows a user to perform a multistep visualization and exploration of data through a dashboard interface. Unique features of BOUTIQUE includes its ability to handle heterogeneous types of data provided by a polystore, and its ability to use computation as part of the investigative exploration process. In this paper, we present the design of the BOUTIQUE middleware and walk through an investigation process for a real-life problem.

[1]  John Lee,et al.  Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System , 2016, Proc. VLDB Endow..

[2]  Chris North,et al.  Big Text Visual Analytics in Sensemaking , 2015, 2015 Big Data Visual Analytics (BDVA).

[3]  Anastasia Ailamaki,et al.  Alpine: Efficient In-Situ Data Exploration in the Presence of Updates , 2017, SIGMOD Conference.

[4]  Letizia Tanca,et al.  Exploratory computing: a comprehensive approach to data sensemaking , 2017, International Journal of Data Science and Analytics.

[5]  Subhasis Dasgupta,et al.  Analytics-driven data ingestion and derivation in the AWESOME polystore , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[6]  W. Cumberland,et al.  Social Networking Technologies as an Emerging Tool for HIV Prevention , 2013, Annals of Internal Medicine.

[7]  Guoliang Li,et al.  DeepEye: An automatic big data visualization framework , 2018, Big Data Min. Anal..

[8]  S. Young,et al.  Social media as a tool to monitor adherence to HIV antiretroviral therapy , 2018, Journal of clinical and translational research.

[9]  Stefan Plantikow,et al.  openCypher: New Directions in Property Graph Querying , 2018, EDBT.

[10]  Subhasis Dasgupta,et al.  Generating polystore ingestion plans — A demonstration with the AWESOME system , 2017, 2017 IEEE International Conference on Big Data (Big Data).