A bootstrapping approach to social media quantification

This work considers the use of classifiers in a downstream aggregation task estimating class proportions, such as estimating the percentage of reviews for a movie with positive sentiment. We derive the bias and variance of the class proportion estimator when taking classification error into account to determine how to best trade off different error types when tuning a classifier for these tasks. Additionally, we propose a method for constructing confidence intervals that correctly adjusts for classification error when estimating these statistics. We conduct experiments on four document classification tasks comparing our methods to prior approaches across classifier thresholds, sample sizes, and label distributions. Prior approaches have focused on providing the most accurate point estimate while this work focuses on the creation of correct confidence intervals that appropriately account for classifier error. Compared to the prior approaches, our methods provide lower error and more accurate confidence intervals.

[1]  Fabrizio Sebastiani Sentiment Quantification of User-Generated Content , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[2]  Gary M. Weiss,et al.  Quantification and semi-supervised classification methods for handling changes in class distribution , 2009, KDD.

[3]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[4]  George Forman,et al.  Tackling concept drift by temporal inductive transfer , 2006, SIGIR.

[5]  Wei Gao,et al.  From classification to quantification in tweet sentiment analysis , 2016, Social Network Analysis and Mining.

[6]  Christopher J Paciorek,et al.  Measurement error in two‐stage analyses, with application to air pollution epidemiology , 2012, Environmetrics.

[7]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[8]  José Ramón Quevedo,et al.  Using ensembles for problems with characterizable changes in data distribution: A case study on quantification , 2017, Inf. Fusion.

[9]  D. Stram,et al.  Correcting for exposure measurement error in a reanalysis of lung cancer mortality for the Colorado Plateau Uranium Miners cohort. , 1999, Health physics.

[10]  George Forman,et al.  Counting Positives Accurately Despite Inaccurate Classification , 2005, ECML.

[11]  Wei Gao,et al.  Tweet sentiment: From classification to quantification , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[12]  Magne Thoresen,et al.  Model‐based bootstrapping when correcting for measurement error with application to logistic regression , 2018, Biometrics.

[13]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[14]  Wei Gao,et al.  Ordinal Text Quantification , 2016, SIGIR.

[15]  Michael J. Paul,et al.  Constructing Accurate Confidence Intervals When Aggregating Social Media Data for Public Health Monitoring , 2019, Precision Health and Medicine.

[16]  George Forman,et al.  Quantifying counts and costs via classification , 2008, Data Mining and Knowledge Discovery.

[17]  Alessandro Barbiero,et al.  Bootstrapping probability-proportional-to-size samples via calibrated empirical population , 2015 .

[18]  Nitesh V. Chawla,et al.  A Review on Quantification Learning , 2017, ACM Comput. Surv..

[19]  Juan José del Coz,et al.  On the study of nearest neighbor algorithms for prevalence estimation in binary problems , 2013, Pattern Recognit..

[20]  Aron Culotta,et al.  Learning from noisy label proportions for classifying online social data , 2017, Social Network Analysis and Mining.