Generalized Bayes Quantification Learning under Dataset Shift

Quantification learning is the task of estimating class prevalence in a test population using predictions from a classifier trained on a different population. Existing quantification methods assume that the classifier's sensitivities and specificities are either perfect or transportable from the training population to the test population. These assumptions are inappropriate under dataset shift, when the misclassification rates in the training population are not representative of those in the test population. Quantification under dataset shift has so far been addressed only for single-class (categorical) predictions, and only under the assumption that the true labels are known perfectly for a small subset of the test population. We propose generalized Bayes quantification learning (GBQL), which uses the entire compositional predictions from probabilistic classifiers and allows for uncertainty in the true class labels of the limited labeled test data. Instead of positing a full model, we use a model-free Bayesian estimating equation approach to compositional data that relies only on a first-moment assumption. This idea should be useful in Bayesian compositional data analysis more generally, as it is robust to different generating mechanisms for compositional data and includes categorical outputs as a special case. We show that our method yields existing quantification approaches as special cases. We also discuss an extension to an ensemble GBQL that uses predictions from multiple classifiers and yields inference that is robust to the inclusion of a poor classifier. We outline a fast and efficient Gibbs sampler that uses a rounding and coarsening approximation to the loss functions. We establish posterior consistency, asymptotic normality, and valid coverage of interval estimates from GBQL, as well as a finite-sample posterior concentration rate. The empirical performance of GBQL is demonstrated through simulations and an analysis of real data with evident dataset shift.
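
The first-moment idea can be illustrated with a minimal sketch. This is not the GBQL posterior or its Gibbs sampler, only a point-estimate illustration under stated assumptions: the function name `moment_prevalence`, the matrix `M`, and all numbers below are hypothetical. It assumes that row i of `M` holds the expected predicted composition for a case whose true class is i, so that the mean predicted composition equals `M.T @ p` for the true prevalence vector `p`. Solving that linear system corrects the naive estimate obtained by averaging the classifier outputs.

```python
import numpy as np

def moment_prevalence(mean_pred, M):
    """Solve mean_pred ~= M.T @ p by least squares, then clip and
    renormalize so the estimate lies on the probability simplex.

    mean_pred: average predicted composition over the test set (length C).
    M: C x C matrix (assumed known here); row i is the expected predicted
       composition when the true class is i.
    """
    mean_pred = np.asarray(mean_pred, dtype=float)
    M = np.asarray(M, dtype=float)
    p, *_ = np.linalg.lstsq(M.T, mean_pred, rcond=None)
    p = np.clip(p, 0.0, None)
    total = p.sum()
    if total == 0.0:
        raise ValueError("estimate collapsed to zero; check M and mean_pred")
    return p / total

# Hypothetical two-class example.
M = np.array([[0.8, 0.2],
              [0.3, 0.7]])
true_p = np.array([0.3, 0.7])

# Mean predicted composition implied by the first-moment assumption.
mean_pred = M.T @ true_p

naive = mean_pred                       # averaging the classifier outputs
corrected = moment_prevalence(mean_pred, M)
print("naive:", naive)
print("corrected:", corrected)
```

In this toy setting the naive average is biased toward the first class, while inverting the first-moment relationship recovers `true_p`. In practice `M` is unknown and must be learned from the limited labeled test data, which is where a Bayesian treatment with uncertainty quantification becomes relevant.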
