Judicious Judgment Meets Unsettling Updating: Dilation, Sure Loss and Simpson’s Paradox

Statistical learning using imprecise probabilities is gaining more attention because it presents an alternative strategy for reducing irreplicable findings by freeing the user from the task of making up unwarranted high-resolution assumptions. However, model updating as a mathematical operation is inherently exact, hence updating imprecise models requires the user's judgment in choosing among competing updating rules. These rules often lead to incompatible inferences, and can exhibit unsettling phenomena like dilation, contraction and sure loss, which cannot occur with the Bayes rule and precise probabilities. We revisit a number of famous "paradoxes", including the three prisoners/Monty Hall problem, revealing that a logical fallacy arises from a set of marginally plausible yet jointly incommensurable assumptions when updating the underlying imprecise model. We establish an equivalence between Simpson's paradox and an implicit adoption of a pair of aggregation rules that induce sure loss. We also explore behavioral discrepancies between the generalized Bayes rule, Dempster's rule and the Geometric rule as alternative posterior updating rules for Choquet capacities of order 2. We show that both the generalized Bayes rule and Geometric rule are incapable of updating without prior information regardless of how strong the information in our data is, and that Dempster's rule and the Geometric rule can mathematically contradict each other with respect to dilation and contraction. Our findings show that unsettling updates reflect a collision between the rules' assumptions and the inexactness allowed by the model itself, highlighting the invaluable role of judicious judgment in handling low-resolution information, and the care we must take when applying learning rules to update imprecise probabilities.

[1]  Peter Walley,et al.  Towards a unified theory of imprecise probability , 2000, Int. J. Approx. Reason..

[2]  Todd Iverson,et al.  Generalized fiducial inference , 2014 .

[3]  Michael D. Perlman,et al.  How Likely Is Simpson’s Paradox? , 2009 .

[4]  Randy C. S. Lai,et al.  Generalized Fiducial Inference: A Review , 2015 .

[5]  Larry Wasserman,et al.  Divisive Conditioning: Further Results on Dilation , 1997, Philosophy of Science.

[6]  Ram C. Dahiya,et al.  Let's Make a Deal: The Player's Dilemma , 1991 .

[7]  Minge Xie,et al.  A note on Dempster-Shafer recombination of confidence distributions , 2012 .

[8]  C. Blyth On Simpson's Paradox and the Sure-Thing Principle , 1972 .

[9]  Ronald Fagin,et al.  A new approach to updating beliefs , 1990, UAI.

[10]  L. Wasserman,et al.  Bayes' Theorem for Choquet Capacities , 1990 .

[11]  Randy C. S. Lai,et al.  Generalized Fiducial Inference: A Review and New Results , 2016 .

[12]  Enrique Miranda,et al.  Coherent updating of non-additive measures , 2015, Int. J. Approx. Reason..

[13]  Philippe Smets About Updating , 1991, UAI.

[14]  Chuanhai Liu,et al.  Inferential Models: Reasoning with Uncertainty , 2015 .

[15]  Keli Liu,et al.  Comment: A Fruitful Resolution to Simpson’s Paradox via Multiresolution Inference , 2014 .

[16]  Philippe Smets,et al.  Belief functions: The disjunctive rule of combination and the generalized Bayesian theorem , 1993, Int. J. Approx. Reason..

[17]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[18]  A. Gelman The Boxer, the Wrestler, and the Coin Flip , 2006 .

[19]  Hung T. Nguyen,et al.  On Random Sets and Belief Functions , 1978, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[20]  R. Yager On the dempster-shafer framework and new combination rules , 1987, Inf. Sci..

[21]  Daniel F. Heitjan,et al.  Ignorability in general incomplete-data models , 1994 .

[22]  Mendel Fygenson,et al.  MODELING AND PREDICTING EXTRAPOLATED PROBABILITIES WITH OUTLOOKS , 2008 .

[23]  Lotfi A. Zadeh,et al.  Review of A Mathematical Theory of Evidence , 1984 .

[24]  Frederick Mosteller,et al.  Fifty Challenging Problems in Probability with Solutions , 1987 .

[25]  Henry E. Kyburg,et al.  Bayesian and Non-Bayesian Evidential Updating , 1987, Artificial Intelligence.

[26]  Jürg Kohlas The reliability of reasoning with unreliable arguments , 1991, Ann. Oper. Res..

[27]  Judea Pearl,et al.  Reasoning with belief functions: An analysis of compatibility , 1990, Int. J. Approx. Reason..

[28]  Sandy L. Zabell,et al.  Some Alternatives to Bayes' Rule. , 1983 .

[29]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[30]  Gregory Wheeler,et al.  Demystifying Dilation , 2014 .

[31]  Joseph K. Blitzstein,et al.  Conditional probability , 2018, Fat Chance.

[32]  Glenn Shafer,et al.  Allocations of Probability , 1979, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[33]  Larry Wasserman,et al.  Dilation for Sets of Probabilities , 1993 .

[34]  Scott Ferson,et al.  Satellite conjunction analysis and the false confidence theorem , 2017, Proceedings of the Royal Society A.

[35]  Kosuke Imai,et al.  Survey Sampling , 1998, Nov/Dec 2017.

[36]  L. Wasserman,et al.  The Extent of Dilation of Sets of Probabilities and the Asymptotics of Robust Bayesian Inference , 1994, PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.

[37]  P. J. Huber,et al.  Minimax Tests and the Neyman-Pearson Lemma for Capacities , 1973 .

[38]  R. Bass,et al.  Review: P. Billingsley, Convergence of probability measures , 1971 .

[39]  Patrick Suppes,et al.  On using random relations to generate upper and lower probabilities , 1977, Synthese.

[40]  Xiao-Li Meng,et al.  There is Individualized Treatment. Why Not Individualized Inference , 2015, 1510.08539.

[41]  Tyler J. VanderWeele,et al.  Sensitivity Analysis Without Assumptions , 2015, Epidemiology.

[42]  Ronald R. Yager,et al.  Classic Works of the Dempster-Shafer Theory of Belief Functions , 2010, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[43]  Xiao-Li Meng,et al.  I Got More Data, My Model is More Refined, but My Estimator is Getting Worse! Am I Just Dumb? , 2014 .

[44]  R. Kruse,et al.  SPECIALIZATION—A NEW CONCEPT FOR UNCERTAINTY HANDLING WITH BELIEF FUNCTIONS , 1990 .

[45]  K. Singh,et al.  Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review , 2013 .

[46]  E. H. Simpson,et al.  The Interpretation of Interaction in Contingency Tables , 1951 .

[47]  E. C. Hammond,et al.  Smoking and lung cancer: recent evidence and a discussion of some questions. , 1959, Journal of the National Cancer Institute.

[48]  Larry Wasserman,et al.  Prior Envelopes Based on Belief Functions , 1990 .

[49]  Arthur P. Dempster,et al.  Upper and Lower Probabilities Induced by a Multivalued Mapping , 1967, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[50]  D. Rubin,et al.  Ignorability and Coarse Data , 1991 .

[51]  Irving John Good,et al.  A LITTLE LEARNING CAN BE DANGEROUS* , 1974, The British Journal for the Philosophy of Science.

[52]  P. Walley Statistical Reasoning with Imprecise Probabilities , 1990 .

[53]  Jean-Yves Jaffray Bayesian Updating and Belief Functions , 2008, Classic Works of the Dempster-Shafer Theory of Belief Functions.