Combining information for heterogeneous studies and rare events studies: A confidence distribution approach

OF THE DISSERTATION Combining Information for Heterogeneous Studies and Rare Events Studies: a Confidence Distribution Approach by Dungang Liu Dissertation Director: Regina Liu and Minge Xie This dissertation develops efficient statistical methodologies for combining information from independent sources. The developments focus on two settings where the studies are heterogeneous or the studies involve rare events. In these settings, the conventional combining approaches often lead to inefficient or even invalid statistical inference. In this dissertation, we propose effective and efficient combining approaches using confidence distributions. The proposed approaches are justified both theoretically and numerically. They are also shown to be superior to the conventional approaches. Combining information from multiple studies, often referred to as meta-analysis in the literature, has been used extensively in many fields, including health sciences, social sciences, and others. However, there remain many unresolved problems on how to effectively and efficiently combine information. For example, • Heterogeneous studies – When the effect of interest is not estimable in heterogeneous studies (e.g., indirect evidence), how can we utilize these studies to

[1]  M. Kendall,et al.  The advanced theory of statistics , 1945 .

[2]  Petter Laake,et al.  Recommended tests for association in 2×2 tables , 2009, Statistics in medicine.

[3]  Tianxi Cai,et al.  The Highest Confidence Density Region and Its Usage for Joint Inferences about Constrained Parameters , 2011, Biometrics.

[4]  Sanjay Kaul,et al.  Uncertain Effects of Rosiglitazone on the Risk for Myocardial Infarction and Cardiovascular Death , 2007, Annals of Internal Medicine.

[5]  Norman E. Breslow,et al.  Odds ratio estimators when the data are sparse , 1981 .

[6]  Cyrus R. Mehta,et al.  Computing an Exact Confidence Interval for the Common Odds Ratio in Several 2×2 Contingency Tables , 1985 .

[7]  Lu Tian,et al.  Exact and efficient inference procedure for meta-analysis and its application to the analysis of independent 2 x 2 tables with all available data but without artificial continuity correction. , 2009, Biostatistics.

[8]  Ingram Olkin,et al.  Why add anything to nothing? The arcsine difference as a measure of treatment effect in meta‐analysis with zero cells , 2009, Statistics in medicine.

[9]  S. Nissen,et al.  Effect of rosiglitazone on the risk of myocardial infarction and death from cardiovascular causes. , 2007, The New England journal of medicine.

[10]  S L Normand,et al.  Meta-analysis: formulating, evaluating, combining, and reporting. , 1999, Statistics in medicine.

[11]  A. W. van der Vaart,et al.  On Profile Likelihood , 2000 .

[12]  B. Efron Empirical Bayes Methods for Combining Likelihoods , 1996 .

[13]  R. Fisher,et al.  Statistical Methods for Research Workers , 1930, Nature.

[14]  Minge Xie,et al.  Asymptotics for generalized estimating equations with large cluster sizes , 2003 .

[15]  Alan Agresti,et al.  Unconditional small-sample confidence intervals for the odds ratio. , 2002, Biostatistics.

[16]  I Olkin,et al.  Comparison of meta-analysis versus analysis of variance of individual patient data. , 1998, Biometrics.

[17]  Julian P T Higgins,et al.  Recent developments in meta‐analysis , 2008, Statistics in medicine.

[18]  S. Stouffer Adjustment during army life , 1977 .

[19]  N Breslow,et al.  Estimators of the Mantel-Haenszel variance consistent in both sparse data and large-strata limiting models. , 1986, Biometrics.

[20]  Regina Y. Liu,et al.  Multivariate analysis by data depth: descriptive statistics, graphics and inference, (with discussion and a rejoinder by Liu and Singh) , 1999 .

[21]  R. Fisher Statistical Methods for Research Workers , 1971 .

[22]  Jenő Reiczigel,et al.  An exact confidence set for two binomial proportions and exact unconditional confidence intervals for the difference and ratio of proportions , 2008, Comput. Stat. Data Anal..

[23]  A. Agresti [A Survey of Exact Inference for Contingency Tables]: Rejoinder , 1992 .

[24]  John J. Gart,et al.  Point and interval estimation of the common odds ratio in the combination of 2 × 2 tables with fixed marginals , 1970 .

[25]  A Whitehead,et al.  Combining summaries of binary outcomes with those of continuous outcomes in a meta-analysis. , 1999, Journal of biopharmaceutical statistics.

[26]  Joseph L. Gastwirth,et al.  Statistical Methods for Analyzing Claims of Employment Discrimination , 1984 .

[27]  Minge Xie,et al.  Confidence Distributions and a Unifying Framework for Meta-Analysis , 2011 .

[28]  M C Simmonds,et al.  Covariate heterogeneity in meta‐analysis: Criteria for deciding between meta‐regression and individual patient data , 2007, Statistics in medicine.

[29]  L. Fahrmeir,et al.  Multivariate statistical modelling based on generalized linear models , 1994 .

[30]  Alex J. Sutton,et al.  Multiparameter evidence synthesis in epidemiology and medical decision‐making: current approaches , 2006 .

[31]  N. Hjort,et al.  Confidence and Likelihood * , 2002 .

[32]  H. O. Lancaster,et al.  Significance Tests in Discrete Distributions , 1961 .

[33]  Donald Fraser,et al.  Statistical Inference: Likelihood to Significance , 1991 .

[34]  R. D. Boschloo Raised conditional level of significance for the 2 × 2‐table when testing the equality of two probabilities , 1970 .

[35]  Alexander J Sutton,et al.  What to add to nothing? Use and avoidance of continuity corrections in meta-analysis of sparse data. , 2004, Statistics in medicine.

[36]  Jonathan J Deeks,et al.  Much ado about nothing: a comparison of the performance of meta‐analytical methods with rare events , 2007, Statistics in medicine.

[37]  B. Efron Bayes and likelihood calculations from confidence intervals , 1993 .

[38]  K. Singh,et al.  Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review , 2013 .

[39]  D. Zeng,et al.  On the relative efficiency of using summary statistics versus individual-level data in meta-analysis. , 2010, Biometrika.

[40]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[41]  Ming-Chung Yang,et al.  AN OPTIMALITY THEORY FOR MID p-VALUES IN 2 × 2 CONTINGENCY TABLES , 2001 .

[42]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[43]  S. Kou,et al.  ASYMPTOTICS FOR A 2 × 2T ABLE WITH FIXED MARGINS , 1996 .

[44]  T. Mathew,et al.  On the Equivalence of Meta‐Analysis Using Literature and Using Individual Patient Data , 1999, Biometrics.

[45]  Giovanni Parmigiani,et al.  Combining Studies with Continuous and Dichotomous Responses: A Latent-Variables Approach , 2000 .

[46]  D. Cox,et al.  A note on pseudolikelihood constructed from marginal densities , 2004 .

[47]  E. Andersen The Asymptotic Distribution of Conditional Likelihood Ratio Tests , 1971 .

[48]  Jonathan J Shuster,et al.  How conservative is Fisher's exact test? A quantitative evaluation of the two‐sample comparative binomial trial , 2008, Statistics in medicine.

[49]  Samy Suissa,et al.  Exact unconditional sample sizes for the 2×2 binomial trial , 1985 .

[50]  Minge Xie,et al.  Combining information from independent sources through confidence distributions , 2005, math/0504507.

[51]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .