Aspects of Composite Likelihood Estimation And Prediction

A composite likelihood is usually constructed by multiplying a collection of lower dimensional marginal or conditional densities. In recent years, composite likelihood methods have received increasing interest for modeling complex data arising from various application areas, where the full likelihood function is analytically unknown or computationally prohibitive due to the structure of dependence, the dimension of data or the presence of nuisance parameters. In this thesis we investigate some theoretical properties of the maximum composite likelihood estimator (MCLE). In particular, we obtain the limit of the MCLE in a general setting, and set out a framework for understanding the notion of robustness in the context of composite likelihood inference. We also study the improvement of the efficiency of a composite likelihood by incorporating additional component likelihoods, or by using component likelihoods with higher dimension. We show through some illustrative examples that such strategies do not always work and may impair the efficiency. We also show that the MCLE of the parameter of interest can be less efficient when the nuisance parameters are known than when they are unknown. In addition to the theoretical study on composite likelihood estimation, we also explore the possibility of using composite likelihood to make predictive inference in computer experiments. The Gaussian process model is widely used to build statistical emulators for computer experiments. However, when the number of trials is large, both estimation and prediction based on a Gaussian process can be computationally intractable due to the dimension of the covariance matrix. To address this problem, we propose prediction methods based on different composite likelihood functions, which do not require the evaluation of the large covariance matrix and hence alleviate the computational burden. Simulation studies show that the blockwise composite likelihood-based predictors perform well and are competitive with the optimal predictor based on the full likelihood.%%%%PhD

[1]  A. Wald Note on the Consistency of the Maximum Likelihood Estimate , 1949 .

[2]  Nils Lid Hjort,et al.  ML, PL, QL in Markov Chain Models , 2008 .

[3]  J. Kent Robust properties of likelihood ratio tests , 1982 .

[4]  D. Nychka,et al.  Covariance Tapering for Interpolation of Large Spatial Datasets , 2006 .

[5]  Luigi Pace,et al.  ADJUSTING COMPOSITE LIKELIHOOD RATIO STATISTICS , 2009 .

[6]  P. Song,et al.  Composite Likelihood Bayesian Information Criteria for Model Selection in High-Dimensional Data , 2010 .

[7]  Jianhua Z. Huang,et al.  A full scale approximation of covariance functions for large spatial data sets , 2012 .

[8]  Thomas J. Santner,et al.  The Design and Analysis of Computer Experiments , 2003, Springer Series in Statistics.

[9]  Richard A. Davis,et al.  COMMENTS ON PAIRWISE LIKELIHOOD IN TIME SERIES MODELS , 2011 .

[10]  N. Reid,et al.  On the robustness of maximum composite likelihood estimate , 2011 .

[11]  Carlo Gaetan,et al.  Composite likelihood methods for space-time data , 2006 .

[12]  Geert Molenberghs,et al.  PSEUDO-LIKELIHOOD ESTIMATION FOR INCOMPLETE DATA , 2011 .

[13]  Richard J. Cook,et al.  A robust pairwise likelihood method for incomplete longitudinal binary data arising in clusters , 2011 .

[14]  G. Matheron Principles of geostatistics , 1963 .

[15]  G. Molenberghs,et al.  Models for Discrete Longitudinal Data , 2005 .

[16]  Zi Jin Aspects of Composite Likelihood Inference , 2011 .

[17]  C. Varin On composite marginal likelihoods , 2008 .

[18]  G. Yi,et al.  A Pairwise Likelihood Method For Correlated Binary Data With/withoutMissing Observations Under Generalized Partially Linear Single-indexModels , 2011 .

[19]  Douglas W. Nychka,et al.  Covariance Tapering for Likelihood-Based Estimation in Large Spatial Data Sets , 2008 .

[20]  D. Cox,et al.  A note on pseudolikelihood constructed from marginal densities , 2004 .

[21]  Cristiano Varin,et al.  Pairwise likelihood inference for ordinal categorical time series , 2006, Comput. Stat. Data Anal..

[22]  Volker Tresp,et al.  A Bayesian Committee Machine , 2000, Neural Computation.

[23]  N. Cressie,et al.  Fixed rank kriging for very large spatial data sets , 2008 .

[24]  Michael L. Stein,et al.  A modeling approach for large spatial datasets , 2008 .

[25]  B. Lindsay Conditional score functions: Some optimality results , 1982 .

[26]  Runze Li,et al.  Design and Modeling for Computer Experiments , 2005 .

[27]  S. Lele,et al.  A Composite Likelihood Approach to Binary Spatial Data , 1998 .

[28]  Jo Eidsvik,et al.  Estimation and Prediction in Spatial Models With Block Composite Likelihoods , 2014 .

[29]  Derek Bingham,et al.  Efficient emulators of computer experiments using compactly supported correlation functions, with an application to cosmology , 2011, 1107.0749.

[30]  Thomas B. Fomby Maximum Likelihood Estimation of Misspecified Models , 2003 .

[31]  Richard L. Smith,et al.  Asymptotic properties of computationally efficient alternative estimators for a class of multivariate normal models , 2007 .

[32]  Henry P. Wynn,et al.  Screening, predicting, and computer experiments , 1992 .

[33]  H. Joe,et al.  Composite likelihood estimation in multivariate data analysis , 2005 .

[34]  Øivind Skare,et al.  Pairwise likelihood inference in spatial generalized linear mixed models , 2005, Comput. Stat. Data Anal..

[35]  K. Mardia,et al.  Maximum likelihood estimation using composite likelihoods for closed exponential families , 2009 .

[36]  A. V. Vecchia Estimation and model identification for continuous spatial processes , 1988 .

[37]  Richard E. Chandler,et al.  Inference for clustered data using the independence loglikelihood , 2007 .

[38]  A. Kuk A Hybrid Pairwise Likelihood Method , 2007 .

[39]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[40]  Jerome Sacks,et al.  Designs for Computer Experiments , 1989 .

[41]  S. Eguchi,et al.  A paradox concerning nuisance parameters and projected estimating functions , 2004 .

[42]  G. Molenberghs,et al.  Pseudolikelihood Modeling of Multivariate Outcomes in Developmental Toxicology , 1999 .

[43]  Geert Molenberghs,et al.  A pairwise likelihood approach to estimation in multilevel probit models , 2004, Comput. Stat. Data Anal..

[44]  Harry Joe,et al.  On weighting of bivariate margins in pairwise likelihood , 2009, J. Multivar. Anal..

[45]  Bruce G. Lindsay,et al.  ISSUES AND STRATEGIES IN THE SELECTION OF COMPOSITE LIKELIHOODS , 2011 .

[46]  V. P. Godambe An Optimum Property of Regular Maximum Likelihood Estimation , 1960 .

[47]  Subhash R. Lele,et al.  A composite likelihood approach to semivariogram estimation , 1999 .