The Indirect Method: Inference Based on Intermediate Statistics—A Synthesis and Examples

This article presents an exposition and synthesis of the theory and some applications of the so-called indirect method of inference. These ideas have been exploited in the field of econometrics, but less so in other fields such as biostatistics and epidemiology. In the indirect method, statistical inference is based on an intermediate statistic, which typically follows an asymptotic normal distribution, but is not necessarily a consistent estimator of the parameter of interest. This intermediate statistic can be a naive estimator based on a convenient but misspecified model, a sample moment or a solution to an estimating equation. We review a procedure of indirect inference based on the generalized method of moments, which involves adjusting the naive estimator to be consistent and asymptotically normal. The objective function of this procedure is shown to be interpretable as an “indirect likelihood” based on the intermediate statistic. Many properties of the ordinary likelihood function can be extended to this indirect likelihood. This method is often more convenient computationally than maximum likelihood estimation when handling such model complexities as random effects and measurement error, for example, and it can also serve as a basis for robust inference and model selection, with less stringent assumptions on the data generating mechanism. Many familiar estimation techniques can be viewed as examples of this approach. We describe applications to measurement error, omitted covariates and recurrent events. A dataset concerning prevention of mammary tumors in rats is analyzed using a Poisson regression model with overdispersion. A second dataset from an epidemiological study is analyzed using a logistic regression model with mismeasured covariates. A third dataset of exam scores is used to illustrate robust covariance selection in graphical models.

[1]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[2]  C. L. Chiang ON REGULAR BEST ASYMPTOTICALLY NORMAL ESTIMATES , 1956 .

[3]  L. L. Cam,et al.  On the Asymptotic Theory of Estimation and Testing Hypotheses , 1956 .

[4]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[5]  Thomas S. Ferguson,et al.  A Method of Generating Best Asymptotically Normal Estimates with Application to the Estimation of Bacterial Densities , 1958 .

[6]  Calyampudi Radhakrishna Rao,et al.  Linear Statistical Inference and its Applications , 1967 .

[7]  R. Berk,et al.  Limiting Behavior of Posterior Distributions when the Model is Incorrect , 1966 .

[8]  Calyampudi R. Rao,et al.  Linear statistical inference and its applications , 1965 .

[9]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[10]  F. Hampel Contributions to the theory of robust estimation , 1968 .

[11]  J. Hájek A characterization of limiting distributions of regular estimates , 1970 .

[12]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[13]  P. Lachenbruch Mathematical Statistics, 2nd Edition , 1972 .

[14]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[15]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[16]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[17]  Robert V. Foutz,et al.  The Performance of the Likelihood Ratio Test When the Model is Incorrect , 1977 .

[18]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[19]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[20]  J. Hausman Specification tests in econometrics , 1978 .

[21]  J. B. Ramsey,et al.  Estimating Mixtures of Normal Distributions and Switching Regressions , 1978 .

[22]  B D Goldstein,et al.  The relation between respiratory illness in primary schoolchildren and the use of gas for cooking--III. Nitrogen dioxide, respiratory illness and lung infection. , 1979, International journal of epidemiology.

[23]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[24]  T. Santner,et al.  An analysis of comparative carcinogenesis experiments based on multiple times to tumor. , 1980, Biometrics.

[25]  R. Serfling Approximation Theorems of Mathematical Statistics , 1980 .

[26]  Peter Schmidt,et al.  An Improved Version of the Quandt-Ramsey MGE Estimator for Mixtures of Normal Distributions and Switching Regressions , 1982 .

[27]  J. Kent Robust properties of likelihood ratio tests , 1982 .

[28]  L. Hansen Large Sample Properties of Generalized Method of Moments Estimators , 1982 .

[29]  David R. Cox,et al.  Some remarks on overdispersion , 1983 .

[30]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[31]  M. Gail,et al.  Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates , 1984 .

[32]  Martin Crowder,et al.  Gaussian Estimation for Correlated Binomial Data , 1985 .

[33]  W. Newey,et al.  Large sample estimation and hypothesis testing , 1986 .

[34]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[35]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[36]  J A Stolwijk,et al.  Assessment of exposure to indoor air contaminants from combustion sources: methodology and application. , 1986, American journal of epidemiology.

[37]  Alice S. Whittemore,et al.  Approximations for Regression with Covariate Measurement Error , 1988 .

[38]  Wayne A. Fuller,et al.  Measurement Error Models , 1988 .

[39]  L. J. Wei,et al.  Regression analysis of multivariate incomplete failure time data by modeling marginal distributions , 1989 .

[40]  D. Pollard,et al.  Simulation and the Asymptotics of Optimization Estimators , 1989 .

[41]  D. McFadden A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration , 1989 .

[42]  Norman E. Breslow,et al.  Tests of Hypotheses in Overdispersed Poisson Regression and other Quasi-Likelihood Models , 1990 .

[43]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for measurement error: the case of multiple covariates measured with error. , 1990, American journal of epidemiology.

[44]  Nanny Wermuth,et al.  An approximation to maximum likelihood estimates in reduced models , 1990 .

[45]  W. Newey,et al.  Kernel Estimation of Partial Means and a General Variance Estimator , 1994, Econometric Theory.

[46]  P. Sen,et al.  Large sample methods in statistics , 1993 .

[47]  Alain Monfort,et al.  Simulation-based inference: A survey with special reference to panel data models , 1993 .

[48]  John A. Nelder,et al.  Generalized linear models. 2nd ed. , 1993 .

[49]  A Kong,et al.  Asymptotic theory for gene mapping. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Roderick J. A. Little,et al.  A Class of Pattern-Mixture Models for Normal Incomplete Data , 1994 .

[51]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[52]  J. Klein,et al.  Statistical Models Based On Counting Process , 1994 .

[53]  A. Rotnitzky,et al.  A note on the bias of estimators with missing data. , 1994, Biometrics.

[54]  A. Kuk Asymptotically Unbiased Estimation in Generalized Linear Models with Random Effects , 1995 .

[55]  David Draper,et al.  Assessment and Propagation of Model Uncertainty , 2011 .

[56]  Jerald F. Lawless,et al.  Some Simple Robust Methods for the Analysis of Recurrent Events , 1995 .

[57]  A. Gallant,et al.  Which Moments to Match? , 1995, Econometric Theory.

[58]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[59]  D. Hand,et al.  Practical Longitudinal Data Analysis , 1996 .

[60]  Bruce W. Turnbull,et al.  Effects of Selenium Supplementation for Cancer Prevention in Patients With Carcinoma of the Skin: A Randomized Controlled Trial , 1996 .

[61]  Adrian Pagan,et al.  Estimation, Inference and Specification Analysis. , 1996 .

[62]  B W Turnbull,et al.  Effects of selenium supplementation for cancer prevention in patients with carcinoma of the skin. A randomized controlled trial. Nutritional Prevention of Cancer Study Group. , 1996, JAMA.

[63]  James G. MacKinnon,et al.  Approximate bias correction in econometrics , 1998 .

[64]  A. Gallant,et al.  Estimating stochastic differential equations efficiently by minimum chi-squared , 1997 .

[65]  B. Turnbull,et al.  Regression models for recurrent event data: parametric random effects models with measurement error. , 1997, Statistics in medicine.

[66]  A. Kong,et al.  Linkage mapping in experimental crosses: the robustness of single-gene models. , 1997, Genetics.

[67]  A. Dawid Conditional Independence , 1997 .

[68]  Pseudo-maximum likelihood method, adjusted pseudo-maximum likelihood method and covariance estimators , 1998 .

[69]  Confidence intervals for gene location - The effect of model misspecification and smoothing , 1998 .

[70]  Bruce W. Turnbull,et al.  Semiparametric Regression Models for Repeated Events with Random Effects and Measurement Error , 1999 .

[71]  Enno Mammen,et al.  The Existence and Asymptotic Properties of a Backfitting Projection Algorithm Under Weak Conditions , 1999 .

[72]  A. Ronald Gallant,et al.  The relative efficiency of method of moments estimators , 1999 .

[73]  Laszlo Matyas Generalized Method of Moments Estimation: Preface , 1999 .

[74]  B. Lindsay,et al.  Improving generalised estimating equations using quadratic inference functions , 2000 .

[75]  M. Genton,et al.  Robust simulation-based estimation , 2000 .

[76]  Elvezio Ronchetti,et al.  Robust Indirect Inference , 2003 .

[77]  Martin Crowder,et al.  On repeated measures analysis with misspecified covariance structure , 2001 .

[78]  E. Ronchetti,et al.  Robust inference with GMM estimators , 2001 .

[79]  M. Genton,et al.  Robust Simulation-Based Estimation of ARMA Models , 2001 .

[80]  Marc G. Genton,et al.  Simulation-based inference for simultaneous processes on regular lattices , 2002, Stat. Comput..

[81]  Jean-Pierre Florens,et al.  Simulation-Based Method of Moments and Efficiency , 2002 .

[82]  W. Jiang,et al.  The Indirect Method - Robust Inference Based on Intermediate Statistics , 2003 .

[83]  R. Fisher Statistical methods for research workers , 1927, Protoplasma.

[84]  Nicolas W. Hengartner,et al.  Rate optimal estimation with the integration method in the presence of many covariates , 2005 .