Marginal maximum likelihood estimation of SAR models with missing data

Maximum likelihood (ML) estimation of simultaneous autocorrelation models is well known. Under the presence of missing data, estimation is not straightforward, due to the implied dependence of all units. The EM algorithm is the standard approach to accomplish ML estimation in this case. An alternative approach is considered, the method of maximising the marginal likelihood. At first glance the method is computationally complex due to inversion of large matrices that are of the same size as the complete data, but these can be avoided, leading to an algorithm that is usually much faster than the EM algorithm and without typical EM convergence issues. Another approximate method is also proposed that serves as an alternative, for example when the contiguity matrix is dense. The methods are illustrated using a well known data set on house prices with 25,357 units.

[1]  J. LeSage Introduction to spatial econometrics , 2009 .

[2]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[3]  K. Ord Estimation Methods for Models of Spatial Interaction , 1975 .

[4]  Roger Bivand,et al.  Approximate Bayesian inference for spatial econometrics models , 2014 .

[5]  Takafumi Kato,et al.  Usefulness of the Information Contained in the Prediction Sample for the Spatial Error Model , 2013 .

[6]  Roger Bivand,et al.  Comparing Estimation Methods for Spatial Econometrics Techniques Using R , 2010 .

[7]  Noel Cressie,et al.  One-step estimation of spatial dependence parameters: Properties and extensions of the APLE statistic , 2012, J. Multivar. Anal..

[8]  James P. LeSage,et al.  Models for Spatially Dependent Missing Data , 2004 .

[9]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[10]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[11]  D. Rubinfeld,et al.  Hedonic housing prices and the demand for clean air , 1978 .

[12]  Christine Thomas-Agnan,et al.  About predictions in spatial autoregressive models: optimal and almost optimal strategies , 2017 .

[13]  D. Bates,et al.  Newton-Raphson and EM Algorithms for Linear Mixed-Effects Models for Repeated-Measures Data , 1988 .

[14]  Hyunjoong Kim,et al.  Functional Analysis I , 2017 .

[15]  D. Harville Matrix Algebra From a Statistician's Perspective , 1998 .

[16]  A. Zammit‐Mangion,et al.  Computational aspects of the EM algorithm for spatial econometric models with missing data , 2017 .

[17]  Lung-fei Lee,et al.  Estimation of Spatial Autoregressive Models with Randomly Missing Data in the Dependent Variable , 2013 .

[18]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[19]  Roger Bivand,et al.  A New Latent Class to Fit Spatial Econometrics Models with Integrated Nested Laplace Approximations , 2015 .