Fast inversion of a flexible regression model for multivariate pollen counts data

We introduce a rich class of models π(Y | C,θ) for multivariate constrained, zero-inflated counts data. We use new methodology for fast Bayesian inference on unobserved θ given matched training data and propose a new algorithm for fast inversion to π(C | Y). Such models arise in palaeoclimate reconstruction, where Y represents vector of counts of a proxy such as pollen and C represents climate. Subsequently, is a vector of counts in a sample reflecting ancient pollen rain and thus ancient climate (the palaeoclimate). The methodology applies in principle to palaeoclimate reconstruction from other proxy types (e.g. chironomids, diatoms, testate amoebae). Furthermore, the generic issue—the statistical inversion of a multivariate relationship—is found in many areas of application (e.g. clustering, supervised classification, medical imaging, oil shale modelling). The contribution of the paper is the application of a Nested-Dirichlet–Multinomial model in conjunction with models for zero inflation and non-parametric response surface modelling. We present the models and the tools to perform fast Bayesian inference on the forward problem and inversion, by using integrated nested Laplace approximations. We demonstrate the increase in precision associated with including more climate proxies and the increase in error associated with doing this naively. Copyright © 2012 John Wiley & Sons, Ltd.

[1]  V. Garreta Bayesian approach of pollen-based palaeoclimate reconstructions: Toward the modelling of ecological processes , 2010 .

[2]  J. Haslett,et al.  A simple monotone process with application to radiocarbon‐dated depth chronologies , 2008 .

[3]  Christian Ohlwein,et al.  Review of probabilistic pollen-climate transfer methods , 2012 .

[4]  Diane Lambert,et al.  Zero-inflacted Poisson regression, with an application to defects in manufacturing , 1992 .

[5]  Brian Huntley,et al.  Weichselian palynostratigraphy, palaeovegetation and palaeoenvironment; the record from Lago Grande di Monticchio, southern Italy , 2000 .

[6]  Heikki Mannila,et al.  APPLYING BAYESIAN STATISTICS TO ORGANISM-BASED ENVIRONMENTAL RECONSTRUCTION , 2001 .

[7]  C. Paciorek,et al.  Mapping Ancient Forests: Bayesian Inference for Spatio-Temporal Trends in Forest Composition Using the Fossil Pollen Proxy Record , 2009, Journal of the American Statistical Association.

[8]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[9]  Robert J. Connor,et al.  Concepts of Independence for Proportions with a Generalization of the Dirichlet Distribution , 1969 .

[10]  Guo-Liang Tian,et al.  Bayesian computation for contingency tables with incomplete cell-counts , 2003 .

[11]  Thomas Kneib,et al.  Mixed model based inference in structured additive regression , 2006 .

[12]  Brian Huntley,et al.  The use of climate response surfaces to reconstruct palaeoclimate from Quaternary pollen and plant macrofossil data , 1993 .

[13]  Thomas Brendan Murphy,et al.  Application of Compositional Models for Glycan HILIC Data , 2011 .

[14]  A. Mackay,et al.  A Bayesian palaeoenvironmental transfer function model for acidified lakes , 2008 .

[15]  Simon P. Wilson,et al.  Bayesian palaeoclimate reconstruction , 2006 .

[16]  Tzu-Tsung Wong,et al.  Generalized Dirichlet distribution in Bayesian analysis , 1998, Appl. Math. Comput..

[17]  James V. Zidek,et al.  Editorial: (Post‐normal) statistical science , 2006 .

[18]  C. Buck,et al.  A flexible approach to assessing synchroneity of past events using Bayesian reconstructions of sedimentation history , 2008 .

[19]  Michael Salter-Townshend,et al.  Fast approximate inverse Bayesian inference in non-parametric multivariate regression with application to palaeoclimate reconstruction , 2009 .

[20]  Earl Lawrence,et al.  Using the Bayesian Framework to Combine Simulations and Physical Observations for Statistical Inference , 2010 .

[21]  Hannu Toivonen,et al.  A Bayesian multinomial Gaussian response model for organism-based environmental reconstruction , 2000 .

[22]  Hannu Toivonen,et al.  Holocene temperature changes in northern Fennoscandia reconstructed from chironomids using Bayesian modelling , 2002 .