Proliferation of atmospheric datasets can hinder policy making: a data blending technique offers a solution

The proliferation of atmospheric datasets is a key outcome from the continued development and advancement of our collective scientific understanding. Yet often datasets describing ostensibly identical processes or atmospheric variables provide widely varying results. As an example, we analyze several datasets representing rainfall over Nepal. We show that estimates of extreme rainfall are highly variable depending on which dataset you choose to look at. This leads to confusion and inaction from policy-focused decision makers. Scientifically, we should use datasets that sample a range of creation methodologies and prioritize the use of data science techniques that have the flexibility to incorporate these multiple sources of data. We demonstrate the use of a statistically interpretable data blending technique to help discern and communicate a consensus result, rather than imposing a priori judgment on the choice of dataset, for the benefit of policy decision making.

[1]  J. Lelieveld,et al.  A data integration framework for spatial interpolation of temperature observations using climate model data , 2023, PeerJ.

[2]  D. Barker,et al.  IMDAA: High Resolution Satellite-era Reanalysis for the Indian Monsoon Region , 2021, Journal of Climate.

[3]  E. Markowitz,et al.  Climate Decision-Making , 2020, Annual Review of Environment and Resources.

[4]  D. Scherer,et al.  WRF‐based dynamical downscaling of ERA5 reanalysis data for High Mountain Asia: Towards a new version of the High Asia Refined analysis , 2020, International Journal of Climatology.

[5]  J. Thepaut,et al.  The ERA5 global reanalysis , 2020, Quarterly Journal of the Royal Meteorological Society.

[6]  Eric J. Pedersen,et al.  Hierarchical generalized additive models in ecology: an introduction with mgcv , 2019, PeerJ.

[7]  Eric F. Wood,et al.  MSWEP V2 Global 3-Hourly 0.1° Precipitation: Methodology and Quantitative Assessment , 2018, Bulletin of the American Meteorological Society.

[8]  G. Balsamo,et al.  Precipitation over Monsoon Asia: A Comparison of Reanalyses and Observations , 2017 .

[9]  Adam A. Scaife,et al.  Global Seasonal forecast system version 5 (GloSea5): a high‐resolution seasonal forecast system , 2015 .

[10]  A. Kitoh,et al.  APHRODITE: Constructing a Long-Term Daily Gridded Precipitation Dataset for Asia Based on a Dense Network of Rain Gauges , 2012 .

[11]  Alan Y. Chiang,et al.  Generalized Additive Models: An Introduction With R , 2007, Technometrics.

[12]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[13]  S. Wood Low‐Rank Scale‐Invariant Tensor Product Smooths for Generalized Additive Mixed Models , 2006, Biometrics.

[14]  J. Grego,et al.  Fast stable direct fitting and smoothness selection for generalized additive models , 2006, 0709.3906.

[15]  A. Raftery,et al.  Inference for Deterministic Simulation Models: The Bayesian Melding Approach , 2000 .

[16]  Aaron M. McCright,et al.  Challenging global warming as a social problem: An analysis of the conservative movement's counter-claims , 2000 .

[17]  Stefan Sperlich,et al.  Generalized Additive Models , 2014 .

[18]  Adaptation to Climate Change in the Hydroelectricity Sector in Nepal , 2017 .

[19]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[20]  Fabian Scheipl,et al.  Straightforward intermediate rank tensor product smoothing in mixed models , 2012, Statistics and Computing.