SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning

Progress toward the United Nations Sustainable Development Goals (SDGs) has been hindered by a lack of data on key environmental and socioeconomic indicators, which historically have come from ground surveys with sparse temporal and spatial coverage. Recent advances in machine learning have made it possible to use abundant, frequently updated, and globally available data, such as data from satellites or social media, to provide insights into progress toward the SDGs. Despite promising early results, approaches to using such data for SDG measurement have thus far largely been evaluated on different datasets or with inconsistent evaluation metrics, making it hard to tell whether performance is improving and where additional research would be most fruitful. Furthermore, processing satellite and ground survey data requires domain knowledge that many in the machine learning community lack. In this paper, we introduce SustainBench, a collection of 15 benchmark tasks across 7 SDGs, including tasks related to economic development, agriculture, health, education, water and sanitation, climate action, and life on land. Datasets for 11 of the 15 tasks are released publicly for the first time. Our goals for SustainBench are to (1) lower the barriers to entry for the machine learning community to contribute to measuring and achieving the SDGs; (2) provide standard benchmarks for evaluating machine learning models on tasks across a variety of SDGs; and (3) encourage the development of novel machine learning methods where improved model performance facilitates progress toward the SDGs.
