High resolution, annual maps of the characteristics of smallholder-dominated croplands at national scales

Understanding agricultural change requires reliable, frequently updated maps that describe the characteristics of croplands. Such data are often unavailable for regions dominated by smallholder agricultural systems, which are particularly challenging for remote sensing. To overcome these challenges, we designed a system to minimize several sources of error that arise when mapping smallholder croplands. To overcome errors caused by mismatches between image resolution and cropland scales, as well as persistent cloud cover, the system converts daily, 3.7 m PlanetScope imagery into two seasonal composites within a single agricultural year. To reduce errors that occur when training classifiers, we built a labelling platform that rigorously assesses label accuracy, and creates more accurate consensus labels that train a Random Forests model. The labelling platform and model interact within an active learning process that boosts the accuracy of the resulting cropland probability map, which is used in a segmentation process to delineate individual field boundaries. We applied this system to map Ghana’s croplands for the year 2018. We divided Ghana into 16 mapping regions (12,160-23,535 km^2), training separate models for each using a total of 6,299 labels, plus 1,600 for validation. Using an independent map reference sample (n=1,207), we found that overall accuracies of the resulting cropland probability and field boundary maps were 88% and 86.7%, respectively, with User’s accuracies for the cropland class of 61.2% and 78.9%, and Producer’s accuracies of 67.3% and 58.2%. Croplands covered 16.1-23.2% of the mapped area, comprising 1,131,146 total fields with an average size of 3.92 ha. Estimates based on the map reference sample indicate the cropland percentage is 17.1% (15.4-18.9%) or 17.6% (15.6-19.6%), depending on the map used to estimate the standard error. Using the labellers’ digitized field boundaries to estimate biases in field boundary statistics, we calculated an adjusted mean field size of 1.73 ha and total field count of 1,662,281. Although the cropland class contained substantial errors, the system was effective in mitigating error and quantifying resulting performance gains. By minimizing training errors, consensus labelling improved the model’s F1 scores by up to 25%, while 3 iterations of active learning increased the F1 score by 9.1%, on average, which was 2.3% higher than training models with randomly selected labels. Map accuracy can be improved by replacing Random Forests with a convolutional neural network. These results demonstrate a readily adapted, transferrable framework for developing high resolution, annual, nation-scale maps that provide important details about smallholder-dominated croplands.

[1]  P. D’Odorico,et al.  Food appropriation through large scale land acquisitions , 2014 .

[2]  Peng Chen,et al.  A large‐area, spatially continuous assessment of land cover map error and its impact on downstream analyses , 2018, Global change biology.

[3]  Robert Gilmore Pontius,et al.  The total operating characteristic to measure diagnostic ability for multiple thresholds , 2014, Int. J. Geogr. Inf. Sci..

[4]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Timothy A. Warner,et al.  Implementation of machine-learning classification in remote sensing: an applied review , 2018 .

[6]  Matthew F. McCabe,et al.  Daily Retrieval of NDVI and LAI at 3 m Resolution via the Fusion of CubeSat, Landsat, and MODIS Data , 2018, Remote. Sens..

[7]  Hannah Kerner,et al.  Rapid Response Crop Maps in Data Sparse Regions , 2020, ArXiv.

[8]  J. Cihlar,et al.  An image transform to characterize and compensate for spatial variations in thin cloud contamination of Landsat images , 2002 .

[9]  Navin Ramankutty,et al.  Mind the gap: how do climate and agricultural management explain the ‘yield gap’ of croplands around the world? , 2010 .

[10]  I. McCallum,et al.  Characterizing the Spatial and Temporal Availability of Very High Resolution Satellite Imagery in Google Earth and Microsoft Bing Maps as a Source of Reference Data , 2018, Land.

[11]  Mikhail F. Kanevski,et al.  A Survey of Active Learning Algorithms for Supervised Remote Sensing Image Classification , 2011, IEEE Journal of Selected Topics in Signal Processing.

[12]  Nicholas J. Sitko,et al.  Africa's changing farm size distribution patterns: the rise of medium-scale farms , 2016 .

[13]  GLC_FCS30: global land-cover product with fine classification system at 30 m using time-series Landsat imagery , 2021 .

[14]  Philip K. Thornton,et al.  High carbon and biodiversity costs from converting Africa’s wet savannahs to cropland , 2015 .

[15]  Xueliang Zhang,et al.  Deep learning in remote sensing applications: A meta-analysis and review , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[16]  C. Field,et al.  Crop yield gaps: their importance, magnitudes, and causes. , 2009 .

[17]  Stefano Ricci,et al.  Collect Earth: Land Use and Land Cover Assessment through Augmented Visual Interpretation , 2016, Remote. Sens..

[18]  Peng Liu,et al.  Ieee Journal of Selected Topics in Applied Earth Observations and Remote Sensing 1 Active Deep Learning for Classification of Hyperspectral Images , 2022 .

[19]  Olivier Leo,et al.  Conflation of expert and crowd reference data to validate global binary thematic maps , 2019, Remote Sensing of Environment.

[20]  F. Achard,et al.  Tropical forests were the primary sources of new agricultural land in the 1980s and 1990s , 2010, Proceedings of the National Academy of Sciences.

[21]  Zhe Zhu,et al.  Object-based cloud and cloud shadow detection in Landsat imagery , 2012 .

[22]  Russell G. Congalton,et al.  Nominal 30-m Cropland Extent Map of Continental Africa by Integrating Pixel-Based and Object-Based Algorithms Using Sentinel-2 and Landsat-8 Data on Google Earth Engine , 2017, Remote. Sens..

[23]  Taloustieteen laitos,et al.  Small-Scale Farmers in Liberalised Trade Environment , 2005 .

[24]  Giles M. Foody,et al.  Key issues in rigorous accuracy assessment of land cover products , 2019, Remote Sensing of Environment.

[25]  Martin Brandt,et al.  The forgotten land use class: Mapping of fallow fields across the Sahel using Sentinel-2 , 2020, Remote Sensing of Environment.

[26]  Gershon Feder The relation between farm size and farm productivity: The role of family labor, supervision and credit constraints , 1985 .

[27]  Michael Oppenheimer,et al.  Projected climate impacts to South African maize and wheat production in 2055: a comparison of empirical and mechanistic modeling approaches , 2013, Global change biology.

[28]  James S. Gerber,et al.  Subnational distribution of average farm size and smallholder contributions to global food production , 2016 .

[29]  Peng Gong,et al.  New land-cover maps of Ghana for 2015 using Landsat 8 and three popular classifiers for biodiversity assessment , 2017 .

[30]  Sam Desiere,et al.  Land Productivity and Plot Size: Is Measurement Error Driving the Inverse Relationship? , 2017 .

[31]  David P. Roy,et al.  Analysis Ready Data: Enabling Analysis of the Landsat Archive , 2018, Remote. Sens..

[32]  Jin Chen,et al.  Global land cover mapping at 30 m resolution: A POK-based operational approach , 2015 .

[33]  Rahul Rakshit,et al.  A review of accuracy assessment for object-based image analysis: From per-pixel to per-polygon approaches , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[34]  N. Carré From Local to Global , 2016 .

[35]  Rick Mueller,et al.  Mapping global cropland and field size , 2015, Global change biology.

[36]  A. Ziegler,et al.  Highland cropland expansion and forest loss in Southeast Asia in the twenty-first century , 2018, Nature Geoscience.

[37]  Thomas J. Fuchs,et al.  A generalized computer vision approach to mapping crop fields in heterogeneous agricultural landscapes , 2016 .

[38]  Samia Boukir,et al.  Exploring issues of training data imbalance and mislabelling on random forest performance for large area land cover classification using the ensemble margin , 2015 .

[39]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[40]  David P. Roy,et al.  The global Landsat archive: Status, consolidation, and direction , 2016 .

[41]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[42]  Mark A. Friedl,et al.  Accounting for Training Data Error in Machine Learning Applied to Earth Observations , 2020, Remote. Sens..

[43]  Kelly K. Caylor,et al.  Reconciling agriculture, carbon and biodiversity in a savannah transformation frontier , 2016, Philosophical Transactions of the Royal Society B: Biological Sciences.

[44]  S. P. Abercrombie,et al.  Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product , 2019, Remote Sensing of Environment.

[45]  G. Levin Farm size and landscape composition in relation to landscape changes in Denmark , 2006 .

[46]  Steffen Fritz,et al.  Evaluation of ESA CCI prototype land cover map at 20m , 2017 .

[47]  David R. Thompson,et al.  Integrating active learning and crowdsourcing into large-scale supervised landcover mapping algorithms , 2017, PeerJ Prepr..

[48]  Matthias Drusch,et al.  Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services , 2012 .

[49]  Giles M. Foody,et al.  Good practices for estimating area and assessing accuracy of land change , 2014 .

[50]  Steffen Fritz,et al.  Estimating the global distribution of field size using crowdsourcing , 2018, Global change biology.

[51]  L. D. Estes,et al.  A platform for crowdsourcing the creation of representative, accurate landcover maps , 2016, Environ. Model. Softw..

[52]  W. Jetz,et al.  Remotely Sensed High-Resolution Global Cloud Dynamics for Predicting Ecosystem and Biodiversity Distributions , 2016, PLoS biology.

[53]  I. Luginaah,et al.  Traditional agriculture in transition: examining the impacts of agricultural modernization on smallholder farming in Ghana under the new Green Revolution , 2018, International Journal of Sustainable Development & World Ecology.

[54]  Zhe Zhu,et al.  Cirrus clouds that adversely affect Landsat 8 images: What are they and how to detect them? , 2020 .

[55]  Steffen Fritz,et al.  Highlighting continued uncertainty in global land cover maps for the user community , 2011 .

[56]  S. Flasse,et al.  Analysis of the conflict between omission and commission in low spatial resolution dichotomic thematic products: The Pareto Boundary , 2004 .

[57]  Michael Dixon,et al.  Google Earth Engine: Planetary-scale geospatial analysis for everyone , 2017 .

[58]  Peer Neubert,et al.  Compact Watershed and Preemptive SLIC: On Improving Trade-offs of Superpixel Segmentation Algorithms , 2014, 2014 22nd International Conference on Pattern Recognition.

[59]  François Waldner,et al.  Deep learning on edge: extracting field boundaries from satellite images with a convolutional neural network , 2019, ArXiv.

[60]  M. Zortea,et al.  OBJECT-BASED CLOUD AND CLOUD SHADOW DETECTION IN LANDSAT IMAGES FOR TROPICAL FOREST MONITORING , 2012 .

[61]  S. Openshaw A million or so correlation coefficients : three experiments on the modifiable areal unit problem , 1979 .

[62]  S. Fritz,et al.  Comparison of global and regional land cover maps with statistical information for the agricultural domain in Africa , 2010 .

[63]  S. Dark,et al.  The modifiable areal unit problem (MAUP) in physical geography , 2007 .

[64]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[65]  J. D. Whyatt,et al.  Line generalisation by repeated elimination of points , 1993 .

[66]  Steffen Fritz,et al.  Geo-Wiki: An online platform for improving global land cover , 2012, Environ. Model. Softw..

[67]  T. Raney,et al.  The Number, Size, and Distribution of Farms, Smallholder Farms, and Family Farms Worldwide , 2016 .

[68]  Claude Monteil,et al.  From local to global: A transfer learning-based approach for mapping poplar plantations at national scale using Sentinel-2 , 2021, ISPRS Journal of Photogrammetry and Remote Sensing.