Massively-parallel break detection for satellite data

The field of remote sensing is nowadays faced with huge amounts of data. While this offers a variety of exciting research opportunities, it also yields significant challenges regarding both computation time and space requirements. In practice, the sheer data volumes render existing approaches too slow for processing and analyzing all the available data. This work aims at accelerating BFAST, one of the state-of-the-art methods for break detection given satellite image time series. In particular, we propose a massively-parallel implementation for BFAST that can effectively make use of modern parallel compute devices such as GPUs. Our experimental evaluation shows that the proposed GPU implementation is up to four orders of magnitude faster than the existing publicly available implementation and up to ten times faster than a corresponding multi-threaded CPU execution. The dramatic decrease in running time renders the analysis of significantly larger datasets possible in seconds or minutes instead of hours or days. We demonstrate the practical benefits of our implementations given both artificial and real datasets.

[1]  Jan Verbesselt,et al.  Using spatial context to improve early detection of deforestation from Landsat time series , 2016 .

[2]  M. Herold,et al.  Near real-time disturbance detection using satellite image time series , 2012 .

[3]  Fabian Gieseke,et al.  bufferkdtree: A Python library for massive nearest neighbor queries on multi-many-core devices , 2017, Knowl. Based Syst..

[4]  Kotagiri Ramamohanarao,et al.  MASCOT: Fast and Highly Scalable SVM Cross-Validation Using GPUs and SSDs , 2014, 2014 IEEE International Conference on Data Mining.

[5]  M. Herold,et al.  Tracking disturbance-regrowth dynamics in tropical forests using structural change detection and Landsat time series , 2015 .

[6]  L. Dagum,et al.  OpenMP: an industry standard API for shared-memory programming , 1998 .

[7]  Gilberto Câmara,et al.  Big data streaming for remote sensing time series analytics using MapReduce , 2017, GEOINFO.

[8]  Kurt Keutzer,et al.  Fast support vector machine training and classification on graphics processors , 2008, ICML '08.

[9]  A. Azzouz 2011 , 2020, City.

[10]  Kung-Sik Chan,et al.  Time Series Analysis: With Applications in R , 2010 .

[11]  Michael A. Wulder,et al.  Opening the archive: How free data has enabled the science and monitoring promise of Landsat , 2012 .

[12]  Jan Verbesselt,et al.  Combining satellite data for better tropical forest monitoring , 2016 .

[13]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[14]  Nicolas Pinto,et al.  PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation , 2009, Parallel Comput..

[15]  Max Grossman,et al.  Professional CUDA C Programming , 2014 .

[16]  Tao Wang,et al.  Deep learning with COTS HPC systems , 2013, ICML.

[17]  Ranga B. Myneni,et al.  The interpretation of spectral vegetation indexes , 1995, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Fabian Gieseke,et al.  Buffer k-d Trees: Processing Massive Nearest Neighbor Queries on GPUs , 2014, ICML.

[19]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[20]  David P. Roy,et al.  A Global Analysis of Sentinel-2A, Sentinel-2B and Landsat-8 Data Revisit Intervals and Implications for Terrestrial Monitoring , 2017, Remote. Sens..