GeoDMA - Geographic Data Mining Analyst

Abstract Remote sensing images obtained by remote sensing are a key source of data for studying large-scale geographic areas. From 2013 onwards, a new generation of land remote sensing satellites from USA, China, Brazil, India and Europe will produce in 1 year as much data as 5 years of the Landsat-7 satellite. Thus, the research community needs new ways to analyze large data sets of remote sensing imagery. To address this need, this paper describes a toolbox for combing land remote sensing image analysis with data mining techniques. Data mining methods are being extensively used for statistical analysis, but up to now have had limited use in remote sensing image interpretation due to the lack of appropriate tools. The toolbox described in this paper is the Geographic Data Mining Analyst (GeoDMA). It has algorithms for segmentation, feature extraction, feature selection, classification, landscape metrics and multi-temporal methods for change detection and analysis. GeoDMA uses decision-tree strategies adapted for spatial data mining. It connects remotely sensed imagery with other geographic data types using access to local or remote database. GeoDMA has methods to assess the accuracy of simulation models, as well as tools for spatio-temporal analysis, including a visualization of time-series that helps users to find patterns in cyclic events. The software includes a new approach for analyzing spatio-temporal data based on polar coordinates transformation. This method creates a set of descriptive features that improves the classification accuracy of multi-temporal image databases. GeoDMA is tightly integrated with TerraView GIS, so its users have access to all traditional GIS features. To demonstrate GeoDMA, we show two case studies on land use and land cover change.

[1]  Jean Paul Metzger,et al.  Time-lag in biological responses to landscape changes in a highly dynamic Atlantic forest region , 2009 .

[2]  M. Steinbach,et al.  Clustering Earth Science Data: Goals, Issues and Results , 2001 .

[3]  B. Solaiman,et al.  A data mining based approach to predict spatiotemporal changes in satellite images , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[4]  Kevin McGarigal,et al.  Landscape Pattern Metrics , 2014 .

[5]  C. Pinho,et al.  INTRA-URBAN LAND COVER CLASSIFICATION FROM HIGH-RESOLUTION IMAGES USING THE C 4 . 5 ALGORITHM , 2008 .

[6]  Frank W. Gerlach,et al.  IKONOS satellite, imagery, and products , 2003 .

[7]  Y. Hao,et al.  Landscape metric performance in analyzing two decades of deforestation in the Amazon Basin of Rondonia, Brazil , 2006 .

[8]  Guaraci J. Erthal,et al.  Satellite Imagery Segmentation: a region growing approach , 1996 .

[9]  Marco A. Casanova,et al.  TerraLib: An Open Source GIS Library for Large-Scale Environmental and Socio-Economic Applications , 2008 .

[10]  H. Nagendra,et al.  Fragmentation of a Landscape: Incorporating landscape metrics into satellite analyses of land-cover change , 2002 .

[11]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[12]  Dazhong Wen Land Mosaics: The Ecology of Landscapes and Regions , 1997 .

[13]  G. Hay,et al.  Object-Based Image Analysis , 2008 .

[14]  S. M. Jong,et al.  The Importance of Scale in Object-based Mapping of Vegetation Parameters with Hyperspectral Imagery , 2007 .

[15]  Leila Maria Garcia Fonseca,et al.  GeoDMA - A Novel System for Spatial Data Mining , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[16]  Russell G. Congalton,et al.  Thematic and positional accuracy assessment of digital remotely sensed data , 2007 .

[17]  M. Goodchild GIScience, Geography, Form, and Process , 2004 .

[18]  Leila Maria Garcia Fonseca,et al.  A Geographical Approach to Self-Organizing Maps Algorithm Applied to Image Segmentation , 2011, ACIVS.

[19]  R. M. Freitas,et al.  Virtual laboratory of remote sensing time series: visualization of MODIS EVI2 data set over South America , 2011 .

[20]  R. Forman Land Mosaics: The Ecology of Landscapes and Regions , 1995 .

[21]  J. Hammersley SIMULATION AND THE MONTE CARLO METHOD , 1982 .

[22]  Rob J Hyndman,et al.  Detecting trend and seasonal changes in satellite image time series , 2010 .

[23]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.

[24]  Philippe De Maeyer,et al.  An automated satellite image classification design using object-oriented segmentation algorithms: A move towards standardization , 2007, Expert Syst. Appl..

[25]  Gilberto Câmara,et al.  Spring: integrating remote sensing and gis by object-oriented data modelling , 1996, Comput. Graph..

[26]  Tenley M. Conway,et al.  Determining land-use information from land cover through an object-oriented classification of IKONOS imagery , 2008 .

[27]  Frederico T. Fonseca,et al.  What's in an Image? , 2001, COSIT.

[28]  Usama M. Fayyad,et al.  Data mining and KDD: Promise and challenges , 1997, Future Gener. Comput. Syst..

[29]  Martin Herold,et al.  On the Suitability of MODIS Time Series Metrics to Map Vegetation Types in Dry Savanna Ecosystems: A Case Study in the Kalahari of NE Namibia , 2009, Remote. Sens..

[30]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[31]  Stefan Lang,et al.  Object-based image analysis for remote sensing applications: modeling reality – dealing with complexity , 2008 .

[32]  Barry Smith,et al.  On Drawing Lines on a Map , 1995, COSIT.

[33]  Wolfgang Middelmann,et al.  An Efficient Parallel Algorithm for Graph-Based Image Segmentation , 2009, CAIP.

[34]  Geoffrey J. Hay,et al.  Geographic Object-Based Image Analysis (GEOBIA): A new name for a new discipline , 2008 .

[35]  Jean Paul Metzger,et al.  The Brazilian Atlantic Forest: How much is left, and how is the remaining forest distributed? Implications for conservation , 2009 .

[36]  M. Neubert,et al.  THE POTENTIAL USE OF VERY HIGH RESOLUTION SATELLITE DATA FOR URBAN AREAS – FIRST EXPERIENCES WITH IKONOS DATA, THEIR CLASSIFICATION AND APPLICATION IN URBAN PLANNING AND ENVIRONMENTAL MONITORING , 2001 .

[37]  Arno Schäpe,et al.  Multiresolution Segmentation : an optimization approach for high quality multi-scale image segmentation , 2000 .

[38]  J. Bückner,et al.  geoAIDA-A Knowledge Based Automatic Image Data Analyser for Remote Sensing Data , 2000 .

[39]  L. Morellato,et al.  Introduction: The Brazilian Atlantic Forest1 , 2000 .

[40]  Leila Maria Garcia Fonseca,et al.  EFEITOS DA MUDANÇA DE ESCALA EM PADRÕES DE DESMATAMENTO NA AMAZÔNIA , 2012 .

[41]  Luciano Vieira Dutra,et al.  A Resegmentation Approach for Detecting Rectangular Objects in High-Resolution Imagery , 2011, IEEE Geoscience and Remote Sensing Letters.

[42]  Christopher Small Spatiotemporal dimensionality and time-space characterization of vegetation phenology from multitemporal MODIS EVI , 2011, 2011 6th International Workshop on the Analysis of Multi-temporal Remote Sensing Images (Multi-Temp).

[43]  M. Steinbach,et al.  Finding Spatio-Temporal Patterns in Earth Science Data , 2001 .

[44]  Amiya Nayak,et al.  Measuring linearity of planar point sets , 2008, Pattern Recognit..

[45]  C. A. Mücher,et al.  Remote Sensing in Landscape Ecology: Experiences and Perspectives in a European Context , 2005, Landscape Ecology.

[46]  Qinghua Ye,et al.  Handling uncertainties in image mining for remote sensing studies , 2009 .

[47]  Gilberto Câmara,et al.  Mining patterns of change in remote sensing image databases , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[48]  C. Justice,et al.  Atmospheric correction of MODIS data in the visible to middle infrared: first results , 2002 .

[49]  Z. Bochenek Rule-based classification of SPOT imagery using object-oriented approach for detailed land cover mapping , 2008 .

[50]  Maria Isabel Sobral Escada,et al.  Remote‐sensing image mining: detecting agents of land‐use change in tropical forest areas , 2008 .

[51]  C. Woodcock,et al.  The factor of scale in remote sensing , 1987 .

[52]  Scott J. Goetz,et al.  Mapping residential density patterns using multi-temporal Landsat data and a decision-tree classifier , 2004 .

[53]  Patrick J. Hayes,et al.  Modeling Cyclic Change , 1999, ER.

[54]  M. Turner Landscape ecology: what is the state of the science? , 2005 .

[55]  Geoffrey J. Hay,et al.  Free and open source geographic information tools for landscape ecology , 2009, Ecol. Informatics.

[56]  Yosio Edemir Shimabukuro,et al.  Combining wavelets and linear spectral mixture model for MODIS satellite sensor time-series analysis , 2008 .

[57]  Uwe Stilla,et al.  Machine Learning Comparison between WorldView-2 and QuickBird-2-Simulated Imagery Regarding Object-Based Urban Land Cover Classification , 2011, Remote. Sens..

[58]  David M. Theobald,et al.  Landscape dynamics of Amazonian deforestation between 1984 and 2002 in central Rondônia, Brazil: assessment and future scenarios , 2005 .

[59]  Menno-Jan Kraak,et al.  Assessing the Effectiveness of Temporal Legends in Environmental Visualization , 1997 .

[60]  A. Huete,et al.  Development of a two-band enhanced vegetation index without a blue band , 2008 .

[61]  Leila Maria Garcia Fonseca,et al.  Land-cover classification of an intra-urban environment using high-resolution images and object-based image analysis , 2012 .

[62]  Giles M. Foody,et al.  Status of land cover classification accuracy assessment , 2002 .

[63]  M. Neubert,et al.  A COMPARISON OF SEGMENTATION PROGRAMS FOR HIGH RESOLUTION REMOTE SENSING DATA , 2004 .

[64]  John F. Mustard,et al.  A curve fitting procedure to derive inter-annual phenologies from time series of noisy satellite NDVI data , 2007 .

[65]  William J. Emery,et al.  A neural network approach using multi-scale textural metrics from very high-resolution panchromatic imagery for urban land-use classification , 2009 .

[66]  J. Imbernon,et al.  Characterization of landscape patterns of deforestation in tropical rain forests , 2001 .

[67]  N. Pettorelli,et al.  Using the satellite-derived NDVI to assess ecological responses to environmental change. , 2005, Trends in ecology & evolution.

[68]  A. Huete,et al.  Overview of the radiometric and biophysical performance of the MODIS vegetation indices , 2002 .

[69]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[70]  Kevin McGarigal Landscape Pattern MetricsBased in part on the article “Landscape pattern metrics” by Kevin McGarigal, which appeared in the Encyclopedia of Environmetrics. , 2013 .

[71]  Leila Maria Garcia Fonseca,et al.  KNOWLEDGE-BASED INTERPRETATION OF REMOTE SENSING DATA WITH THE INTERIMAGE SYSTEM : MAJOR CHARACTERISTICS AND RECENT DEVELOPMENTS , 2010 .

[72]  Linda G. Shapiro,et al.  Image Segmentation Techniques , 1984, Other Conferences.

[73]  K. McGarigal,et al.  FRAGSTATS: spatial pattern analysis program for quantifying landscape structure. , 1995 .

[74]  J. Wiens Population Responses to Patchy Environments , 1976 .

[75]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[76]  Jasmin Christian Blanchette,et al.  C++ GUI programming with Qt 4 , 2004 .