Extending an open source spatial Database with geospatial image support: An image mining perspective

The nature of vector data is relatively constant, and it is revised less frequently as compared to remotely sensed earth observation data. Remote sensing images are being collected nowadays every 15 minutes from satellites such as Meteosat. In the coming years, very high spatial resolution data is expected to be available freely and frequently. Integrated GIS and remote sensing spatial analysis methods have the ability to incorporate different data sources to find attribute associations and patterns of change for knowledge discovery and change detection. GIS-based data such as vector data and DEM are overlayed with image data and results are taken up in a GIS for further processing and analysis. A platform is required to efficiently store, retrieve and manipulate such image data as layers just like other GIS data layers for hybrid GIS/RS analysis. In principle, spatial databases are the most suitable candidates as such a platform. Our work is aimed to investigate the open source spatial database PostgreSQL/PostGIS (PG/PG) as such a platform, to provide a solution for image support and an overall framework for integrated remote sensing and GIS analysis. This is definitely beyond just storage and retrieval of images in spatial databases. The requirements and available open source libraries were extensively studied to provide such an image support. The TerraLib library was proposed, and analysed to extend the PG database with image support. To demonstrate the application developed in this study, the Meteosat Second Generation (MSG) image data for a larger part of Europe was extracted from the ITC data receiver. An application programme was written to construct time series image database for extracted image data with the PG/ PG DBMS. A mining application to detect clouds patterns from time-series image and vector data stored in the PG database was developed using TerraLib conceptual schema. For this, an extensive study of data mining methods was carried out. A statistical data mining method based on the principal components analysis was adopted to extract cloud features for the Netherlands from the time series image data. Using this research platform and cloud patterns detection case application, various image mining scenarios were conducted to provide a framework for integrated image and vector data analysis top of the DBMS technology. This framework is extremely useful for studying spatio-temporal phenomena with seasonal or long intervals and region-based studies where the regions on a remote sensing image are extracted by vector data.

[1]  Chien-Ming Chen,et al.  Mining interesting association rules from customer databases and transaction databases , 2004, Inf. Syst..

[2]  Surajit Chaudhuri Data Mining and Database Systems: Where is the Intersection? , 1998, IEEE Data Eng. Bull..

[3]  Rahul Ramachandran,et al.  ADaM: a data mining toolkit for scientists and engineers , 2005, Comput. Geosci..

[4]  David J. DeWitt,et al.  Building a scaleable geo-spatial DBMS: technology, implementation, and evaluation , 1997, SIGMOD '97.

[5]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[6]  Keun Ho Ryu,et al.  Statistics Based Predictive Geo-spatial Data Mining: Forest Fire Hazardous Area Mapping Application , 2003, APWeb.

[7]  Franco Turini,et al.  Knowledge Discovery from Geographical Data , 2008, Mobility, Data Mining and Privacy.

[8]  N. M. Mattikalli Integration of remotely-sensed raster data with a vector-based geographical information system for land-use change detection , 1995 .

[9]  Mong-Li Lee,et al.  Image Mining: Trends and Developments , 2002, Journal of Intelligent Information Systems.

[10]  Alexander S. Perepechko,et al.  Spatial Database Development for Russian Urban Areas: A New Conceptual Framework , 2005 .

[11]  Ulrich Güntzer,et al.  Algorithms for association rule mining — a general survey and comparison , 2000, SKDD.

[12]  Pedro Ribeiro de Andrade Neto,et al.  A Process and Environment for Embedding the R Software into TerraLib , 2005, GeoInfo.

[13]  Xiaolin Qin,et al.  Detecting outliers in spatial database , 2004, Third International Conference on Image and Graphics (ICIG'04).

[14]  Qihao Weng Land use change analysis in the Zhujiang Delta of China using satellite remote sensing, GIS and stochastic modelling. , 2002, Journal of environmental management.

[15]  Mohd. Noor Md. Sap,et al.  Outlier Detection Technique in Data Mining: A Research Perspective , 2005 .

[16]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[17]  C. Lo,et al.  Using a time series of satellite imagery to detect land use and land cover changes in the Atlanta, Georgia metropolitan area , 2002 .

[18]  H. Kriegel,et al.  Spatial Data Mining: Database Primitives, Algorithms and Efficient DBMS Support , 2000, Data Mining and Knowledge Discovery.

[19]  R. Kettig Computer classification of remotely sensed multispectral image data by extraction and classification of homogeneous objects. , 1975 .

[20]  Jiawei Han,et al.  Discovery of Spatial Association Rules in Geographic Information Databases , 1995, SSD.

[21]  Gilberto Câmara,et al.  Image Data Handling in Spatial Databases , 2003, GeoInfo.

[22]  Max J. Egenhofer,et al.  Spatial Information Appliances: A Next Generation of Geographic Information Systems , 1999 .

[23]  Elisa Bertino,et al.  Spatial Knowledge-Based Applications and Technologies: Research Issues , 2005, KES.

[24]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[25]  Elizabeth E. Ebert,et al.  A Pattern Recognition Technique for Distinguishing Surface and Cloud Types in the Polar Regions , 1987 .

[26]  B.L. Milenova,et al.  Mining high-dimensional data for information fusion: a database-centric approach , 2005, 2005 7th International Conference on Information Fusion.

[27]  S. Sumathi,et al.  Data Mining Tasks, Techniques, and Applications , 2006 .

[28]  A. Krishna Sinha,et al.  Geoinformatics : data to knowledge , 2006 .

[29]  Curt J. Ellmann,et al.  Building a Scalable GeoSpatial DBMS : Technology , Implementation , and Evaluation , 1997 .

[30]  Marco A. Casanova,et al.  TerraLib: An Open Source GIS Library for Large-Scale Environmental and Socio-Economic Applications , 2008 .

[31]  Wilfried Philips,et al.  Graph‐based change detection in geographic information using VHR satellite images , 2006 .

[32]  V. Retsios,et al.  Processing of MSG - 1 SEVIRI data in the thermal infrared - algorithm development with the use of the SPARC2004 data set , 2005 .

[33]  David J. Hand,et al.  Statistics and data mining: intersecting disciplines , 1999, SKDD.

[34]  Alan M. MacEachren,et al.  An evolving cognitive-semiotic approach to geographic visualization and knowledge construction , 2001 .

[35]  Jiawei Han,et al.  GeoMiner: a system prototype for spatial data mining , 1997, SIGMOD '97.

[36]  Anestis Antoniadis,et al.  Statistical cloud detection from SEVIRI multispectral images , 2008 .

[37]  Pieter Adriaans,et al.  Data mining , 1996 .

[38]  Sébastien Mustière,et al.  Database Requirements for Generalisation and Multiple Representations , 2007 .

[39]  Gilberto Câmara,et al.  Spring: integrating remote sensing and gis by object-oriented data modelling , 1996, Comput. Graph..

[40]  Ralph Johnson,et al.  design patterns elements of reusable object oriented software , 2019 .

[41]  Karl Rihaczek,et al.  1. WHAT IS DATA MINING? , 2019, Data Mining for the Social Sciences.

[42]  Yi Zhang,et al.  GSQL-R: A query language supporting raster data , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[43]  Cristina Conde,et al.  PCA vs low resolution images in face verification , 2003, 12th International Conference on Image Analysis and Processing, 2003.Proceedings..

[44]  C. C. Petit,et al.  Integration of multi-source remote sensing data for land cover change detection , 2001, Int. J. Geogr. Inf. Sci..

[45]  M. Wachowicz,et al.  Geoinsight; an approach for developing a knowledge construction process based on the integration of GVis and KDD methods , 2001 .

[46]  Tu Bao Ho,et al.  Knowledge Acquisition by Machine Learning and Data Mining , 2007, Creative Environments.

[47]  Esteban Zimányi,et al.  Spatial Data Warehouses , 2009 .

[48]  Bradley S Peterson,et al.  What is an image? , 2008, Journal of the American Academy of Child and Adolescent Psychiatry.

[49]  S. Sumathi,et al.  Statistical Themes and Lessons for Data Mining , 2006 .

[50]  Corina da Costa Freitas,et al.  Efficient regionalization techniques for socio‐economic geographical units using minimum spanning trees , 2006, Int. J. Geogr. Inf. Sci..

[51]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[52]  G. Brent Hall,et al.  Open Source Approaches in Spatial Data Handling , 2008 .

[53]  Gilberto Câmara,et al.  Mining patterns of change in remote sensing image databases , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[54]  Yong Ge,et al.  Geo-spatial Data Analysis, Quality Assessment and Visualization , 2008, ICCSA.

[55]  Selim Aksoy,et al.  Interactive training of advanced classifiers for mining remote sensing image archives , 2004, KDD.

[56]  Yannis Manolopoulos,et al.  Spatial Databases , 2004 .

[57]  Alfred Stein Modern developments in image mining , 2008 .

[58]  William J. Mitchell,et al.  What Is an Image , 1984 .

[59]  Vittorio Castelli,et al.  Image Databases: Search and Retrieval of Digital Imagery , 2002 .

[60]  Shashi Shekhar,et al.  *Miner: A Suit of Classifiers for Spatial, Temporal, Ancillary, and Remote Sensing Data Mining , 2008, Fifth International Conference on Information Technology: New Generations (itng 2008).

[61]  D. Lu,et al.  Change detection techniques , 2004 .

[62]  B. Uma Shankar,et al.  Novel Classification and Segmentation Techniques with Application to Remotely Sensed Images , 2007, Trans. Rough Sets.

[63]  Charles Elkan,et al.  Scalability for clustering algorithms revisited , 2000, SKDD.