Spatial Data Mining Approaches for GIS – A Brief Review

Spatial Data Mining (SDM) technology has emerged as a new area for spatial data analysis. Geographical Information System (GIS) stores data collected from heterogeneous sources in varied formats in the form of geodatabases representing spatial features, with respect to latitude and longitudinal positions. Geodatabases are increasing day by day generating huge volume of data from satellite images providing details related to orbit and from other sources for representing natural resources like water bodies, forest covers, soil quality monitoring etc. Recently GIS is used in analysis of traffic monitoring, tourist monitoring, health management, and bio-diversity conservation. Inferring information from geodatabases has gained importance using computational algorithms. The objective of this survey is to provide with a brief overview of GIS data formats data representation models, data sources, data mining algorithmic approaches, SDM tools, issues and challenges. Based on analysis of various literatures this paper outlines the issues and challenges of GIS data and architecture is proposed to meet the challenges of GIS data and viewed GIS as a Bigdata problem.

[1]  Kenneth J. Dueker,et al.  A GEOGRAPHIC INFORMATION SYSTEM FRAMEWORK FOR TRANSPORTATION DATA SHARING , 2000 .

[2]  Timothy L. Nyerges,et al.  GROUP-BASED GEOGRAPHIC INFORMATION SYSTEMS FOR TRANSPORTATION IMPROVEMENT SITE SELECTION , 1997 .

[3]  Frank van Harmelen,et al.  Information Sharing on the Semantic Web , 2004, Advanced Information and Knowledge Processing.

[4]  SeegerBernhard,et al.  Multi-step processing of spatial joins , 1994 .

[5]  Bambang Parmanto,et al.  Exploring the role of GIS during community health assessment problem solving: experiences of public health professionals , 2006, International journal of health geographics.

[6]  Xinhao Wang,et al.  Integrating GIS, simulation models, and visualization in traffic impact analysis , 2005, Comput. Environ. Urban Syst..

[7]  Gangireddy Ravikumar,et al.  AN EFFECTIVE ANALYSIS OF SPATIAL DATA MINING METHODS USING RANGE QUERIES , 2012 .

[8]  G. Tradigo,et al.  Geomedica: managing and querying clinical data distributions on geographical database systems , 2010, ICCS.

[9]  Harith Alani,et al.  Geographical Information Retrieval with Ontologies of Place , 2001, COSIT.

[10]  Evans Jasmine GENE ONTOLOGY SIMILARITY METRIC BASED ON DAG USING DIABETIC GENE , 2016 .

[11]  Guoray Cai,et al.  Contextualization of Geospatial Database Semantics for Human–GIS Interaction , 2007, GeoInformatica.

[12]  Yingjie Hu,et al.  Geospatial Semantics , 2017, ArXiv.

[13]  Sumi Mehta,et al.  The Burden of Disease from Indoor Air Pollution in Developing Countries: Comparison of Estimates , 2002 .

[14]  Shih-Lung Shaw,et al.  Integrated land use and transportation interaction: a temporal GIS exploratory data analysis approach , 2003 .

[15]  Andrew Lepp,et al.  TOURIST ROLES, PERCEIVED RISK AND INTERNATIONAL TOURISM , 2003 .

[16]  Martin Trépanier,et al.  Road network monitoring: algorithms and a case study , 2006, Comput. Oper. Res..

[17]  Jin-Yong Choi,et al.  Evaluating spatial centrality for integrated tourism management in rural areas using GIS and network analysis , 2013 .

[18]  Y. Sun,et al.  Integrating spatial relations into case-based reasoning to solve geographic problems , 2012, Knowl. Based Syst..

[19]  Xiaoying Zheng,et al.  A method for extracting rules from spatial data based on rough fuzzy sets , 2014, Knowl. Based Syst..

[20]  A. Rabasa,et al.  Modelling farmland abandonment: A study combining GIS and data mining techniques , 2012 .

[21]  Y. Vagh The application of a visual data mining framework to determine soil, climate and land-use relationships , 2012 .

[22]  Grace Chang,et al.  Web-based GIS in tourism information search: Perceptions, tasks, and trip attributes , 2011 .

[23]  Jianming Xu,et al.  Application of geostatistics and GIS technique to characterize spatial variabilities of bioavailable micronutrients in paddy soils , 2004 .

[24]  Michal Bíl,et al.  Unified GIS database on cycle tourism infrastructure , 2012 .

[25]  Andrew Lepp,et al.  Sensation seeking and tourism: Tourist role, perception of risk and destination choice , 2008 .

[26]  J. K Affum,et al.  A GIS-based environmental modelling system for transportation planners , 2002 .

[27]  Chris T. Kiranoudis,et al.  A GIS-based decision support system for planning urban transportation policies , 2004, Eur. J. Oper. Res..

[28]  Daniel A. Badoe,et al.  Transportation–land-use interaction: empirical findings in North America, and their implications for modeling , 2000 .

[29]  Xin Wang,et al.  An Ontology-Based Spatial Clustering Selection System , 2009, Canadian Conference on AI.

[30]  R. Burnett,et al.  Cardiovascular Mortality and Long-Term Exposure to Particulate Air Pollution: Epidemiological Evidence of General Pathophysiological Pathways of Disease , 2003, Circulation.

[31]  R. Khera,et al.  Crime, gender, and society in India: insights from homicide data. , 2000, Population and development review.

[32]  Yelena Yesha,et al.  Data Mining: Next Generation Challenges and Future Directions , 2004 .

[33]  M. Egenhofer Categorizing Binary Topological Relations Between Regions, Lines, and Points in Geographic Databases , 1998 .

[34]  Charalampos Konstantopoulos,et al.  Mobile recommender systems in tourism , 2014, J. Netw. Comput. Appl..

[35]  Jennifer Rogalsky,et al.  The working poor and what GIS reveals about the possibilities of public transit , 2010 .

[36]  Ximing Cai,et al.  Linking GIS and water resources management models: an object-oriented method , 2002, Environ. Model. Softw..

[37]  Robert B McMaster,et al.  A Research Agenda for Geographic Information Science , 2004 .

[38]  Srinivasarao Yammani,et al.  Groundwater quality suitable zones identification: application of GIS, Chittoor area, Andhra Pradesh, India , 2007 .

[39]  Hans-Peter Kriegel,et al.  Multi-step processing of spatial joins , 1994, SIGMOD '94.

[40]  Luis Martínez-López,et al.  A mobile 3D-GIS hybrid recommender system for tourism , 2012, Inf. Sci..

[41]  Padhraic Smyth,et al.  Image database exploration: progress and challenges , 1993 .

[42]  Zhang Nan,et al.  The Design and Implement of Tourism Information System Based on GIS , 2012 .

[43]  S. Jyothi,et al.  Soil Classification Using Data Mining Techniques: A Comparative Study , 2011 .

[44]  Frederico T. Fonseca,et al.  Using Ontologies for Integrated Geographic Information Systems , 2002, Trans. GIS.

[45]  Claudio Silva,et al.  Site selection for shellfish aquaculture by means of GIS and farm-scale models, with an emphasis on data-poor environments , 2011 .

[46]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[47]  Ata M. Khan,et al.  Modelling urban transportation emissions: role of GIS , 2004, Comput. Environ. Urban Syst..

[48]  Nigel Waters,et al.  The internet, GIS and public participation in transportation planning , 2005 .

[49]  R. Law,et al.  Progress in information technology and tourism management: 20 years on and 10 years after the Internet - the state of eTourism research. , 2008 .

[50]  Corinne Mulley,et al.  GIS as a tool for selection of sample areas in a travel behaviour survey , 2014 .

[51]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[52]  Mei-Po Kwan,et al.  Interactive geovisualization of activity-travel patterns using three-dimensional geographical information systems: a methodological exploration with a large data set , 2000 .

[53]  Shihong Du,et al.  Evaluating structural and topological consistency of complex regions with broad boundaries in multi-resolution spatial databases , 2008, Inf. Sci..

[54]  Jiawei Han,et al.  Discovery of Spatial Association Rules in Geographic Information Databases , 1995, SSD.

[55]  Max J. Egenhofer,et al.  GeoSpatial Semantics, First International Conference, GeoS 2005, Mexico City, Mexico, November 29-30, 2005, Proceedings , 2005, GeoS.

[56]  Tzu-Kuang Hsu,et al.  The preference analysis for tourist choice of destination: A case study of Taiwan , 2009 .

[57]  Ralf Hartmut Güting,et al.  An introduction to spatial database systems , 1994, VLDB J..

[58]  Anthony J. T. Lee,et al.  Mining frequent trajectory patterns in spatial-temporal databases , 2009, Inf. Sci..

[59]  Max J. Egenhofer,et al.  Advances in Spatial Databases , 1997, Lecture Notes in Computer Science.

[60]  Michael F. Goodchild,et al.  Interoperating Geographic Information Systems , 2012 .

[61]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2008, International Journal of Computer Vision.

[62]  Z. JiXian,et al.  Integrated application of RS and GIS to agriculture land use planning , 2002 .

[63]  Michela Bertolotto,et al.  Exploratory spatio-temporal data mining and visualization , 2007, J. Vis. Lang. Comput..

[64]  Hans-Peter Kriegel,et al.  Spatial Data Mining: A Database Approach , 1997, SSD.

[65]  Ann Aschengrau,et al.  Spatial-temporal analysis of breast cancer in upper Cape Cod, Massachusetts , 2008, International journal of health geographics.

[66]  Paula Antunes,et al.  The application of Geographical Information Systems to determine environmental impact significance , 2001 .

[67]  Ki-Joune Li,et al.  A spatial data mining method by Delaunay triangulation , 1997, GIS '97.

[68]  Max J. Egenhofer,et al.  Spatial SQL: A Query and Presentation Language , 1994, IEEE Trans. Knowl. Data Eng..

[69]  Anthony J. T. Lee,et al.  Mining spatial association rules in image databases , 2007, Inf. Sci..

[70]  Deduction and application of generalized Euler formula in topological relation of geographic information system (GIS) , 2004 .

[71]  Andrew U. Frank,et al.  Formal specification of image schemata -- a step towards interoperability in geographic information systems , 1999, Spatial Cogn. Comput..

[72]  J. K. Mandal,et al.  A GIS Anchored System for Selection of Utility Service Stations through Hierarchical Clustering , 2013 .

[73]  Max J. Egenhofer,et al.  Metric details for natural-language spatial relations , 1998, TOIS.

[74]  Mark Gahegan,et al.  Geospatial Data Mining and Knowledge Discovery , 2000 .

[75]  Luigi Guarino,et al.  36 Geographic Information Systems (GIS) and the Conservation and Use of Plant Genetic Resources , 2002 .

[76]  M. Rada,et al.  Contrasting approved uses against actual uses at La Restinga Lagoon National Park, Margarita Island, Venezuela. A GPS and GIS method to improve management plans and rangers coverage , 2012, Journal of Coastal Conservation.

[77]  E. Nosair,et al.  Runoff Water Harvesting Optimization by Using RS, GIS and Watershed Modelling in Wadi El-Arish, Sinai , 2013 .

[78]  Jiawei Han,et al.  Discovery of Multiple-Level Association Rules from Large Databases , 1995, VLDB.

[79]  Hans-Peter Kriegel,et al.  The Impact of Global Clustering on Spatial Database Systems , 1994, VLDB.

[80]  Jean-Claude Thill,et al.  Geographic information systems for transportation in perspective , 2000 .

[81]  David W. Aha,et al.  A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithms , 1997, Artificial Intelligence Review.

[82]  Alan T. Murray,et al.  A geographical perspective on access to sexual and reproductive health care for women in rural Africa. , 2013, Social science & medicine.

[83]  Steven Schockaert,et al.  Generating approximate region boundaries from heterogeneous spatial information: An evolutionary approach , 2011, Inf. Sci..

[84]  P. Diggle,et al.  Spatial point pattern analysis and its application in geographical epidemiology , 1996 .