Integration of genetic algorithm and multiple kernel support vector regression for modeling urban growth

Abstract There are two main issues of concern for land change scientists to consider. First, selecting appropriate and independent land cover change (LCC) drivers is a substantial challenge because these drivers usually correlate with each other. For this reason, we used a well-known machine learning tool called genetic algorithm (GA) to select the optimum LCC drivers. In addition, using the best or most appropriate LCC model is critical since some of them are limited to a specific function, to discover non-linear patterns within land use data. In this study, a support vector regression (SVR) was implemented to model LCC as SVRs use various linear and non-linear kernels to better identify non-linear patterns within land use data. With such an approach, choosing the appropriate kernels to model LCC is critical because SVR kernels have a direct impact on the accuracy of the model. Therefore, various linear and non-linear kernels, including radial basis function (RBF), sigmoid (SIG), polynomial (PL) and linear (LN) kernels, were used across two phases: 1) in combination with GA, and 2) without GA present. The simulated maps resulting from each combination were evaluated using a recently modified version of the receiver operating characteristics (ROC) tool called the total operating characteristic (TOC) tool. The proposed approach was applied to simulate urban growth in Rasht County, which is located in the north of Iran. As a result, an SVR-GA-RBF model achieved the highest area under curve (AUC) value at 94% while the lowest AUC was achieved when using the SVR-LN model at 71%. The results show that the synergy between GA and SVR can effectively optimize the variables selection process used when developing an LCC model, and can enhance the predictive accuracy of SVR.

[1]  N. Cook Use and Misuse of the Receiver Operating Characteristic Curve in Risk Prediction , 2007, Circulation.

[2]  Bryan C. Pijanowski,et al.  Land use legacies of the Ohio River Basin: Using a spatially explicit land use change model to assess past and future impacts on aquatic resources , 2015 .

[3]  Jean-François Mas,et al.  Modelling deforestation using GIS and artificial neural networks , 2004, Environ. Model. Softw..

[4]  Bryan C. Pijanowski,et al.  Comparing three global parametric and local non-parametric models to simulate land use change in diverse areas of the world , 2014, Environ. Model. Softw..

[5]  V Surendar,et al.  Estimation of State of Charge of a Lead Acid Battery Using Support Vector Regression , 2015 .

[6]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[7]  Robert Gilmore Pontius,et al.  Recommendations for using the relative operating characteristic (ROC) , 2013, Landscape Ecology.

[8]  Mohammad Javad Yazdanpanah,et al.  URBAN EXPANSION SIMULATION USING GEOSPATIAL INFORMATION SYSTEM AND ARTIFICIAL NEURAL NETWORKS , 2009 .

[9]  Michael Ferris,et al.  SmartScape™: A web-based decision support system for assessing the tradeoffs among multiple ecosystem services under crop-change scenarios , 2016, Comput. Electron. Agric..

[10]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[11]  M. Marjanović,et al.  Landslide susceptibility assessment using SVM machine learning algorithm , 2011 .

[12]  Zhilei Lin,et al.  A support vector machine classifier based on a new kernel function model for hyperspectral data , 2016 .

[13]  Andreas Rienow,et al.  Supporting SLEUTH - Enhancing a cellular automaton with support vector machines for urban growth modeling , 2015, Comput. Environ. Urban Syst..

[14]  Anthony Gar-On Yeh,et al.  Neural-network-based cellular automata for simulating multiple land use changes using GIS , 2002, Int. J. Geogr. Inf. Sci..

[15]  B. Pijanowski,et al.  Using neural networks and GIS to forecast land use changes: a Land Transformation Model , 2002 .

[16]  Nancy R. Cook,et al.  Cook Prediction Use and Misuse of the Receiver Operating Characteristic Curve in Risk , 2007 .

[17]  Hossein Shafizadeh-Moghadam,et al.  Spatiotemporal variability of urban growth factors: A global and local perspective on the megacity of Mumbai , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[18]  Andries P. Engelbrecht,et al.  Computational Intelligence: An Introduction , 2002 .

[19]  Xia Li,et al.  Simulating complex urban development using kernel-based non-linear cellular automata , 2008 .

[20]  Amin Tayyebi,et al.  Simulating land use land cover change using data mining and machine learning algorithms , 2013 .

[21]  Bryan C. Pijanowski,et al.  An urban growth boundary model using neural networks, GIS and radial parameterization: An application to Tehran, Iran , 2011 .

[22]  Bryan C. Pijanowski,et al.  A big data urban growth simulation at a national scale: Configuring the GIS and neural network based Land Transformation Model to run in a High Performance Computing (HPC) environment , 2014, Environ. Model. Softw..

[23]  Eric Koomen,et al.  Comparing the input, output, and validation maps for several models of land change , 2008 .

[24]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[25]  R. Gil Pontius,et al.  Land-cover change model validation by an ROC method for the Ipswich watershed, Massachusetts, USA , 2001 .

[26]  Hossein Shafizadeh-Moghadam,et al.  Coupling machine learning, tree-based and statistical models with cellular automata to simulate urban growth , 2017, Comput. Environ. Urban Syst..

[27]  Biswajeet Pradhan,et al.  Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS , 2012, Comput. Geosci..

[28]  Giles M. Foody,et al.  Feature Selection for Classification of Hyperspectral Data by SVM , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Bryan C. Pijanowski,et al.  Hierarchical modeling of urban growth across the conterminous USA: developing meso-scale quantity drivers for the Land Transformation Model , 2013 .

[30]  Amin Tayyebi,et al.  Increases in the climate change adaption effectiveness and availability of vegetation across a coastal to desert climate gradient in metropolitan Los Angeles, CA, USA. , 2016, The Science of the total environment.

[31]  Xia Li,et al.  Cellular automata for simulating land use changes based on support vector machines , 2008, Comput. Geosci..

[32]  Robert Gilmore Pontius,et al.  The total operating characteristic to measure diagnostic ability for multiple thresholds , 2014, Int. J. Geogr. Inf. Sci..

[33]  Junmei Tang,et al.  Spatio‐temporal urban landscape change analysis using the Markov chain model and a modified genetic algorithm , 2007 .

[34]  Biswajeet Pradhan,et al.  A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS , 2013, Comput. Geosci..

[35]  E. Lambin,et al.  The emergence of land change science for global environmental change and sustainability , 2007, Proceedings of the National Academy of Sciences.

[36]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[37]  Andrew Ford,et al.  Pseudo-absences, pseudo-models and pseudo-niches: pitfalls of model selection based on the area under the curve , 2012, Int. J. Geogr. Inf. Sci..

[38]  Division on Earth Advancing Land Change Modeling: Opportunities and Research Requirements , 2014 .

[39]  Hossein Shafizadeh-Moghadam,et al.  Sensitivity analysis and accuracy assessment of the land transformation model using cellular automata , 2017 .

[40]  Randy L. Haupt,et al.  Practical Genetic Algorithms , 1998 .

[41]  Robert Gilmore Pontius,et al.  Useful techniques of validation for spatially explicit land-change models , 2004 .

[42]  B. Pijanowski,et al.  Urban expansion and its consumption of high-quality farmland in Beijing, China , 2015 .

[43]  P. J. García Nieto,et al.  Hybrid modelling based on support vector regression with genetic algorithms in forecasting the cyanotoxins presence in the Trasona reservoir ( Northern Spain ) , 2013 .

[44]  Jie Shan,et al.  Genetic Algorithms for the Calibration of Cellular Automata Urban Growth Modeling , 2008 .

[45]  Jonah Gamba,et al.  Simulating Urban Growth Using a Random Forest-Cellular Automata (RF-CA) Model , 2015, ISPRS Int. J. Geo Inf..

[46]  Mohsen Ahadnejad Reveshty,et al.  Integrating cellular automata, artificial neural network, and fuzzy set theory to simulate threatened orchards: application to Maragheh, Iran , 2016 .

[47]  Eric F. Lambin,et al.  Land-use and land-cover change : local processes and global impacts , 2010 .

[48]  Lawrence. Davis,et al.  Handbook Of Genetic Algorithms , 1990 .

[49]  Damaris Zurell,et al.  Collinearity: a review of methods to deal with it and a simulation study evaluating their performance , 2013 .

[50]  Xiaojun Yang,et al.  Modelling urban growth and landscape changes in the Atlanta metropolitan area , 2003, Int. J. Geogr. Inf. Sci..

[51]  Chris Brunsdon,et al.  Geographically Weighted Regression: The Analysis of Spatially Varying Relationships , 2002 .

[52]  Huasheng Hong,et al.  New insight into the correlations between land use and water quality in a coastal watershed of China: Does point source pollution weaken it? , 2016, The Science of the total environment.

[53]  Mustafa Neamah Jebur,et al.  Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS , 2014 .

[54]  Javier M. Moguerza,et al.  Support Vector Machines with Applications , 2006, math/0612817.

[55]  Hichem Omrani,et al.  FSAUA: A framework for sensitivity analysis and uncertainty assessment in historical and forecasted land use maps , 2016, Environ. Model. Softw..

[56]  Bryan C. Pijanowski,et al.  Modeling multiple land use changes using ANN, CART and MARS: Comparing tradeoffs in goodness of fit and explanatory power of data mining tools , 2014, Int. J. Appl. Earth Obs. Geoinformation.

[57]  Zhiyong Hu,et al.  Modeling urban growth in Atlanta using logistic regression , 2007, Comput. Environ. Urban Syst..

[58]  Farhad Samadzadegan,et al.  SVM-based hyperspectral image classification using intrinsic dimension , 2013, Arabian Journal of Geosciences.

[59]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[60]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[61]  Hossein Shafizadeh-Moghadam,et al.  Performance analysis of radial basis function networks and multi-layer perceptron networks in modeling urban change: a case study , 2015, Int. J. Geogr. Inf. Sci..