Machine learning for predicting landslide risk of Rohingya refugee camp infrastructure

ABSTRACT Since the dawn of human civilization, forced migration scenarios have been witnessed in different regions and populations, and is still present in the twenty-first century. The current largest population of stateless refugees in the world, the Rohingya people, reside in the southeastern border region of Bangladesh. Due to rapid expansion of refugee camps and lack of suitable locations, a large proportion of the infrastructure are at risk of landslides. This study aims to use machine learning for predicting landslide risk of camp infrastructure using geospatial features. Four supervised classification algorithms have been employed viz., (i) Logistic Regression (LR), (ii) Multi-Layer Perceptron (MLP), (iii) Gradient Boosted Trees (GBT) and (iv) Random Forest (RF) and applied on preprocessing varied versions of features. Results show that RF achieves accuracy of 76.19% and AUC of 0.76 on un-scaled features which is higher than all other algorithms. The applications of the study reside in refugee management and landslide susceptibility mapping of Rohingya camps, which can both potentially save refugee lives and serve as a case study for global applications.

[1]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[2]  Hamid Reza Pourghasemi,et al.  Erratum to: Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2016, Landslides.

[3]  Travis E. Oliphant,et al.  Guide to NumPy , 2015 .

[4]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[5]  J. Friedman Stochastic gradient boosting , 2002 .

[6]  Gunnar Rätsch,et al.  The Feature Importance Ranking Measure , 2009, ECML/PKDD.

[7]  B. Pradhan,et al.  A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility , 2017 .

[8]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[9]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[10]  H. McConachie,et al.  Displaced Rohingya children at high risk for mental health problems: Findings from refugee camps within Bangladesh , 2018, Child: care, health and development.

[11]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[12]  K. Verdin,et al.  New Global Hydrography Derived From Spaceborne Elevation Data , 2008 .

[13]  F. Dai,et al.  Assessment of land-slide susceptibility on the natural terrain of Lantau Island , 2001 .

[14]  R. Sidle,et al.  Watershed and Forest Management for Landslide Risk Reduction , 2009 .

[15]  Edzer Pebesma,et al.  Applied Spatial Data Analysis with R. Springer , 2008 .

[16]  Saro Lee,et al.  Application of logistic regression model and its validation for landslide susceptibility mapping using GIS and remote sensing data , 2005 .

[17]  Weitao Chen,et al.  Forested landslide detection using LiDAR data and the random forest algorithm: A case study of the Three Gorges, China , 2014 .

[18]  Dieu Tien Bui,et al.  A novel hybrid approach of landslide susceptibility modelling using rotation forest ensemble and different base classifiers , 2019, Geocarto International.

[19]  Thomas A. Hennig,et al.  The Shuttle Radar Topography Mission , 2001, Digital Earth Moving.

[20]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[21]  B. Pradhan,et al.  A comparative assessment of flood susceptibility modeling using Multi-Criteria Decision-Making Analysis and Machine Learning Methods , 2019, Journal of Hydrology.

[22]  B. Pham,et al.  Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. , 2019, The Science of the total environment.

[23]  A. Zhu,et al.  GIS-based landslide susceptibility evaluation using a novel hybrid integration approach of bivariate statistical based random forest method , 2018 .

[24]  R. Fell Landslide risk assessment and acceptable risk , 1994 .

[25]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility assessment in Lianhua County (China); a comparison between a random forest data mining technique and bivariate and multivariate statistical models , 2016 .

[26]  Daniel C. Ehlman,et al.  Rapid behavioral assessment of barriers and opportunities to improve vaccination coverage among displaced Rohingyas in Bangladesh, January 2018. , 2019, Vaccine.

[27]  Isik Yilmaz,et al.  Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat - Turkey) , 2009, Comput. Geosci..

[28]  K. McCONNACHIE The Rohingya refugee crisis , 2015 .

[29]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[30]  Zhi-Hua Zhou,et al.  Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[31]  Ray Bareiss,et al.  Concept Learning and Heuristic Classification in WeakTtheory Domains , 1990, Artif. Intell..

[32]  Michael Dixon,et al.  Google Earth Engine: Planetary-scale geospatial analysis for everyone , 2017 .

[33]  A. Ullah Rohingya Refugees to Bangladesh: Historical Exclusions and Contemporary Marginalization , 2011 .

[34]  Bayes Ahmed,et al.  Application of Bivariate and Multivariate Statistical Techniques in Landslide Susceptibility Modeling in Chittagong City Corporation, Bangladesh , 2017, Remote. Sens..

[35]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[36]  B. Pradhan,et al.  Delineation of landslide hazard areas on Penang Island, Malaysia, by using frequency ratio, logistic regression, and artificial neural network models , 2010 .

[37]  B. Ahmed,et al.  Humanitarian disaster for Rohingya refugees: impending natural hazards and worsening public health crises. , 2018, The Lancet. Global health.

[38]  C. F. Lee,et al.  Assessment of landslide susceptibility on the natural terrain of Lantau Island, Hong Kong , 2001 .

[39]  J. Kent Information gain and a general measure of correlation , 1983 .

[40]  Richard Bernknopf,et al.  A Probabilistic Approach to Landslide Hazard Mapping in Cincinnati, Ohio, with Applications for Economic Evaluation , 1988 .

[41]  K. Islam,et al.  Massive diphtheria outbreak among Rohingya refugees: lessons learnt. , 2019, Journal of travel medicine.

[42]  Munshi Khaledur Rahman,et al.  Rohingya Refugee Crisis and Forest Cover Change in Teknaf, Bangladesh , 2018, Remote. Sens..

[43]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[44]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[45]  Saro Lee,et al.  Landslide susceptibility mapping using random forest and boosted tree models in Pyeong-Chang, Korea , 2018 .

[46]  J. Seibert,et al.  On the calculation of the topographic wetness index: evaluation of different methods based on field observations , 2005 .

[47]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[48]  M. Aniya Landslide‐Susceptibility Mapping in the Amahata River Basin, Japan , 1985 .

[49]  N. Kipgen Addressing the Rohingya Problem , 2014 .

[50]  L. Ayalew,et al.  The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan , 2005 .

[51]  Chris Lewa North Arakan: an open prison for the Rohingya in Burma , 2009 .

[52]  Saro Lee,et al.  Landslide susceptibility mapping in the Damrei Romel area, Cambodia using frequency ratio and logistic regression models , 2006 .

[53]  Guangqing Chi,et al.  Applied Spatial Data Analysis with R , 2015 .

[54]  A. Cowley,et al.  The Slow-Burning Genocide of Myanmar's Rohingya , 2014 .

[55]  M. Marjanović,et al.  Landslide susceptibility assessment using SVM machine learning algorithm , 2011 .

[56]  N. Alam,et al.  Field epidemiology in action: an Australian perspective of epidemic response to the Rohingya health emergencies in Cox’s Bazar, Bangladesh , 2019, Global Biosecurity.

[57]  F. Renaud,et al.  The Role of Ecosystems in Disaster Risk Reduction , 2013 .

[58]  André Stumpf,et al.  Object-oriented mapping of landslides using Random Forests , 2011 .

[59]  Biswajeet Pradhan,et al.  Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling , 2010, Environ. Model. Softw..

[60]  Roger Bivand,et al.  Bindings for the Geospatial Data Abstraction Library , 2015 .

[61]  E. Rotigliano,et al.  Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: application to the 2009 storm event in Messina (Sicily, southern Italy) , 2015, Natural Hazards.

[62]  S. Castles Towards a Sociology of Forced Migration and Social Transformation , 2003 .

[63]  C. Gokceoğlu,et al.  Landslide susceptibility mapping of the slopes in the residual soils of the Mengen region (Turkey) by deterministic stability analyses and image processing techniques , 1996 .

[64]  Veronica Tofani,et al.  Landslide susceptibility estimation by random forests technique: sensitivity and scaling issues , 2013 .

[65]  E. Yesilnacar,et al.  Landslide susceptibility mapping : A comparison of logistic regression and neural networks methods in a medium scale study, Hendek Region (Turkey) , 2005 .

[66]  Fred L. Drake,et al.  The Python Language Reference Manual , 1999 .

[67]  Saro Lee,et al.  Statistical analysis of landslide susceptibility at Yongin, Korea , 2001 .

[68]  E. Pebesma,et al.  Classes and Methods for Spatial Data , 2015 .

[69]  Boris Schröder,et al.  How can statistical models help to determine driving factors of landslides , 2012 .

[70]  Bayes Ahmed,et al.  Landslide susceptibility modelling applying user-defined weighting and data-driven statistical techniques in Cox’s Bazar Municipality, Bangladesh , 2015, Natural Hazards.

[71]  R. Nagarajan,et al.  Landslide hazard susceptibility mapping based on terrain and climatic factors for tropical monsoon regions , 2000 .

[72]  J. Leaning,et al.  The Rohingya people of Myanmar: health, human rights, and identity , 2017, The Lancet.

[73]  Kaitlin P. Ward,et al.  Assessing mental health and service needs among refugees in Malaysia , 2019 .

[74]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[75]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[76]  S. DeYoung,et al.  Maternal health services for refugee populations: Exploration of best practices , 2018, Global public health.

[77]  B. Pradhan,et al.  Landslide hazard mapping at Selangor, Malaysia using frequency ratio and logistic regression models , 2007 .

[78]  John C. Davis,et al.  Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas, USA , 2003 .

[79]  Salman Majeed Islamophobia and the Mental Health of Rohingya Refugees , 2018, Islamophobia and Psychiatry.

[80]  M. Eeckhaut,et al.  Prediction of landslide susceptibility using rare events logistic regression: A case-study in the Flemish Ardennes (Belgium) , 2006 .

[81]  Tamanna Motahar,et al.  Understanding the political ecology of forced migration and deforestation through a multi-algorithm classification approach: the case of Rohingya displacement in the southeastern border region of Bangladesh , 2019, Geology, Ecology, and Landscapes.

[82]  B. Ahmed Landslide susceptibility mapping using multi-criteria evaluation techniques in Chittagong Metropolitan Area, Bangladesh , 2015, Landslides.

[83]  Vanessa Souza Araujo,et al.  Using Resistin, Glucose, Age and BMI and Pruning Fuzzy Neural Network for the Construction of Expert Systems in the Prediction of Breast Cancer , 2019, Mach. Learn. Knowl. Extr..

[84]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[85]  Hema Letchamanan Myanmar?s Rohingya Refugees in Malaysia: Education and the Way Forward (doi: 10.14425/00.50.24) , 2013 .