KNOWLEDGE DISCOVERY FROM LAND RECORD SYSTEMS

SUMMARY Decision support, planning, and policy making are core processes in land management. Performing these tasks may require the examination and analysis of large numbers of land records. In current land management systems, the analysis of data is done by people, which may be slow, inaccurate and require substantial resources. This paper explores data mining as a technique for automating data processing and analysis of land records. Automated data mining may be useful in two ways. Firstly, it may assist in the discovery of important knowledge and patterns of behaviour from large digital datasets, which otherwise would be extremely difficult to do manually. Secondly, automation may reduce the level of skills required to run a system. Scenarios are explored for applying data mining techniques to detect manipulations of land records, where post conflict is the situational context. Of particular interest are fraudulent changes in the land records by the state or powerful factions to grab land from unsuspecting land holders on the ground. The results of preliminary tests on simulated data are reported. Specifically, outlier detection techniques are used to potentially identify patterns such as an unusual number of transactions in short periods of time, and periods with no or very few transactions.

[1]  Simaan M. AbouRizk,et al.  Decision support in construction equipment management using a nonparametric outlier mining algorithm , 2008, Expert Syst. Appl..

[2]  Marti A. Hearst Untangling Text Data Mining , 1999, ACL.

[3]  Christos Makris,et al.  Mining source code elements for comprehending object-oriented systems and evaluating their maintainability , 2006, SKDD.

[4]  S. Keskin,et al.  A land information system for Turkey—a key to the country's sustainable development , 2003 .

[5]  Tong Zhang,et al.  Text Mining: Predictive Methods for Analyzing Unstructured Information , 2004 .

[6]  Zachary Eyler-Walker,et al.  Closing the gap: automated screening of tax returns to identify egregious tax shelters , 2006, SKDD.

[7]  Sholom M. Weiss,et al.  Analytics-driven solutions for customer targeting and sales-force allocation , 2007, IBM Syst. J..

[8]  Paolo Giudici,et al.  Improving Markov Chain Monte Carlo Model Search for Data Mining , 2004, Machine Learning.

[9]  R. Bharat Rao,et al.  Data mining for improved cardiac care , 2006, SKDD.

[10]  Hsin-Chia Fu,et al.  A polygon description based similarity measurement of stock market behavior , 2007, 2007 IEEE Congress on Evolutionary Computation.

[11]  Anselm Haanen,et al.  e-Cadastre - Automation of the New Zealand Survey System , 2002 .

[12]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[13]  J. Zevenbergen,et al.  Legal aspects of land administration in post conflict areas , 2004 .

[14]  Zengyou He,et al.  An Optimization Model for Outlier Detection in Categorical Data , 2005, ICIC.

[15]  Max Bramer,et al.  Principles of Data Mining (Undergraduate Topics in Computer Science) , 2007 .

[16]  Max Bramer,et al.  Principles of Data Mining , 2013, Undergraduate Topics in Computer Science.

[17]  Maria E. Orlowska,et al.  Projected outlier detection in high-dimensional mixed-attributes data set , 2009, Expert Syst. Appl..

[18]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[19]  Rada Mihalcea,et al.  Book Reviews: The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data by Ronen Feldman and James Sanger , 2008, CL.