Extracting Patterns from Socioeconomic Databases to Characterize Small Farmers with High and Low Corn Yields in Mozambique: a Data Mining Approach

Mozambique is a predominantly rural country. Agriculture is a pillar of the Mozambican economy and the main source of income for 80% of the population living in rural areas. One of the major problems in the agricultural sector is low productivity, which for most crops is the lowest in Africa. The main food crop cultivated in Mozambique is maize. This research aims to characterize households with high and low maize yields, based on National Agricultural Survey data from 2007 and 2008, using a data mining approach. To this end, we used three techniques: a) decision trees, b) association rules, and c) classification rules. The results show that households with high maize yields are those with the capacity to generate income through the commercialization of their production and through agricultural assets. Households with low maize yields are associated with production losses before harvest, which result in food insecurity.
