Learning Dynamics of Pesticide Abuse through Data Mining

Recent studies by agriculture researchers in Pakistan have shown that attempts of crop yield maximization through pro-pesticide state policies have led to a dangerously high pesticide usage. These studies have reported a negative correlation between pesticide usage and crop yield in Pakistan. Hence excessive use (or abuse) of pesticides is harming the farmers with adverse financial, environmental and social impacts. In this work we have shown that how data mining integrated agricultural data including pest scouting, pesticide usage and meteorological recordings is useful for optimization (and reduction) of pesticide usage. The data used in this work has never been utilized in this manner ever before. We have performed unsupervised clustering of this data through Recursive Noise Removal (RNR) heuristic of Abdullah and Brobst (2003). These clusters reveal interesting patterns of farmer practices along with pesticide usage dynamics and hence help identify the reasons for this pesticide abuse.

[2]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[3]  Ian H. Witten,et al.  Applying a Machine Learning Workbench: Experience with Agricultural Databases , 1996 .

[4]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[5]  H. Wallace,et al.  New Frontiers , 1934 .

[6]  Lloyd A. Smith,et al.  Practical feature subset selection for machine learning , 1998 .

[7]  Vladik Kreinovich,et al.  Some practical applications of soft computing and data mining , 2001 .

[8]  Karl Branting,et al.  CARMA: A Case-Based Rangeland Management Adviser , 2002, AI Mag..

[9]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[10]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[11]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[12]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[13]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[14]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[15]  A. Abdullah On placement of logic schematics , 1993, Proceedings of TENCON '93. IEEE Region 10 International Conference on Computers, Communications and Automation.

[16]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[18]  G. Holmes,et al.  Developing innovative applications in agriculture using data mining , 1999 .

[19]  Ian H. Witten,et al.  WEKA: a machine learning workbench , 1994, Proceedings of ANZIIS '94 - Australian New Zealnd Intelligent Information Systems Conference.

[20]  Andrew W. Moore,et al.  Locally Weighted Learning , 1997, Artificial Intelligence Review.

[21]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[22]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[23]  P. Avesani,et al.  Feeding Data Mining , 2002 .

[24]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[25]  Camil Demetrescu,et al.  Breaking cycles for minimizing crossings , 2001, JEAL.

[26]  Ian H. Witten,et al.  Induction of model trees for predicting continuous classes , 1996 .

[27]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[28]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[29]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[30]  Susan T. Dumais,et al.  Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.

[31]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.