Data Analysis using Multidimensional Modeling, Statistical Analysis and Data Mining on Agriculture Parameters

Abstract The volume of agriculture data generated has increased in recent years, driven by the need to assess the impact of agriculture parameters, prepare action plans, and examine agricultural productivity. This data is generally spatio-temporal in nature and may have additional dimensions such as agriculture parameters (agricultural land, arable land, etc.), environmental attributes (CO2 emissions, etc.), and geographical attributes (region, state, etc.). Analyzing this growing data and generating useful results is a challenging task. Various analysis methods are available, each using different parameters to generate results. In this paper, we first build a multidimensional model of the data, then apply multidimensional analysis and statistical analysis (correlation) to the multidimensional model, and finally apply a data mining technique (association rule mining) to the correlated data. This approach lets us build a model and apply advanced techniques such as multidimensional data analysis, statistical analysis, and data mining to extract knowledge from it. Data of this kind is collected by various agencies, such as the World Bank, the IMF, and the Department of Economics and Statistics, as well as by many private agencies such as ORG and AC-Nielsen. We present our approach through a case study that analyzes agricultural productivity using various agriculture-related parameters, based on data available on the World Bank website.
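The three stages named in the abstract (multidimensional model, correlation, association rule mining) can be illustrated with a minimal sketch. This is not the paper's implementation: the rows below are synthetic values invented for illustration (not World Bank data), and the measure names, the `roll_up`, `pearson`, `to_transactions`, and `rule_metrics` helpers, and the mean-based high/low discretization are all assumptions.

```python
from collections import defaultdict
from statistics import mean

# Synthetic fact rows, invented for illustration only (NOT World Bank values).
# Dimensions: region, year. Measures: arable land (%), CO2 per capita, cereal yield.
FACTS = [
    ("Asia", 2000, 10.0, 1.0, 2000.0),
    ("Asia", 2005, 12.0, 1.5, 2400.0),
    ("Europe", 2000, 20.0, 8.0, 4000.0),
    ("Europe", 2005, 22.0, 7.5, 4400.0),
    ("Africa", 2000, 6.0, 0.5, 1000.0),
    ("Africa", 2005, 8.0, 0.6, 1400.0),
]
MEASURES = ("arable_land_pct", "co2_per_capita", "cereal_yield")


def roll_up(facts, dim_index):
    """Average every measure along one dimension (0 = region, 1 = year)."""
    groups = defaultdict(list)
    for row in facts:
        groups[row[dim_index]].append(row[2:])
    return {key: tuple(mean(col) for col in zip(*rows))
            for key, rows in groups.items()}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def to_transactions(facts):
    """Discretize each measure into 'high'/'low' relative to its mean."""
    columns = list(zip(*(row[2:] for row in facts)))
    thresholds = [mean(col) for col in columns]
    return [
        frozenset(f"{name}={'high' if value > limit else 'low'}"
                  for name, value, limit in zip(MEASURES, row[2:], thresholds))
        for row in facts
    ]


def rule_metrics(transactions, antecedent, consequent):
    """Support and confidence of the rule antecedent -> consequent."""
    both = sum(1 for t in transactions if antecedent in t and consequent in t)
    ante = sum(1 for t in transactions if antecedent in t)
    support = both / len(transactions)
    confidence = both / ante if ante else 0.0
    return support, confidence


if __name__ == "__main__":
    print(roll_up(FACTS, 0))                      # measures averaged per region
    arable = [row[2] for row in FACTS]
    crop_yield = [row[4] for row in FACTS]
    print(pearson(arable, crop_yield))            # correlation of two measures
    print(rule_metrics(to_transactions(FACTS),
                       "arable_land_pct=high", "cereal_yield=high"))
```

In this sketch, the rule is only evaluated for a pair of measures that the correlation step has already flagged as related, which mirrors the order described in the abstract; a fuller treatment would enumerate candidate itemsets with an algorithm such as Apriori.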
