Using D2K Data Mining Platform for Understanding the Dynamic Evolution of Land-Surface Variables

The objective of our project is to develop data mining and knowledge discovery in databases (KDD) techniques, using the “Data to Knowledge” (D2K) platform developed by National Center for Supercomputing Application (NCSA), to facilitate analysis, visualization and modeling of land-surface variables obtained from the TERRA and AQUA platforms in support of climate and weather applications. The project is developing capacity to access very large multivariate datasets; represent heterogeneous data types; integrate multiple GIS data sets stored in many GIS file formats; analyze variable relationships and model their dependencies using cluster and grid computing; and visualize input data, as well, as extracted features, integrated data sets and data mining results.