A UML-extended Approach for Mining OLAP Data Cubes in Complex Knowledge Discovery Environments

In this paper, we present the theoretical assertions and practical instances of a UML-extended approach for mining OLAP data cubes in complex knowledge discovery environments. This analytical contribution is complemented by a comprehensive set of case studies that demonstrate the feasibility and benefits of the proposed approach in the context of next-generation Data Warehousing/Data Mining platforms.
