Data mining applications for diverse industrial application domains with smart archive

The increasing significance of data mining in many application domains of computational technology has resulted in a considerable body of work concerned with designing architectural models and collections of reusable software components to allow for more rapid deployment of data mining methods wherever they are deemed useful. Early work involved integrating machine learning algorithms and knowledge management features into class libraries; more recently the data mining research community has progressed towards application frameworks, which are based on the notion of a reusable high-level design rather than specific algorithms. Smart Archive provides such a design, fine-tuned for applications that process continuous measurements and make use of historical data. In this paper a pre-existing foundational framework is extended with a new layer of services and the overall system architecture established by the framework is examined. Two case studies drawn from diverse application domains—steelmaking and personal fitness products—are presented in order to validate the proposed design. The case studies show that Smart Archive is beneficial in integration, coding and testing tasks and suitable for both online and batch processing and for both localized and distributed applications.

[1]  Ingolf Geist,et al.  A framework for data mining and KDD , 2002, SAC '02.

[2]  Ian Witten,et al.  Data Mining , 2000 .

[3]  Juha Röning,et al.  Smart archive: a component-based data mining application framework , 2005, 5th International Conference on Intelligent Systems Design and Applications (ISDA'05).

[4]  Yi Zhi Zhao,et al.  Knowledge discovery application framework for manufacturing execution, diagnosis and optimization , 2003, IEEE International Conference on Industrial Informatics, 2003. INDIN 2003. Proceedings..

[5]  Juha Röning,et al.  Modelling the strength of steel plates using regression analysis and neural networks , .

[6]  William M. Pottenger,et al.  A software infrastructure for research in textual data mining , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[7]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[8]  Ingo Mierswa,et al.  YALE: rapid prototyping for complex data mining tasks , 2006, KDD '06.

[9]  David A. Bell,et al.  Designing a Kernel for Data Mining , 1997, IEEE Expert.

[10]  William M. Pottenger,et al.  A Software Infrastructure for Research in Textual Data Mining , 2004, Int. J. Artif. Intell. Tools.

[11]  Douglas C. Schmidt,et al.  Object-oriented application frameworks , 1997, CACM.