Decomposition of Data Mining Algorithms into Unified Functional Blocks

The present paper describes the method of creating data mining algorithms from unified functional blocks. This method splits algorithms into independently functioning blocks. These blocks must have unified interfaces and implement pure functions. The method allows us to create new data mining algorithms from existing blocks and improves the existing algorithms by optimizing single blocks or the whole structure of the algorithms. This becomes possible due to a number of important properties inherent in pure functions and hence functional blocks.

[1]  Felix Joachimski Confluence of the coinductive [lambda]-calculus , 2004, Theor. Comput. Sci..

[2]  Jan Vitek,et al.  Evaluating the Design of the R Language - Objects and Functions for Data Analysis , 2012, ECOOP.

[3]  Shamkant B. Navathe,et al.  An Efficient Algorithm for Mining Association Rules in Large Databases , 1995, VLDB.

[4]  Lior Rokach,et al.  Decomposition Methodology for Knowledge Discovery and Data Mining - Theory and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[5]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[6]  Saso Dzeroski,et al.  Combining Multiple Models with Meta Decision Trees , 2000, PKDD.

[7]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[8]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[9]  James Lyle Peterson,et al.  Petri net theory and the modeling of systems , 1981 .

[10]  Philip S. Yu,et al.  Using a Hash-Based Method with Transaction Trimming for Mining Association Rules , 1997, IEEE Trans. Knowl. Data Eng..

[11]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[12]  A. Church,et al.  Some properties of conversion , 1936 .

[13]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[14]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[15]  Nittaya Kerdprasop,et al.  Mining Frequent Patterns with Functional Programming , 2007 .

[16]  Lloyd Allison,et al.  Models for machine learning and data mining in functional programming , 2004, Journal of Functional Programming.

[17]  Sylvia Tippmann,et al.  Programming tools: Adventures with R , 2014, Nature.

[18]  Roberto Souto Maior de Barros,et al.  On the Use of Data Mining Tools for Data Preparation in Classification Problems , 2012, 2012 IEEE/ACIS 11th International Conference on Computer and Information Science.

[19]  Ramakrishnan Kannan,et al.  NIMBLE: a toolkit for the implementation of parallel data mining and machine learning algorithms on mapreduce , 2011, KDD.

[20]  Lior Rokach,et al.  Decomposition Methodology for Knowledge Discovery and Data Mining , 2005, The Data Mining and Knowledge Discovery Handbook.

[21]  Henk Barendregt,et al.  The Lambda Calculus: Its Syntax and Semantics , 1985 .

[22]  Ian H. Witten,et al.  Weka: Practical machine learning tools and techniques with Java implementations , 1999 .

[23]  Tadao Murata,et al.  Petri nets: Properties, analysis and applications , 1989, Proc. IEEE.