ASSOCIATION RULES MINING IN BIG DATA

The paper proposes a method for Big data analyzing in the presence of different data sources and different methods of processing these data. The Big data definition is given, the main problems of data mining process are described. The concept of association rules is introduced and the method of association rules searching for working with Big Data is modified. The method of finding dependencies is developed, efficiency and possibility of its parallelization are determined. The developed algorithm makes it possible to assert that the task of detecting association dependencies in distributed databases belongs to the class of P-tasks. The algorithm for finding association dependencies is well-solved with MapReduce. The low asymptotic complexity of the developed association rules mining algorithm and a wide set of data types supported for analysis allow to apply the proposed algorithm in practically all subject areas working with association dependencies in the data domain.

[1]  Iryna Perova,et al.  FAST MEDICAL DIAGNOSTICS USING AUTOASSOCIATIVE NEURO-FUZZY MEMORY , 2017 .

[2]  Daniel Sánchez,et al.  A New Framework to Assess Association Rules , 2001, IDA.

[3]  Natalya Shakhovska,et al.  Application of algorithms of classification for uncertainty reduction , 2013 .

[4]  Vasyl Lytvyn,et al.  Smart Data Integration by Goal Driven Ontology Learning , 2016, INNS Conference on Big Data.

[5]  Zhen Liu,et al.  MapReduce as a programming model for association rules algorithm on Hadoop , 2010, The 3rd International Conference on Information Sciences and Interaction Sciences.

[6]  Savita Shiwani,et al.  An Efficient Enhancement of Mining Top-K Association Rule , 2014 .

[7]  Kurt Hornik,et al.  Mining Association Rules and Frequent Itemsets , 2015 .

[8]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[9]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[10]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[11]  T. Amudha,et al.  An Improved Association Rule Mining Technique for Xml Data Using Xquery and Apriori Algorithm , 2009, 2009 IEEE International Advance Computing Conference.

[12]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[13]  Daniel Hunyadi,et al.  Performance comparison of apriori and FP-growth algorithms in generating association rules , 2011 .

[14]  S. Vijayarani,et al.  Comparative analysis of association rule mining algorithms , 2016, 2016 International Conference on Inventive Computation Technologies (ICICT).

[15]  Natalya Schahovska Datawarehouse and dataspace — information base of decision support system , 2011, 2011 11th International Conference The Experience of Designing and Application of CAD Systems in Microelectronics (CADSM).

[16]  Parveen Kumar,et al.  FP-tree and COFI Based Approach for Mining of Multiple Level Association Rules in Large Databases , 2010, ICCA 2010.

[17]  Adewale Opeoluwa Ogunde,et al.  A partition enhanced mining algorithm for distributed association rule mining systems , 2015 .

[18]  Eyke Hüllermeier,et al.  Association Rules for Expressing Gradual Dependencies , 2002, PKDD.

[19]  Natalya Shakhovska Consolidated processing for differential information products , 2011, Perspective Technologies and Methods in MEMS Design.

[20]  Sanjeev Rao,et al.  Implementing Improved Algorithm Over APRIORI Data Mining Association Rule Algorithm , 2012 .

[21]  M. Dolores Ruiz,et al.  New Approaches for Discovering Exception and Anomalous Rules , 2011, Int. J. Uncertain. Fuzziness Knowl. Based Syst..