This work attempt to developing the FP-Growth data mining algorithm through use several knowledge constructions to build up a novel tool called Frequency Pattern-Knowledge Constructions (FP-KC) to find the association rules and to satisfy the goal of dimension reduction methods is using the correlation structure among the predicator variables by reduction the main three dimensions (features, samples and value of features). FP-KC attempts to combine between the features of principle component analysis and frequency pattern growth. This done using the three criteria (Eigenvalue, cumulative variability and Scree plot). There are many reasons for developing the FP-Growth data mining algorithm in build up a novel algorithm FP-KC to find the association rules: (a) the size of an FP-tree is typically smaller than the size of the uncompressed data because many records in dataset often share a few items in common.(b) Given the best result, if all the records have the same set of items, and this point always satisfy in the scientific dataset. (c) FP-growth is an efficient algorithm because it illustrates how a compact representation of the transaction data set helps to efficiently generate frequent item sets. (d) The run-time performance of FP-growth depends on the compaction factor of the data set. The performance of FP-KC test using five huge databases including (Primate splice-junction gene sequences, Diabetes, DNA, GIS and Watermarking). The confidence' degree of the all association rules yield by FP-KC is equal to 95%.
[1]
Lior Rokach,et al.
Data Mining And Knowledge Discovery Handbook
,
2005
.
[2]
Oded Maimon,et al.
Dimension Reduction and Feature Selection
,
2010,
Data Mining and Knowledge Discovery Handbook.
[3]
Daniel T. Larose,et al.
Data Mining Methods and Models: Larose/Data Mining Methods and Models
,
2005
.
[4]
Vipin Kumar,et al.
Introduction to Data Mining
,
2022,
Data Mining and Machine Learning Applications.
[5]
Christopher J. C. Burges,et al.
Geometric Methods for Feature Extraction and Dimensional Reduction - A Guided Tour
,
2005,
Data Mining and Knowledge Discovery Handbook.
[6]
Daniel T. Larose,et al.
Data mining methods and models
,
2006
.
[7]
Larry Bull,et al.
Feature Construction and Selection Using Genetic Programming and a Genetic Algorithm
,
2003,
EuroGP.
[8]
Ashok N. Srivastava,et al.
Data Mining: Concepts, Models, Methods, and Algorithms
,
2005,
J. Comput. Inf. Sci. Eng..
[9]
Daniel T. Larose,et al.
Discovering Knowledge in Data: An Introduction to Data Mining
,
2005
.
[10]
D. Edwards.
Data Mining: Concepts, Models, Methods, and Algorithms
,
2003
.