Contextual fuzzy type-2 hierarchies for decision trees (CoFuH-DT) — An accelerated data mining technique

Advanced data mining techniques (ADMT) are very powerful tools for classification, understanding and prediction of object behaviors, providing descriptive relationships between objects such as a customer and a product they intend to buy. ADMT typically consists of classifiers and association techniques, among them, decision trees (DT). However, some important relationships are not readily apparent in a traditional decision tree. In addition, decision trees can grow quite large as the number of dimensions and their corresponding elements increase, requiring significant resources for processing. In either case, rules governing these relationships can be difficult to construct. This paper presents CoFuH-DT, a new algorithm for capturing intrinsic relationships among the nodes of DT, based upon a proposed concept of type-2 fuzzy ldquocontextsrdquo. This algorithm modifies a decision tree, first by generating type-1 fuzzy extensions of the underlying DT criteria or ldquoconditionsrdquo; combining further those extensions into new abstractions overlaid with type-2 contexts. The resulting fuzzy type-2 classification is then able to capture intrinsic relationships that are otherwise non-intuitive. In addition, performing fuzzy setbased operations simplifies the decision tree much faster than traditional search techniques in order to aid in rule construction. Testing presented on a virtual store example demonstrates savings of multiple orders of magnitude in terms of nodes and applicable conditions resulting in 1) reduced complexity of decision tree, 2) ability to data mine difficult to detect interrelationships, 3) substantial acceleration of decision tree search, making it applicable for 4) real-time data mining of new knowledge.

[1]  Gediminas Adomavicius,et al.  Using Data Mining Methods to Build Customer Profiles , 2001, Computer.

[2]  Jörg P. Müller,et al.  A personalized product recommendation algorithm based on preference and intention learning , 2005, Seventh IEEE International Conference on E-Commerce Technology (CEC'05).

[3]  K. McCarty,et al.  Line-of-sight tracking based upon modern heuristics approach , 2008, 2008 3rd IEEE Conference on Industrial Electronics and Applications.

[4]  Min Dai,et al.  Data Mining Used in Rule Design for Active Database Systems , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[5]  Umeshwar Dayal,et al.  A distributed OLAP infrastructure for e-commerce , 1999, Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No.PR00384).

[7]  Hyung Lee-Kwang,et al.  A fuzzy decision tree induction method for fuzzy data , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[8]  Irene S. Y. Kwan A mental cognitive model of Web semantic for e-customer profile , 2002, Proceedings. 13th International Workshop on Database and Expert Systems Applications.

[9]  Juan Sun,et al.  A fuzzy matching method of fuzzy decision trees , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[10]  Fachao Li,et al.  Analysis on the fuzzy filter in fuzzy decision trees , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[11]  Duen-Ren Liu,et al.  A Hybrid of Sequential Rules and Collaborative Filtering for Product Recommendation , 2007, The 9th IEEE International Conference on E-Commerce Technology and The 4th IEEE International Conference on Enterprise Computing, E-Commerce and E-Services (CEC-EEE 2007).

[12]  Andrzej Pieczynski,et al.  Employing fuzzy logic in feature diagrams to model variability in software product-lines , 2003, 10th IEEE International Conference and Workshop on the Engineering of Computer-Based Systems, 2003. Proceedings..

[13]  Michael Hanss,et al.  Applied Fuzzy Arithmetic , 2005 .

[14]  Sung-Shun Weng,et al.  Personalized product recommendation in e-commerce , 2004, IEEE International Conference on e-Technology, e-Commerce and e-Service, 2004. EEE '04. 2004.

[15]  Jerry M. Mendel,et al.  Type-2 fuzzy sets made simple , 2002, IEEE Trans. Fuzzy Syst..

[16]  R. Matsubara,et al.  Application of data mining to customer profile analysis in the power electric industry , 2002, 2002 IEEE Power Engineering Society Winter Meeting. Conference Proceedings (Cat. No.02CH37309).

[17]  Xizhao Wang,et al.  A comparative study on heuristic algorithms for generating fuzzy decision trees , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[18]  Ke-jun Fu,et al.  Using the Data Mining Approach to Determine the Product Preferences of Target Customers , 2007, 2007 International Conference on Service Systems and Service Management.

[19]  Alfred Stein,et al.  Incorporating Uncertainty via Hierarchical Classification Using Fuzzy Decision Trees , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Ke Wang,et al.  Mining Customer Value: From Association Rules to Direct Marketing , 2005, Data Mining and Knowledge Discovery.

[21]  Choochart Haruechaiyasak,et al.  A dynamic framework for maintaining customer profiles in e-commerce recommender systems , 2005, 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service.

[22]  Shichao Zhang,et al.  Product hierarchy-based customer profiles for electronic commerce recommendation , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[23]  Earl Cox,et al.  Fuzzy Modeling And Genetic Algorithms For Data Mining And Exploration , 2005 .