A NEW ALGORITHM FOR OPTIMIZATION OF FUZZY DECISION TREE IN DATA MINING

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification tasks. Nevertheless, there are still a lot of problems especially when dealing with numerical (continuous valued) attributes. Some of those problems can be solved using fuzzy decision trees (FDT). Over the years, additional methodologies have been investigated and proposed to deal with continuous or multi-valued data, and with missing or noisy features. Recently, with the growing popularity of fuzzy representation, a few researchers independently have proposed to utilize fuzzy representation in decision trees to deal with similar situations. Fuzzy representation bridges the gap between symbolic and non symbolic data by linking qualitative linguistic terms with quantitative data. In this paper, a new method of fuzzy decision trees is presented. This method proposed a new method for handling continuous valued attributes with user defined membership. The results of crisp and fuzzy decision trees are compared at the end.

[1]  H. A. Nefeslioglu,et al.  Assessment of Landslide Susceptibility by Decision Trees in the Metropolitan Area of Istanbul, Turkey , 2010 .

[2]  Mark Last,et al.  A compact and accurate model for classification , 2004, IEEE Transactions on Knowledge and Data Engineering.

[3]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[4]  Jason Catlett,et al.  On Changing Continuous Attributes into Ordered Discrete Attributes , 1991, EWSL.

[5]  Oded Maimon Knowledge Discovery and Data Mining : The Info-Fuzzy Network (IFN) Methodology , 2000 .

[6]  Nikola K. Kasabov,et al.  Adaptation and interaction in dynamical systems: Modelling and rule discovery through evolving connectionist systems , 2006, Appl. Soft Comput..

[7]  Günter Seidelmann Using Heuristics to Speed up Induction on Continuous-Valued Attributes , 1993, ECML.

[8]  Alberto Suárez,et al.  Aggregation Ordering in Bagging , 2004 .

[10]  I. Hatono,et al.  Fuzzy decision trees by fuzzy ID3 algorithm and its application to diagnosis systems , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[11]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[12]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[13]  Philip M. Long,et al.  Tracking drifting concepts by minimizing disagreements , 2004, Machine Learning.

[14]  Abraham Kandel,et al.  Anytime Algorithm for Feature Selection , 2000, Rough Sets and Current Trends in Computing.

[15]  Alex Berson,et al.  Data Warehousing, Data Mining, and OLAP , 1997 .

[16]  Usama M. Fayyad,et al.  On the Handling of Continuous-Valued Attributes in Decision Tree Generation , 1992, Machine Learning.

[17]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[18]  Zuhair Bandar,et al.  On constructing a fuzzy inference framework using crisp decision trees , 2006, Fuzzy Sets Syst..

[19]  Christophe G. Giraud-Carrier,et al.  A Note on the Utility of Incremental Learning , 2000, AI Commun..

[20]  Xiaofeng Yang,et al.  CBERS-02 Remote Sensing Data Mining Using Decision Tree Algorithm , 2008, First International Workshop on Knowledge Discovery and Data Mining (WKDD 2008).

[21]  David F. Lobach,et al.  Medical data mining: knowledge discovery in a clinical data warehouse , 1997, AMIA.

[22]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[23]  Reema Thareja,et al.  Data Warehousing , 2018, Encyclopedia of GIS.

[24]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[25]  David E. Culler,et al.  Wireless Sensor Networks - Introduction , 2004, Commun. ACM.

[26]  Philip S. Yu,et al.  On demand classification of data streams , 2004, KDD.

[27]  Trevor Darrell,et al.  MULTIMODAL INTERFACES THAT Flex, Adapt, and Persist , 2004 .

[28]  R. Clarke,et al.  Use of classification and regression trees (CART) to classify remotely-sensed digital images , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[29]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[30]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..