Improvement of ID3 algorithm based on simplified information entropy and coordination degree

In data classification mining, the decision tree method is a key technique. ID3 (Iterative Dichotomiser 3), proposed by Quinlan, is a well-known decision tree algorithm, but it has several shortcomings: evaluating the information entropy expression is computationally expensive, attribute selection is biased toward attributes with many values (the multi-value bias problem), and the resulting trees are large in scale, among others. To address these problems, an improved ID3 algorithm is proposed that combines simplified information entropy with the coordination degree from rough set theory. Experimental results demonstrate the feasibility of the optimized approach.
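For context, the baseline that the abstract criticizes can be sketched in a few lines. The example below is a minimal illustration of the classic ID3 information gain criterion only; the abstract does not state the simplified entropy expression or the coordination degree formula, so neither is reproduced here. The function names (`entropy`, `information_gain`) and the toy data set are assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Classic ID3 gain from splitting `rows` (a list of dicts) on `attribute`."""
    base = entropy([r[target] for r in rows])
    # Group the target labels by the value each row takes on `attribute`.
    groups = {}
    for r in rows:
        groups.setdefault(r[attribute], []).append(r[target])
    # Weighted average entropy of the partitions after the split.
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


# Hypothetical toy data: "id" is unique per row, "windy" has two values.
rows = [
    {"id": 1, "windy": "no", "play": "yes"},
    {"id": 2, "windy": "no", "play": "yes"},
    {"id": 3, "windy": "yes", "play": "no"},
    {"id": 4, "windy": "yes", "play": "yes"},
]

gains = {a: information_gain(rows, a, "play") for a in ("id", "windy")}
best = max(gains, key=gains.get)  # attribute that plain ID3 would select
```

On this toy data the unique `id` column receives the highest gain because every partition it creates is trivially pure, although it has no predictive value. This is the multi-value bias mentioned above, and each gain evaluation requires repeated logarithm computations, which is the cost a simplified entropy expression would aim to reduce.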
