Foundations of Classification

Classification is one of the main tasks in machine learning, data mining, and pattern recognition. A granular computing model is suggested for learning two basic issues of concept formation and concept relationship identification. A classification problem can be considered as a search for suitable granules organized under a partial order. The structures of search space, solutions to a consistent classification problem, and the structures of solution space are discussed. A classification rule induction method is proposed. Instead of searching for a suitable partition, we concentrate on the search for a suitable covering of the given universe. This method is more general than partition-based methods. For the design of covering granule selection heuristics, several measures on granules are suggested.

[1]  R. Wille Concept lattices and conceptual knowledge systems , 1992 .

[2]  Jorma Rissanen,et al.  SLIQ: A Fast Scalable Classifier for Data Mining , 1996, EDBT.

[3]  Jadzia Cendrowska,et al.  PRISM: An Algorithm for Inducing Modular Rules , 1987, Int. J. Man Mach. Stud..

[4]  Potential Applications of Granular Computing in Knowledge Discovery and Data Mining , 1999 .

[5]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[6]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[7]  Cezary Z. Janikow,et al.  Fuzzy decision trees: issues and methods , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[8]  Yiyu Yao,et al.  Level Construction of Decision Trees in a Partition-based Framework for Classi cation , 2004, SEKE.

[9]  Max Bramer,et al.  Automatic Induction of Classification Rules from Examples Using N-Prism , 2000 .

[10]  Yiyu Yao,et al.  An Analysis of Quantitative Measures Associated with Rules , 1999, PAKDD.

[11]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[12]  Shusaku Tsumoto,et al.  Modelling Medical Diagnostic Rules Based on Rough Sets , 1998, Rough Sets and Current Trends in Computing.

[13]  J. Ross Quinlan,et al.  Learning Efficient Classification Procedures and Their Application to Chess End Games , 1983 .

[14]  Stéphane Demri,et al.  Logical Analysis of Indiscernibility , 1998 .

[15]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[16]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[17]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[18]  Yiyu Yao,et al.  Granular Computing as a Basis for Consistent Classification Problems , 2002 .

[19]  Y. Yao Information granulation and rough set approximation , 2001 .

[20]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[21]  Yiyu Yao,et al.  A Granular Computing Approach to Machine Learning , 2002, FSKD.

[22]  Yiyu Yao,et al.  Induction of Classification Rules by Granular Computing , 2002, Rough Sets and Current Trends in Computing.

[23]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[24]  Wang Guo,et al.  Decision Table Reduction based on Conditional Information Entropy , 2002 .

[25]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[26]  Guoyin Wang Algebra view and information view of rough sets theory , 2001, SPIE Defense + Commercial Sensing.

[27]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[28]  Yiyu Yao,et al.  Granular Computing: basic issues and possible solutions , 2007 .

[29]  Yiyu Yao,et al.  On modeling data mining with granular computing , 2001, 25th Annual International Computer Software and Applications Conference. COMPSAC 2001.

[30]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[31]  Jean-Gabriel Ganascia TDIS : an Algebraic Formalization , 1993, IJCAI.

[32]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[33]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.