Knowledge-Rich Similarity-Based Classification

This paper proposes to enhance similarity-based classification with different types of imperfect domain knowledge. We introduce a hierarchy of knowledge types and show how the types can be incorporated into similarity measures. Furthermore, we analyze how properties of the domain theory, such as partialness and vagueness, influence classification accuracy. Experiments in a simple domain suggest that partial knowledge is more useful than vague knowledge. However, for data sets from the UCI Machine Learning Repository, we show that even vague domain knowledge that in isolation performs at chance level can substantially increase classification accuracy when being incorporated into similarity-based classification.

[1]  Ray Bareiss,et al.  Concept Learning and Heuristic Classification in WeakTtheory Domains , 1990, Artif. Intell..

[2]  Ralph Bergmann,et al.  Explanation-Based Similarity: A Unifying Approach for Integrating Domain Knowledge into Case-Based Reasoning for Diagnosis and Planning Tasks , 1993, EWCBR.

[3]  Ralph Bergmann,et al.  Techniques and Knowledge Used for Adaptation During Case-Based Problem Solving , 1998, IEA/AIE.

[4]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[5]  W AhaDavid Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms , 1992 .

[6]  Peter D. Turney The Management of Context-Sensitive Features: A Review of Strategies , 2002, ArXiv.

[7]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[8]  Paul E. Utgoff,et al.  Automatic Feature Generation for Problem Solving Systems , 1992, ML.

[9]  Pavel Brazdil,et al.  Proceedings of the European Conference on Machine Learning , 1993 .

[10]  Padraig Cunningham,et al.  Using Introspective Learning to Improve Retrieval in CBR: A Case Study in Air Traffic Control , 1997, ICCBR.

[11]  Jude Shavlik,et al.  Refinement ofApproximate Domain Theories by Knowledge-Based Neural Networks , 1990, AAAI.

[12]  David L. Waltz,et al.  Trading MIPS and memory for knowledge engineering , 1992, CACM.

[13]  Raymond J. Mooney,et al.  Constructive Induction in Theory Refinement , 1991, ML.

[14]  S. Salzberg A nearest hyperrectangle learning method , 2004, Machine Learning.

[15]  Agnar Aamodt,et al.  Explanation-Driven Case-Based Reasoning , 1993, EWCBR.

[16]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[17]  Jörg Walter Schaaf,et al.  Detecting Gestalts in CAD-Plans to be Used as Indices for Case-Retrieval in Architecture , 1994, KI.

[18]  Timo Steffens,et al.  Adapting Similarity Measures to Agent Types in Opponent Modelling , 2004 .

[19]  B. Díaz-Agudo,et al.  Knowledge Intensive CBR Made Affordable , 2001 .

[20]  Paul W. H. Chung,et al.  Proceedings of the Sixth International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems , 1993 .

[21]  David W. Aha,et al.  Tolerating Noisy, Irrelevant and Novel Attributes in Instance-Based Learning Algorithms , 1992, Int. J. Man Mach. Stud..

[22]  R. Bergmann On the Use of Taxonomies for Representing Case Features and Local Similarity Measures , 1998 .

[23]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[24]  David W. Aha,et al.  A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithms , 1997, Artificial Intelligence Review.

[25]  Ryszard S. Michalski,et al.  Conceptual Clustering: Inventing Goal-Oriented Classifications of Structured Objects , 1986 .

[26]  David C. Wilson,et al.  Combining CBR with Interactive Knowledge Acquisition, Manipulation and Reuse , 1999, ICCBR.

[27]  Michael M. Richter,et al.  Fallbasiertes Schliessen: Vergangenheit, Gegenwart, Zukunft , 2003, Inform. Spektrum.

[28]  Timo Steffens,et al.  Similarity Measures Based on Imperfect Domain Theories , 2004 .

[29]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[30]  Charles X. Ling,et al.  Computing Optimal Attribute Weight Settings for Nearest Neighbor Algorithms , 1997, Artificial Intelligence Review.

[31]  Stefan Wess,et al.  Topics in Case-Based Reasoning , 1994 .

[32]  Walter Daelemans,et al.  Generalization performance of backpropagation learning on a syllabification task , 1992 .

[33]  D. Gentner,et al.  Respects for similarity , 1993 .