Building similarity metrics reflecting utility in case-based reasoning

Fundamental to case-based reasoning is the idea that similar problems have similar solutions. The meaning of the concept of "similarity" can vary in different situations and remains an issue. Since we want to identify and retrieve truly useful or relevant cases for problem solving, the metrics of similarity must be defined suitably to reflect the utility of cases for solving a particular target problem. A framework for utility-oriented similarity modeling is developed in this paper. The main idea is to exploit a case library to obtain adequate samples of utility from pairs of cases. The task of similarity modeling then becomes the customization of the parameters in a similarity metric to minimize the discrepancy between the assessed similarity values and the utility scores desired. A new structure for similarity metrics is introduced which enables the encoding of single feature impacts and more competent approximation of case utility. Preliminary experimental results have shown that the proposed approach can be used for learning with a surprisingly small case base without the risk of over-fitting and that it yields stable system performance with variations in the threshold selected for case retrieval.

[1]  Padraig Cunningham,et al.  Using Introspective Learning to Improve Retrieval in CBR: A Case Study in Air Traffic Control , 1997, ICCBR.

[2]  David W. Aha,et al.  Weighting Features , 1995, ICCBR.

[3]  Armin Stahl,et al.  Learning Feature Weights from Case Order Feedback , 2001, ICCBR.

[4]  David G. Lowe,et al.  Similarity Metric Learning for a Variable-Kernel Classifier , 1995, Neural Computation.

[5]  Susan Craw,et al.  Genetic Algorithms to Optimise CBR Retrieval , 2000, EWCBR.

[6]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[7]  David L. Waltz,et al.  Trading MIPS and memory for knowledge engineering , 1992, CACM.

[8]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[9]  Didier Dubois,et al.  A review of fuzzy set aggregation connectives , 1985, Inf. Sci..

[10]  Armin Stahl,et al.  Using Evolution Programs to Learn Local Similarity Measures , 2003, ICCBR.

[11]  Karl Branting,et al.  Acquiring Customer Preferences from Return-Set Selections , 2001, ICCBR.

[12]  Nick Cercone,et al.  Rule-Induction and Case-Based Reasoning: Hybrid Architectures Appear Advantageous , 1999, IEEE Trans. Knowl. Data Eng..

[13]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[14]  Ron Kohavi,et al.  The Utility of Feature Weighting in Nearest-Neighbor Algorithms , 1997 .

[15]  Padraig Cunningham,et al.  Improving Recommendation Ranking by Learning Personal Feature Weights , 2004, ECCBR.

[16]  Francesco Ricci,et al.  Learning a Local Similarity Metric for Case-Based Reasoning , 1995, ICCBR.

[17]  Werner Dubitzky,et al.  A Genetic Algorithm and Growing Cell Structure Approach to Learning Case Retrieval Structures , 2000, Soft Computing in Case Based Reasoning.