Discovering relevance knowledge in data: a growing cell structures approach

Both information retrieval and case-based reasoning (CBR) systems rely on effective and efficient selection of relevant data. Typically, relevance in such systems is approximated by similarity or indexing models. However, defining what makes data items similar, or how they should be indexed, is often nontrivial and time-consuming. Based on growing cell structures artificial neural networks, this paper presents a method that automatically constructs a case retrieval model from existing data. Within the CBR framework, the method is evaluated on two medical prognosis tasks: colorectal cancer survival and coronary heart disease risk. The experimental results suggest that the proposed method is effective and robust. To give deeper insight into the underlying mechanisms of the proposed model, a detailed empirical analysis of the model's structural and behavioral properties is also provided.
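To make the idea of a retrieval model built from data more concrete, the following is a minimal sketch of prototype-based case retrieval. It is not the paper's method: the prototypes are fixed by hand rather than grown by a growing cell structures network, and the names `assign_cases` and `retrieve`, the Euclidean distance, and the toy data are all assumptions made for illustration. It only shows the two-stage lookup that such a model could support, where a query is first matched to its nearest cell and then compared only with the cases stored in that cell.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_prototype(vector, prototypes):
    """Index of the prototype (cell centre) closest to the vector."""
    return min(range(len(prototypes)),
               key=lambda p: euclidean(vector, prototypes[p]))


def assign_cases(cases, prototypes):
    """Group case indices under their nearest prototype."""
    cells = {i: [] for i in range(len(prototypes))}
    for idx, case in enumerate(cases):
        cells[nearest_prototype(case, prototypes)].append(idx)
    return cells


def retrieve(query, cases, prototypes, cells, k=2):
    """Return up to k case indices from the query's cell, closest first."""
    best = nearest_prototype(query, prototypes)
    ranked = sorted(cells[best], key=lambda i: euclidean(query, cases[i]))
    return ranked[:k]


# Hypothetical toy data: four cases described by two normalized features.
cases = [(0.1, 0.2), (0.2, 0.1), (0.9, 0.8), (0.8, 0.9)]
# Hand-picked prototypes standing in for cells a trained network would learn.
prototypes = [(0.15, 0.15), (0.85, 0.85)]

cells = assign_cases(cases, prototypes)
result = retrieve((0.12, 0.18), cases, prototypes, cells, k=1)
print(cells, result)
```

In this sketch the query is compared against two prototypes and then against the two cases in the winning cell, rather than against every stored case, which is the efficiency argument for indexing cases by learned cells.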
