论文信息 - How Machine Learning can be Beneficial for Textual Case-Based Reasoning

How Machine Learning can be Beneficial for Textual Case-Based Reasoning

In this paper, we discuss the benefits and limitations of Machine Learning (ML) for Case-Based Reasoning (CBR) in domains where the cases are text documents. In textual CBR, the bottleneck is often indexing new cases. While ML has the potential to help build large case-bases from a small start-up collection by learning to classify texts under the index-terms, we found in experiments with a real CBR system, that the problem is often beyond the power of purely inductive ML. CBR indices are very complex and the number of training instances in a typical case base is too small reliably to generalize from. We argue that adding domain knowledge can help overcome these problems and give illustrating examples.

Kevin D. Ashley | Stefanie Br

[1] Ralf D. Brown,et al. Example-Based Machine Translation in the Pangloss System , 1996, COLING.

[2] Avrim Blum,et al. Empirical Support for Winnow and Weighted-Majority Based Algorithms: Results on a Calendar Scheduling Domain , 1995, ICML.

[3] Tom M. Mitchell,et al. Learning to Extract Symbolic Knowledge from the World Wide Web , 1998, AAAI/IAAI.

[4] James P. Callan,et al. Training algorithms for linear text classifiers , 1996, SIGIR '96.

[5] Thorsten Joachims,et al. A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization , 1997, ICML.

[6] Vincent A. W. M. M. Aleven,et al. Teaching case-based argumentation through a model and examples , 1997 .

[7] Kevin D. Ashley,et al. Using Machine Learning for Assigning Indices to Textual Cases , 1997, ICCBR.

[8] Daphne Koller,et al. Hierarchically Classifying Documents Using Very Few Words , 1997, ICML.