Identifying Facts for TCBR

This paper explores a method to algorithmically distinguish case-specific facts from potentially reusable or adaptable elements of cases in a textual case-based reasoning (TCBR) system. In the legal domain, documents often contain casespecific facts mixed with case-neutral details of law, precedent, conclusions the attorneys reach by applying their interpretation of the law to the case facts, and other aspects of argumentation that attorneys could potentially apply to similar situations. The automated distinction of these two categories, namely facts and other elements, has the potential to improve quality of automated textual case acquisition. The goal is ultimately to distinguish case problem from solution. To separate fact from other elements, we use an information gain (IG) algorithm to identify words that serve as efficient markers of one or the other. We demonstrate that this technique can successfully distinguish case-specific fact paragraphs from others, and propose future work to overcome some of the limitations of this pilot project.

[1]  Rosina O. Weber,et al.  Investigating Graphs in Textual Case-Based Reasoning , 2004, ECCBR.

[2]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[3]  Rosina O. Weber,et al.  Integrated Approach to Detect Inconspicuous Content , 2005, Wissensmanagement.

[4]  Kevin D. Ashley,et al.  Textual case-based reasoning , 2005, Knowl. Eng. Rev..

[5]  Robert Burgin,et al.  Performance Standards and Evaluations in IR Test Collections: Cluster-Based Retrieval Models , 1997, Inf. Process. Manag..

[6]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[7]  Ivan Koychev,et al.  Feature Selection and Generalisation for Retrieval of Textual Cases , 2004, ECCBR.

[8]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[9]  Janet L. Kolodner,et al.  Case-Based Reasoning , 1988, IJCAI 1989.

[10]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[11]  David C. WilsonComputer,et al.  Cbr Textuality , 1999 .

[12]  Christopher K. Riesbeck,et al.  Inside Case-Based Reasoning , 1989 .

[13]  Stefan Wess,et al.  Case-Based Reasoning Technology: From Foundations to Applications , 1998, Lecture Notes in Computer Science.

[14]  Luc Lamontagne,et al.  Textual Reuse for Email Response , 2004, ECCBR.

[15]  Kevin D. Ashley,et al.  Reasoning with Textual Cases , 2005, ICCBR.

[16]  Rosina O. Weber,et al.  Integrated Approach to Detect Inconspicuous Contents , 2005, Wissensmanagement.

[17]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[18]  Mario Lenz,et al.  Textual CBR , 1998, Case-Based Reasoning Technology.