The exploration of legal text corpora with hierarchical neural networks: a guided tour in public international law

The classification of feature vectors representing the interpretation of legal documents improves the search for similar or related documents, the interpretation of these documents as well as the navigation within the text corpus. The need for effective approaches of classification is dramatically increased nowadays due to the advent of massive digital libraries containing free-form legal text documents. What we are looking for are powerful methods for the exploration of such libraries whereby the detection of similarities between groups of documents is the overall goal. In other words, methods that may be used to gain insight in the inherent structure of the various items contained in a text archive are needed. In this paper we present the results from a case study in legal document classification based on an experimental document archive comprising important treaties in public international law. The core task of classification is performed by a non-standard neural network model with a layered architecture consisting of mutually independent unsupervised neural networks. The distinguished features of this learning architecture is the remarkably fast training time combined with the benefit of explicit cluster representation. The access to legal text archives may be enhanced by guided tours providing the means for convenient voyage in an environment of dynamically classified legal documents.

[1]  Gary Marchionini,et al.  A self-organizing semantic map for information retrieval , 1991, SIGIR '91.

[2]  Daniel E. Rose A Symbolic and Connectionist Approach To Legal Information Retrieval , 1994 .

[3]  D. Merkl,et al.  CONCAT - Connotation Analysis of Thesauri Based on the Interpretation of Context Meaning , 1994, DEXA.

[4]  D. Merkl,et al.  Content-based software classification by self-organization , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[5]  Vey Mestdagh NEURAL SCHEMATA IN AUTOMATED JUDICIAL PROBLEM SOLVING , .

[6]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[7]  Trevor J. M. Bench-Capon Neural networks and open texture , 1993, ICAIL '93.

[8]  A Min Tjoa,et al.  Data Mining in Large Free Text Document Archives , 1996, CODAS.

[9]  Risto Miikkulainen,et al.  Subsymbolic natural language processing - an integrated model of scripts, lexicon, and memory , 1993, Neural network modeling and connectionism.

[10]  W. Bruce Croft,et al.  A Comparison of Text Retrieval Models , 1992, Comput. J..

[11]  Dieter Merkl,et al.  The Effects of Lateral Inhibition on Learning Speed and Precision of a Self-Organizing Feature Map , 1995 .

[12]  David Zipser,et al.  Feature Discovery by Competive Learning , 1986, Cogn. Sci..

[13]  Cees Groendijk,et al.  Networks at work: a connectionist approach to non-deductive legal reasoning , 1991, ICAIL '91.

[14]  Timo Honkela,et al.  Contextual Relations of Words in Grimm Tales, Analyzed by Self-Organizing Map , 1995 .

[15]  Gerald Salton,et al.  Automatic text processing , 1988 .

[16]  Dieter Merkl,et al.  A Connectionist View on Document Classification , 1995, Australasian Database Conference.

[17]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[18]  Erich Schweighofer,et al.  The Automatic Generation of Hypertext Links in Legal Documents , 1996, DEXA.

[19]  Marie Cottrell,et al.  A Kohonen map representation to avoid misleading interpretations , 1996, The European Symposium on Artificial Neural Networks.

[20]  Timo Honkela,et al.  Self-Organizing Maps of Document Collections: A New Approach to Interactive Exploration , 1996, KDD.

[21]  Daniel E. Rose,et al.  Legal information retrieval a hybrid approach , 1989, ICAIL '89.

[22]  Lothar Philipps Are Legal Decisions based on the Application of Rules or Prototype Recognition , 1989 .

[23]  Andreas Rauber,et al.  On the Similarity of Eagles, Hawks, and Cows: Visualization of Semantic Similarity in Self-Organizin , 1997 .

[24]  Werner Winiwarter,et al.  Information filtering: the computation of similarities in large corpora of legal texts , 1995, ICAIL '95.

[25]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[26]  A. Ultsch,et al.  Self-Organizing Neural Networks for Visualisation and Classification , 1993 .

[27]  Gary Marchionini,et al.  Finding facts vs. browsing knowledge in hypertext systems , 1988, Computer.

[28]  Spiros Simitis,et al.  Informationskrise des Rechts und Datenverarbeitung , 1970 .

[29]  Jakob Nielsen,et al.  Hypertext and hypermedia , 1990 .

[30]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[31]  Heikki Mannila,et al.  Methods and Problems in Data Mining , 1997, ICDT.

[32]  Richard K. Belew,et al.  A connectionist approach to conceptual information retrieval , 1987, ICAIL '87.

[33]  Dieter Merkl CONTENT-BASED DOCUMENT CLASSIFICATION WITH HIGHLY COMPRESSED INPUT DATA , 1995 .