Legal information retrieval and application to e-rulemaking

The complexity and diversity of government regulations make understanding the regulations a non-trivial task. One of the issues is the existence of multiple sources of regulations and interpretive guides; the latter are often independent of governing bodies. This work aims to develop an information infrastructure for legal information retrieval with applications to electronic-rulemaking. The pilot study focuses on accessibility regulations from the US Federal government, private organizations and European agencies. A shallow parser is developed to consolidate different regulations into a unified XML format, which is well suited for handling semi-structured data such as legal documents. Handcrafted rules and a text mining tool are developed to extract the important features, such as concepts, measurements, effective dates and so on, and to incorporate them into the corpus.To compare and locate related provisions from different regulatory documents, we employ Information Retrieval techniques to combine generic features with domain knowledge. Structural information from regulations, such as the hierarchical organization of provisions and heavy referencing among provisions, are used to help improve the relatedness analysis. Results are obtained to illustrate the use of regulatory structure and domain knowledge in provision comparisons. Application to an e-rulemaking scenario for a rights-of-way drafted regulation is shown to demonstrate extended capabilities of the prototype system.

[1]  L. Karl Brenting,et al.  Reasoning with portions of precedents , 1991, ICAIL '91.

[2]  Berthier A. Ribeiro-Neto,et al.  Local versus global link information in the Web , 2003, TOIS.

[3]  K. Branting,et al.  Building Explanations from Rules and Structured Cases , 1991, Int. J. Man Mach. Stud..

[4]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[5]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[6]  C. Lee Giles,et al.  CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications , 1998, AGENTS '98.

[7]  John Zeleznikow,et al.  Building intelligent legal information systems : representation and reasoning in law , 1994 .

[8]  Kevin D. Ashley,et al.  Improving the representation of legal case texts with information extraction methods , 2001, ICAIL '01.

[9]  Edwina L. Rissland,et al.  BankXX: a program to generate argument through case-base research , 1993, ICAIL '93.

[10]  Peter Jackson,et al.  A machine learning approach to prior case retrieval , 2001, ICAIL '01.

[11]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[12]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[13]  Bruce M. McLaren,et al.  Extensionally defining principles and cases in ethics: An AI model , 2003, Artif. Intell..

[14]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[15]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[16]  Toshio Odanaka,et al.  ADAPTIVE CONTROL PROCESSES , 1990 .

[17]  Alan F. Smeaton,et al.  A Connectivity Analysis Approach to Increasing Precision in Retrieval From Hyperlinked Documents , 1999, TREC.

[18]  Marie-Francine Moens,et al.  Abstracting of legal cases: the SALOMON experience , 1997, ICAIL '97.

[19]  Marek J. Sergot,et al.  The British Nationality Act as a logic program , 1986, CACM.

[20]  Edwina L. Rissland,et al.  CABARET: Rule Interpretation in a Hybrid Architecture , 1991, Int. J. Man Mach. Stud..

[21]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[22]  Edwina L. Rissland,et al.  Dimension-based analysis of hypotheticals from supreme court oral argument , 1988, ICAIL '89.

[23]  Kincho H. Law,et al.  A comparative analysis framework for semi-structured documents, with applications to government regulations , 2004 .

[24]  Berthier A. Ribeiro-Neto,et al.  Link-based and content-based evidential information in a belief network model , 2000, SIGIR '00.

[25]  Edwina L. Rissland,et al.  What You Saw Is What You Want: Using Cases to Seed Information Retrieval , 1997, ICCBR.

[26]  Kevin D. Ashley,et al.  AI and Law: A fruitful synergy , 2003, Artif. Intell..

[27]  Kincho H. Law,et al.  Similarity analysis on government regulations , 2003, KDD '03.

[28]  Jochen Dörre,et al.  Text mining: finding nuggets in mountains of textual data , 1999, KDD '99.

[29]  D. Gentner,et al.  Structure mapping in analogy and similarity. , 1997 .

[30]  Radboud Winkels,et al.  METAlex: Legislation in XML , 2002 .

[31]  Trevor J. M. Bench-Capon Knowledge-based systems and legal applications , 1991 .

[32]  Paul Thompson Automatic categorization of case law , 2001, ICAIL '01.

[33]  Leon Sterling,et al.  JUSTICE: a judicial search tool using intelligent concept extraction , 1999, ICAIL '99.

[34]  John Zeleznikow,et al.  Building intelligent legal information systems , 1994 .

[35]  Gene H. Golub,et al.  Matrix computations , 1983 .

[36]  R. Bellman,et al.  V. Adaptive Control Processes , 1964 .

[37]  Luca Dini,et al.  For the Automated Mark-Up of Italian Legislative Texts in XML , 2002 .

[38]  Andreas Rauber,et al.  Automatic text representation, classification and labeling in European law , 2001, ICAIL '01.

[39]  N Burrows,et al.  The Scottish Executive , 1999 .