Automatic semantic edge labeling over legal citation graphs

A large number of cross-references to various bodies of text are used in legal texts, each serving a different purpose. It is often necessary for authorities and companies to look into certain types of these citations. Yet, there is a lack of automatic tools to aid in this process. Recently, citation graphs have been used to improve the intelligibility of complex rule frameworks. We propose an algorithm that builds the citation graph from a document and automatically labels each edge according to its purpose. Our method uses the citing text only and thus works only on citations who’s purpose can be uniquely identified by their surrounding text. This framework is then applied to the US code. This paper includes defining and evaluating a standard gold set of labels that cover a vast majority of citation types which appear in the “US Code” but are still short enough for practical use. We also proposed a novel linear-chain conditional random field model that extracts the features required for labeling the citations from the surrounding text. We then analyzed the effectiveness of different clustering methods such as K-means and support vector machine to automatically label each citation with the corresponding label. Besides this, we talk about the practical difficulties of this task and give a comparison of human accuracy compared to our end-to-end algorithm.

[1]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[2]  A. Strauss,et al.  The discovery grounded theory: strategies for qualitative inquiry , 1967 .

[3]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[4]  Edwina L. Rissland,et al.  AI and Legal Reasoning , 1985, IJCAI.

[5]  Edwina L. Rissland Artificial Intelligence and Legal Reasoning: A Discussion of the Field and Gardner's Book , 1988, AI Mag..

[6]  Jiming Chen Something Old, Something New, Something Borrowed, Something Blue (reviewing The Bluebook: A Uniform System of Citation ) , 1991 .

[7]  Henry Prakken,et al.  A logical framework for modelling legal argument , 1993, International Conference on Artificial Intelligence and Law.

[8]  Terrill Pollman,et al.  ALWD Citation Manual: A Professional System of Citation , 2000 .

[9]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[10]  Tom M. van Engers,et al.  Automated Detection of Reference Structures in Law , 2006, JURIX.

[11]  Ben Taskar,et al.  An Introduction to Conditional Random Fields for Relational Learning , 2007 .

[12]  A. Antón,et al.  A Systematic Method for Acquiring Regulatory Requirements : A Frame-Based Approach , 2007 .

[13]  Paul Zhang,et al.  Semantics-based legal citation network , 2007, ICAIL.

[14]  E. D. Maat,et al.  UvA-DARE (Digital Academic Repository) Making sense of legal texts , 2012 .

[15]  Abdelwahab Hamou-Lhadj,et al.  Citation Analysis: An Approach for Facilitating the Understanding and the Analysis of Regulatory Compliance Documents , 2009, 2009 Sixth International Conference on Information Technology: New Generations.

[16]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[17]  Herbert L. Roitblat,et al.  Document categorization in legal electronic discovery: computer classification vs. manual review , 2010, J. Assoc. Inf. Sci. Technol..

[18]  Herbert L. Roitblat,et al.  Document categorization in legal electronic discovery: computer classification vs. manual review , 2010 .

[19]  Achim G. Hoffmann,et al.  LEXA: Towards Automatic Legal Citation Classification , 2010, Australasian Conference on Artificial Intelligence.

[20]  R. Posner The Bluebook Blues (reviewing Harvard Law Review Association, The Bluebook: A Uniform System of Citation (19th ed., 2010)) , 2011 .

[21]  Abdelwahab Hamou-Lhadj,et al.  An approach based on citation analysis to support effective handling of regulatory compliance , 2011, Future Gener. Comput. Syst..

[22]  Annie I. Antón,et al.  A legal cross-references taxonomy for reasoning about compliance requirements , 2012, Requirements Engineering.

[23]  Omar Alonso,et al.  Using crowdsourcing for TREC relevance assessment , 2012, Inf. Process. Manag..

[24]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[25]  T. Neale Citation Analysis of Canadian Case Law , 2013 .

[26]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[27]  Akira Shimazu,et al.  Automated reference resolution in legal texts , 2013, Artificial Intelligence and Law.

[28]  Radboud Winkels,et al.  Towards a Legal Recommender System , 2014, JURIX.

[29]  Mehrdad Sabetzadeh,et al.  Automated detection and resolution of legal cross references: Approach and a study of Luxembourg's legislation , 2014, 2014 IEEE 22nd International Requirements Engineering Conference (RE).

[30]  A. Mollalo,et al.  Geographic Information System‐Based Analysis of the Spatial and Spatio‐Temporal Distribution of Zoonotic Cutaneous Leishmaniasis in Golestan Province, North‐East of Iran , 2014, Zoonoses and public health.

[31]  Johan Karlsson,et al.  Automatic target recognition using discrimination based on optimal transport , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Daisy Zhe Wang,et al.  Consensus Maximization Fusion of Probabilistic Information Extractors , 2016, HLT-NAACL.

[33]  Silvio Savarese,et al.  Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Boqing Gong,et al.  Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Karl Branting Data-centric and logic-based models for automated legal problem solving , 2017, Artificial Intelligence and Law.

[36]  Ilker Demirkol,et al.  Mobility Management as a Service for 5G Networks , 2017, ArXiv.

[37]  Zheng Cao,et al.  Marine Animal Classification With Correntropy-Loss-Based Multiview Learning , 2019, IEEE Journal of Oceanic Engineering.