Improving access to and understanding of regulations through taxonomies

Abstract Industrial taxonomies have the potential to automate information retrieval, facilitate interoperability and, most importantly, improve decision making — decisions that must comply with existing government regulations and codes of practice. However, it is difficult to find those regulations and codes most relevant to a particular decision, even though they are now in digital form, and often available online. The focus of this work is to map regulations and codes to existing industry-specific taxonomies that would improve their access and retrieval and facilitate their integration with application programs. Keyword matching is a commonly used technique for mapping from a single taxonomy to a single regulation. In this paper, we examine techniques to address two other mapping problems: from a single taxonomy to multiple regulations and from multiple taxonomies to a single regulation. Those techniques – Cosine similarity, Jaccard coefficient, and market basket analysis – provide metrics for measuring the similarity between concepts from different taxonomies. We discuss these metrics and provide evaluations using examples from the building industry. These examples illustrate the potential regulatory benefits from the mapping between various taxonomies and regulations.

[1]  Un Yong Nahm and Mikhail Bilenko and Raymond J. Mooney,et al.  Two Approaches to Handling Noisy Variation in Text Mining , 2002 .

[2]  Prasenjit Mitra,et al.  An algebra for semantic interoperability of information sources , 2001, Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001).

[3]  J. Fountain Information, Institutions and Governance: Advancing a Basic Social Science Research Program for Digital Government , 2003 .

[4]  Peter Jackson,et al.  A machine learning approach to prior case retrieval , 2001, ICAIL '01.

[5]  William E. Moen The metadata approach to accessing government information , 2001, Gov. Inf. Q..

[6]  Kincho H. Law,et al.  A comparative analysis framework for semi-structured documents, with applications to government regulations , 2004 .

[7]  Gerd Stumme,et al.  Ontology Merging for Federated Ontologies on the Semantic Web , 2001, OIS@IJCAI.

[8]  Natalya Fridman Noy Tools for Mapping and Merging Ontologies , 2004, Handbook on Ontologies.

[9]  Nicolas Bonnel,et al.  Effective Organization and Visualization of Web Search Results , 2006, EuroIMSA.

[10]  Kevin D. Ashley,et al.  Improving the representation of legal case texts with information extraction methods , 2001, ICAIL '01.

[11]  Chinatsu Aone,et al.  Fast and effective text mining using linear-time document clustering , 1999, KDD '99.

[12]  Paul Thompson Automatic categorization of case law , 2001, ICAIL '01.

[13]  Kincho H. Law,et al.  A software infrastructure for regulatory information management and compliance assistance , 2003 .

[14]  Andreas Rauber,et al.  Automatic text representation, classification and labeling in European law , 2001, ICAIL '01.

[15]  Steven R. Ray,et al.  Interoperability Standards in the Semantic Web , 2002, J. Comput. Inf. Sci. Eng..

[16]  Prasenjit Mitra,et al.  An algebraic framework for the interoperation of ontologies , 2004 .

[17]  Kincho H. Law,et al.  Logic-based regulation compliance-assistance , 2003, ICAIL.

[18]  Marie-Francine Moens,et al.  Abstracting of legal cases: the SALOMON experience , 1997, ICAIL '97.

[19]  Albert T. Jones,et al.  Regulation retrieval using industry specific taxonomies , 2008, Artificial Intelligence and Law.

[20]  Trevor J. M. Bench-Capon Knowledge-based systems and legal applications , 1991 .

[21]  K. Roberts,et al.  Thesis , 2002 .

[22]  Christos Papatheodorou,et al.  Integrating knowledge management tools for government information , 2004, Gov. Inf. Q..

[23]  Steffen Staab,et al.  International Handbooks on Information Systems , 2013 .