An Automatic Method for Deriving OWL Ontologies from XML Documents

In the last decade, the field of Big Data Analytics has become increasingly important in both the academic and the business communities. Typically, data are mostly structured, collected by different actors through various heterogeneous and distributed information sources, and stored and managed often directly in XML. In order to enable large volume of data to be described in such a way that their meaning can be exploited by machines and, thus, semantic queries and automatic inferential procedures can be enabled, this paper presents an automatic method to derive OWL ontologies from XML schemas. The main contribution of this method relies on the possibility of producing a target ontology starting from multiple XML schemas, by discriminating between domain and cross-domain entities and, contextually, simplifying the overall structure of the final ontology generated, i.e. By eliminating not-used cross-domain entities. This method has been applied to a concrete application case in the healthcare domain, with the goal of generating an ontological model from the XML schemas implementing the HL7 Version 3 Clinical Document Architecture Release 2.

[1]  Nektarios Gioldasis,et al.  Querying XML Data with SPARQL , 2009, DEXA.

[2]  Christian Zirpins,et al.  Lifting XML Schema to OWL , 2004, ICWE.

[3]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[4]  Geert-Jan Houben,et al.  RDF-Based Architecture for Semantic Integration of Heterogeneous Information Sources , 2001, Workshop on Information Integration on the Web.

[5]  Mohamed Bahaj,et al.  Restructuring of XML Documents in the Form of Ontologies , 2014 .

[6]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[7]  Jiuyun Xu,et al.  Using Relational Database to Build OWL Ontology from XML Data Sources , 2007, 2007 International Conference on Computational Intelligence and Security Workshops (CISW 2007).

[8]  Michel C. A. Klein Interpreting XML documents via an RDF schema ontology , 2002, Proceedings. 13th International Workshop on Database and Expert Systems Applications.

[9]  Daniel J. Vreeman,et al.  Logical Observation Identifiers Names and Codes (LOINC®) users' guide , 2010 .

[10]  Paul J. Walmsley,et al.  XML Schema Part 0: Primer Second Edition , 2004 .

[11]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[12]  Sören Auer,et al.  Mapping XML to OWL Ontologies , 2005, Leipziger Informatik-Tage.

[13]  Lee Min Lau,et al.  Mapping Department of Defense Laboratory Results to Logical Observation Identifiers Names and Codes (LOINC®) , 2005, AMIA.

[14]  Nadine Cullot,et al.  Building Ontologies from XML Data Sources , 2009, 2009 20th International Workshop on Database and Expert Systems Application.

[15]  Peter F. Patel-Schneider,et al.  Transforming XML Schema to OWL Using Patterns , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.

[16]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .