Ontologies and Databases: The DL-Lite Approach

Ontologies provide a conceptualization of a domain of interest. Nowadays, they are typically represented in terms of Description Logics (DLs), and are seen as the key technology used to describe the semantics of information at various sites. The idea of using ontologies as a conceptual view over data repositories is becoming more and more popular, but for it to become widespread in standard applications, it is fundamental that the conceptual layer through which the underlying data layer is accessed does not introduce a significant overhead in dealing with the data. Based on these observations, in recent years a family of DLs, called DL-Lite, has been proposed, which is specifically tailored to capture basic ontology and conceptual data modeling languages, while keeping low complexity of reasoning and of answering complex queries, in particular when the complexity is measured w.r.t. the size of the data. In this article, we present a detailed account of the major results that have been achieved for the DL-Lite family. Specifically, we concentrate on $DL-Lite_{\mathcal{A},id}$, an expressive member of this family, present algorithms for reasoning and query answering over $DL-Lite_{\mathcal{A},id}$ ontologies, and analyze their computational complexity. Such algorithms exploit the distinguishing feature of the logics in the DL-Lite family, namely that ontology reasoning and answering unions of conjunctive queries is first-order rewritable, i.e., it can be delegated to a relational database management system. We analyze also the effect of extending the logic with typical DL constructs, and show that for most such extensions, the nice computational properties of the DL-Lite family are lost. We address then the problem of accessing relational data sources through an ontology, and present a solution to the notorious impedance mismatch between the abstract objects in the ontology and the values appearing in data sources. The solution exploits suitable mappings that create the objects in the ontology from the appropriate values extracted from the data sources. Finally, we discuss the QUONTO system that implements all the above mentioned solutions and is wrapped by the DIG-QUONTO server, thus providing a standard DL reasoner for $DL-Lite_{\mathcal{A},id}$ with extended functionality to access external data sources.

[1]  Maurizio Lenzerini,et al.  On the Update of Description Logic Ontologies at the Instance Level , 2006, AAAI.

[2]  Diego Calvanese,et al.  DL-Lite in the Light of First-Order Logic , 2007, AAAI.

[3]  Phokion G. Kolaitis,et al.  Conjunctive-query containment and constraint satisfaction , 1998, PODS.

[4]  Umberto Straccia,et al.  Towards Top-k Query Answering in Description Logics: The Case of DL-Lite , 2006, JELIA.

[5]  Salil P. Vadhan,et al.  Computational Complexity , 2005, Encyclopedia of Cryptography and Security.

[6]  Franz Baader,et al.  From Tableaux to Automata for Description Logics , 2003, Fundam. Informaticae.

[7]  Diego Calvanese,et al.  Data Integration throughDL-LiteA Ontologies , 2008, SDKB.

[8]  I. Horrocks,et al.  A Tableau Decision Procedure for $\mathcal{SHOIQ}$ , 2007, Journal of Automated Reasoning.

[9]  Francesco M. Donini,et al.  Deduction in Concept Languages: From Subsumption to Instance Checking , 1994, J. Log. Comput..

[10]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[11]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[12]  Carola Eschenbach,et al.  Formal Ontology in Information Systems , 2008 .

[13]  Michael J. Lacki,et al.  Habitat management for forest-roosting bats of North America: a critical review of habitat studies , 2003 .

[14]  I. Horrocks,et al.  The Instance Store: DL Reasoning with Large Numbers of Individuals , 2004, Description Logics.

[15]  Edward O. Minot,et al.  Breeding success of New Zealand falcons (Falco novaeseelandiae) in a pine plantation , 2009 .

[16]  Mark O. Kimberley,et al.  Diversity and succession of adventive and indigenous vascular understorey plants in Pinus radiata plantation forests in New Zealand , 2003 .

[17]  Volker Haarslev,et al.  Description Logic Systems , 2003, Description Logic Handbook.

[18]  Antonella Poggi,et al.  Ontology-based database access with DIG-Mastro and the OBDA Plugin for Protégé (Demo Description) , 2008, OWLED.

[19]  Hervé Jactel,et al.  Role of exotic pine forests in the conservation of the critically endangered New Zealand ground beetle Holcaspis brevicula (Coleoptera: Carabidae) , 2005 .

[20]  Johann Eder,et al.  Logic and Databases , 1992, Advanced Topics in Artificial Intelligence.

[21]  Frank Wolter,et al.  Can You Tell the Difference Between DL-Lite Ontologies? , 2008, KR.

[22]  Volker Haarslev,et al.  LAS: Extending Racer by a Large ABox Store , 2005, Description Logics.

[23]  Diego Calvanese,et al.  2ATAs make DLs easy , 2002, Description Logics.

[24]  Carsten Lutz,et al.  Temporalising Tractable Description Logics , 2007, 14th International Symposium on Temporal Representation and Reasoning (TIME'07).

[25]  Diego Calvanese,et al.  EQL-Lite: Effective First-Order Query Processing in Description Logics , 2007, IJCAI.

[26]  J. Hayes,et al.  Temporal Variation in Activity of Bats and the Design of Echolocation-Monitoring Studies , 1997 .

[27]  Diego Calvanese,et al.  Reasoning on UML class diagrams , 2005, Artif. Intell..

[28]  C. Tyndale-Biscoe,et al.  Eucalypt Forests as Refuge for Wildlife , 1975 .

[29]  Jane A. Sedgeley Quality of cavity microclimate as a factor influencing selection of maternity roosts by a tree-dwelling bat, Chalinolobus tuberculatus, in New Zealand , 2001 .

[30]  R. Brigham,et al.  Cavity roost site availability and habitat use by bats in different aged riparian cottonwood stands , 2007 .

[31]  José Meseguer,et al.  A logical semantics for object-oriented databases , 1993, SIGMOD '93.

[32]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[33]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[34]  Ronald J. Brachman,et al.  An Overview of the KL-ONE Knowledge Representation System , 1985, Cogn. Sci..

[35]  Ronald E. Thill,et al.  Roost selection by Big Brown Bats in Forests of Arkansas: Importance of Pine Snags and Open Forest Habitats to Males , 2008 .

[36]  Diego Calvanese,et al.  Towards an Open Framework for Ontology Based Data Access with Protégé and DIG 1.1 , 2008, OWLED.

[37]  Werner Nutt,et al.  The Complexity of Concept Languages , 1997, KR.

[38]  Robin Milner,et al.  On Observing Nondeterminism and Concurrency , 1980, ICALP.

[39]  Colin F. J. O'Donnell,et al.  Home range and use of space by Chalinolobus tuberculatus, a temperate rainforest bat from New Zealand , 2001 .

[40]  Gunter Saake,et al.  Logics for databases and information systems , 1998 .

[41]  Mick N. Clout,et al.  Effects of Plantation Forestry on Birds in New Zealand , 1984 .

[42]  Carsten Lutz,et al.  Data Complexity in the EL Family of Description Logics , 2007, LPAR.

[43]  Phokion G. Kolaitis Schema mappings, data exchange, and metadata management , 2005, PODS '05.

[44]  Leonid Libkin,et al.  Data exchange and incomplete information , 2006, PODS '06.

[45]  Hector J. Levesque,et al.  The Tractability of Subsumption in Frame-Based Description Languages , 1984, AAAI.

[46]  T. Spies,et al.  Growing‐Season Microclimatic Gradients from Clearcut Edges into Old‐Growth Douglas‐Fir Forests , 1995 .

[47]  Diego Calvanese,et al.  Reasoning in Expressive Description Logics , 2001, Handbook of Automated Reasoning.

[48]  Peter Crowther,et al.  The DIG Description Logic Interface , 2003, Description Logics.

[49]  Richard J. Barker,et al.  Factors influencing survival and long-term population viability of New Zealand long-tailed bats (Chalinolobus tuberculatus): Implications for conservation , 2005 .

[50]  Bradley Law,et al.  Summer torpor in a free-ranging bat from subtropical Australia , 2003 .

[51]  P. Olsen,et al.  Management of exotic pine plantations in northeast Queensland for goshawks , 2000 .

[52]  Stefano Spaccapietra,et al.  Conceptual Modeling — ER 2002 , 2002, Lecture Notes in Computer Science.

[53]  Franz Baader Augmenting Concept Languages by Transitive Closure of Roles: An Alternative to Terminological Cycles , 1991, IJCAI.

[54]  Diego Calvanese,et al.  Linking Data to Ontologies: The Description Logic DL-Lite_A , 2006, OWLED.

[55]  Diego Calvanese,et al.  Data Complexity of Query Answering in Expressive Description Logics via Tableaux , 2008, Journal of Automated Reasoning.

[56]  Release Notes for Racerpro 1.9.2 Beta , .

[57]  Diego Calvanese,et al.  Expressive Description Logics , 2003, Description Logic Handbook.

[58]  Robert Demolombe,et al.  An Extended Relational Algebra on Abstract Objects for Summarizing Answers to Queries , 2003, Fundam. Informaticae.

[59]  Bradley Law,et al.  Tracks and riparian zones facilitate the use of Australian regrowth forest by insectivorous bats , 2002 .

[60]  Alexander Jane,et al.  Ecology of long-tailed bats Chalinolobus tuberculatus (Forster, 1844) in the Waitakere Ranges: implications for monitoring , 2001 .

[61]  Klaus Schild,et al.  A Correspondence Theory for Terminological Logics: Preliminary Report , 1991, IJCAI.

[62]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[63]  Moshe Y. Vardi Reasoning about The Past with Two-Way Automata , 1998, ICALP.

[64]  Yarden Katz,et al.  Pellet: A practical OWL-DL reasoner , 2007, J. Web Semant..

[65]  Bernhard Nebel,et al.  Computational Complexity of Terminological Reasoning in BACK , 1988, Artif. Intell..

[66]  Giuseppe De Giacomo,et al.  Reasoning in Expressive Description Logics with Fixpoints based on Automata on Infinite Trees , 1999, IJCAI.

[67]  Colin F. J. O'Donnell,et al.  Roost use by long-tailed bats in South Canterbury: examining predictions of roost-site selection in a highly fragmented landscape , 2004 .

[68]  Andrea Calì,et al.  On the Expressive Power of Data Integration Systems , 2002, ER.

[69]  Michael J. Lacki,et al.  Bats in Forests: Conservation and Management , 2007 .

[70]  Christopher Turbill Winter activity of Australian tree-roosting bats: influence of temperature and climatic patterns , 2008 .

[71]  Diego Calvanese,et al.  DL-Lite: Tractable Description Logics for Ontologies , 2005, AAAI.

[72]  C. King,et al.  The handbook of New Zealand mammals , 1990 .

[73]  Maurizio Lenzerini,et al.  On the Approximation of Instance Level Update and Erasure in Description Logics , 2007, AAAI.

[74]  S. Loeb,et al.  Habitat Use by Forest Bats in South Carolina in Relation to Local, Stand, and Landscape Characteristics , 2006 .

[75]  Larry Wos,et al.  What Is Automated Reasoning? , 1987, J. Autom. Reason..

[76]  Michael Uschold,et al.  Ontologies and semantics for seamless connectivity , 2004, SGMD.

[77]  Ian Horrocks,et al.  From SHIQ and RDF to OWL: the making of a Web Ontology Language , 2003, J. Web Semant..

[78]  Bijan Parsia,et al.  Pellet System Description , 2006, Description Logics.

[79]  Diego Calvanese,et al.  Data Integration through DL-LiteA Ontologies , 2008, SDKB 2008.

[80]  Moshe Y. Vardi The complexity of relational query languages (Extended Abstract) , 1982, STOC '82.

[81]  Andrea Calì,et al.  Query rewriting and answering under constraints in data integration systems , 2003, IJCAI.

[82]  Jeff Z. Pan,et al.  Expressive Querying over Fuzzy DL-Lite Ontologies , 2007, Description Logics.

[83]  Bernhard Thalheim,et al.  Semantics in Data and Knowledge Bases , 2008, SDKB.

[84]  Ashok K. Chandra,et al.  Optimal implementation of conjunctive queries in relational data bases , 1977, STOC '77.

[85]  Volker Haarslev,et al.  RACER System Description , 2001, IJCAR.

[86]  Antonis C. Kakas,et al.  Computational Logic: Logic Programming and Beyond , 2002, Lecture Notes in Computer Science.

[87]  Boris Motik,et al.  Data Complexity of Reasoning in Very Expressive Description Logics , 2005, IJCAI.

[88]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[89]  Natalya F. Noy,et al.  Semantic integration: a survey of ontology-based approaches , 2004, SGMD.

[90]  Ian Horrocks,et al.  Using an Expressive Description Logic: FaCT or Fiction? , 1998, KR.

[91]  Diego Calvanese,et al.  The DL-Lite Family and Relations , 2009, J. Artif. Intell. Res..

[92]  Andrea Dekrout Monitoring New Zealand long-tailed bats (Chalinolobus tuberculatus) in urban habitats : ecology, physiology and genetics , 2009 .

[93]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[94]  Colin F. J. O'Donnell,et al.  Conservation status and causes of decline of the threatened New Zealand Long-tailed Bat Chalinolobus tuberculatus (Chiroptera: Vespertilionidae) , 2000 .

[95]  Diego Calvanese,et al.  Tractable Reasoning and Efficient Query Answering in Description Logics: The DL-Lite Family , 2007, Journal of Automated Reasoning.

[96]  Heribert Vollmer,et al.  Introduction to Circuit Complexity: A Uniform Approach , 2010 .

[97]  François Goasdoué,et al.  The Use of CARIN Language and Algorithms for Information Integration: The PICSEL System , 2000, Int. J. Cooperative Inf. Syst..

[98]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[99]  Ron van der Meyden,et al.  Logical Approaches to Incomplete Information: A Survey , 1998, Logics for Databases and Information Systems.

[100]  Peter F. Patel-Schneider,et al.  The CLASSIC knowledge representation system: guiding principles and implementation rationale , 1991, SGAR.

[101]  John Haugeland Mind design , 1985 .

[102]  Colin F. J. O'Donnell,et al.  Roost selection by the long-tailed bat, Chalinolobus tuberculatus, in temperate New Zealand rainforest and its implications for the conservation of bats in managed forests , 1999 .

[103]  Pierre Wolper,et al.  Automata theoretic techniques for modal logics of programs: (Extended abstract) , 1984, STOC '84.

[104]  R. Barclay,et al.  Foraging by bats in cleared, thinned and unharvested boreal forest , 2003 .

[105]  Ian Horrocks,et al.  A Tableaux Decision Procedure for SHOIQ , 2005, IJCAI.

[106]  J. Ragle,et al.  IUCN Red List of Threatened Species , 2010 .

[107]  Franz Baader,et al.  An Overview of Tableau Algorithms for Description Logics , 2001, Stud Logica.

[108]  Francesco M. Donini,et al.  Decidable Reasoning in Terminological Knowledge Representation Systems , 1993, IJCAI.

[109]  Edith Schonberg,et al.  Ontology Reasoning with Large Data Repositories , 2008, Ontology Management.

[110]  James A. Hendler,et al.  A Portrait of the Semantic Web in Action , 2001, IEEE Intell. Syst..

[111]  Derick Wood,et al.  Theory of computation , 1986 .

[112]  Richard Hull,et al.  A Survey of Theoretical Research on Typed Complex Database Objects , 1988, XP7.52 Workshop on Database Theory.

[113]  Diego Calvanese,et al.  Path-Based Identification Constraints in Description Logics , 2008, KR.

[114]  Diego Calvanese,et al.  Unifying Class-Based Representation Formalisms , 2011, J. Artif. Intell. Res..

[115]  Diego Calvanese,et al.  Linking Data to Ontologies , 2008, J. Data Semant..

[116]  Diego Calvanese,et al.  Data Complexity of Query Answering in Description Logics , 2006, Description Logics.

[117]  H. Aldridge,et al.  Load Carrying and Maneuverability in an Insectivorous Bat: a Test of the 5% “Rule” of Radio-Telemetry , 1988 .

[118]  Diego Calvanese,et al.  Answering Queries Using Views over Description Logics Knowledge Bases , 2000, AAAI/IAAI.

[119]  G. R. Williams,et al.  A survey of the distribution, seasonal activity and roost sites of New Zealand bats , 1984 .

[120]  Gert Smolka,et al.  Attributive Concept Descriptions with Complements , 1991, Artif. Intell..

[121]  Diego Calvanese,et al.  Realizing Ontology Based Data Access: A plug-in for protégé , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[122]  Bradley Law,et al.  Eucalypt plantings on farms: Use by insectivorous bats in south-eastern Australia , 2006 .

[123]  Diego Calvanese,et al.  Description Logics for Information Integration , 2002, Computational Logic: Logic Programming and Beyond.

[124]  Ian Horrocks,et al.  A Description Logic with Transitive and Inverse Roles and Role Hierarchies , 1999, J. Log. Comput..

[125]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[126]  Diego Calvanese,et al.  QuOnto: Querying Ontologies , 2005, AAAI.

[127]  Colin F. J. O'Donnell,et al.  Cryptic local populations in a temperate rainforest bat Chalinolobus tuberculatus in New Zealand , 2000 .

[128]  William J. Zielinski,et al.  Individual legacy trees influence vertebrate wildlife diversity in commercial forests , 2004 .

[129]  Omer Reingold,et al.  Undirected connectivity in log-space , 2008, JACM.

[130]  Moshe Y. Vardi,et al.  Polynomial-time implication problems for unary inclusion dependencies , 1990, JACM.

[131]  R GruberThomas Toward principles for the design of ontologies used for knowledge sharing , 1995 .

[132]  Frank Wolter,et al.  Semi-qualitative Reasoning about Distances: A Preliminary Report , 2000, JELIA.

[133]  Luca Pulina,et al.  Minimal Module Extraction from DL-Lite Ontologies Using QBF Solvers , 2009, IJCAI.

[134]  Dieter Fensel,et al.  Ontobroker: Ontology Based Access to Distributed and Semi-Structured Information , 1999, DS-8.

[135]  Raymond Reiter On Closed World Data Bases , 1977, Logic and Data Bases.

[136]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[137]  Martin Hepp,et al.  Ontology Management, Semantic Web, Semantic Web Services, and Business Applications , 2008, Ontology Management.

[138]  Colin F. J. O'Donnell,et al.  Use of Roosts by the Long-Tailed Bat, Chalinolobus tuberculatus, in Temperate Rainforest in New Zealand , 1999 .

[139]  David S. Johnson,et al.  Testing containment of conjunctive queries under functional and inclusion dependencies , 1982, J. Comput. Syst. Sci..

[140]  Frank Wolter,et al.  Monodic fragments of first-order temporal logics: 2000-2001 A.D , 2001, LPAR.