Data Access Linking and Integration with DALI: Building a Safety Net for an Ocean of City Data

DALI is a practical system that exploits Linked Data to provide federated entity search and spatial exploration across hundreds of information sources containing Open and Enterprise data pertaining to cities, stored either in tabular files or in their original enterprise systems. The system lifts data into a meaningful linked structure with explicit semantics and supports novel contextual search and retrieval tasks by identifying related entities across models and data sources. We evaluate DALI in two pilot scenarios. In the first, data engineers bring together public and enterprise datasets about public safety. In the second, knowledge engineers and domain experts build a view of health and social care providers for vulnerable populations. We show that our approach can re-use data assets and provides better results than pure text-based approaches, both in finding relevant information and in satisfying specific information needs.
