Sheet2RDF: a Flexible and Dynamic Spreadsheet Import&Lifting Framework for RDF

In this paper, we introduce Sheet2RDF, a platform for acquiring spreadsheets and transforming them into RDF datasets. Sheet2RDF builds on Apache UIMA and CODA, two wider-scoped frameworks aimed, respectively, at knowledge acquisition from unstructured information and at RDF triplification. It narrows their scope to spreadsheets, so that it can account for the peculiarities of this format and provide informed solutions that ease the transformation process, while still exploiting the full potential of the two underlying frameworks. Sheet2RDF is also bundled as a plugin for two RDF management platforms: Semantic Turkey and VocBench. Integration with these platforms raises the level of automation in the process, thanks to a human-computer interface that takes suggestions from users and translates them into proper transformation rules. It also strengthens this interaction by giving direct access to the data and vocabularies edited in the platform.
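To make the idea of lifting a spreadsheet into RDF concrete, the following is a minimal sketch in Python. It is not Sheet2RDF's implementation and does not use UIMA, CODA, or their rule language. It only illustrates the general pattern of mapping column headers to RDF properties and emitting one triple per non-empty cell. The base namespace, the header-to-property table, the column names, and the sample data are all assumptions made for illustration.

```python
import csv
import io
from urllib.parse import quote

# Hypothetical namespace for generated resources (an assumption, not from the paper).
BASE = "http://example.org/"
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Illustrative stand-in for transformation rules: column header -> property IRI.
HEADER_TO_PREDICATE = {
    "prefLabel": SKOS + "prefLabel",
    "broader": SKOS + "broader",
}
# Columns whose cells refer to other rows (resources) rather than plain literals.
OBJECT_IS_RESOURCE = {"broader"}


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def sheet_to_ntriples(csv_text, id_column="id"):
    """Turn each non-empty mapped cell of a CSV sheet into one N-Triples line."""
    triples = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        subject = f"<{BASE}{quote(row[id_column])}>"
        for header, predicate in HEADER_TO_PREDICATE.items():
            value = (row.get(header) or "").strip()
            if not value:
                continue  # empty cells produce no triple
            if header in OBJECT_IS_RESOURCE:
                obj = f"<{BASE}{quote(value)}>"
            else:
                obj = f'"{escape_literal(value)}"'
            triples.append(f"{subject} <{predicate}> {obj} .")
    return triples


# Hypothetical sample sheet: two concepts, the second narrower than the first.
SAMPLE = "id,prefLabel,broader\nc1,Cereals,\nc2,Maize,c1\n"

if __name__ == "__main__":
    for line in sheet_to_ntriples(SAMPLE):
        print(line)
```

In this sketch the mapping table is fixed by hand. In the approach described above, the corresponding transformation rules would instead be derived from the sheet and refined through user suggestions in the hosting platform.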
