Working framework of semantic interoperability for CRIS with heterogeneous data sources

Purpose Information from Current Research Information Systems (CRIS) is stored in different formats, in platforms that are not compatible, or even in independent networks. It would be helpful to have a well-defined methodology to allow for management data processing from a single site, so as to take advantage of the capacity to link disperse data found in different systems, platforms, sources and/or formats. Based on functionalities and materials of the VLIR project, the purpose of this paper is to present a model that provides for interoperability by means of semantic alignment techniques and metadata crosswalks, and facilitates the fusion of information stored in diverse sources. Design/methodology/approach After reviewing the state of the art regarding the diverse mechanisms for achieving semantic interoperability, the paper analyzes the following: the specific coverage of the data sets (type of data, thematic coverage and geographic coverage); the technical specifications needed to retrieve and analyze a distribution of the data set (format, protocol, etc.); the conditions of re-utilization (copyright and licenses); and the “dimensions” included in the data set as well as the semantics of these dimensions (the syntax and the taxonomies of reference). The semantic interoperability framework here presented implements semantic alignment and metadata crosswalk to convert information from three different systems (ABCD, Moodle and DSpace) to integrate all the databases in a single RDF file. Findings The paper also includes an evaluation based on the comparison – by means of calculations of recall and precision – of the proposed model and identical consultations made on Open Archives Initiative and SQL, in order to estimate its efficiency. The results have been satisfactory enough, due to the fact that the semantic interoperability facilitates the exact retrieval of information. Originality/value The proposed model enhances management of the syntactic and semantic interoperability of the CRIS system designed. In a real setting of use it achieves very positive results.

[1]  Janina Fengel,et al.  Semantic technologies for aligning heterogeneous business process models , 2014, Bus. Process. Manag. J..

[2]  Emmanouel Garoufallou,et al.  A critical introduction to metadata for e-science and e-research , 2014, Int. J. Metadata Semant. Ontologies.

[3]  Oscar Corcho,et al.  Methodological Guidelines for Publishing Government Linked Data , 2011 .

[4]  José A. Senso,et al.  AUTHORIS: a tool for authority control in the semantic web , 2013, Libr. Hi Tech.

[5]  Claus Zinn,et al.  Integrated access to cultural heritage resources through representation and alignment of controlled vocabularies , 2008 .

[6]  Martin Gaedke,et al.  Silk - A Link Discovery Framework for the Web of Data , 2009, LDOW.

[7]  Yusniel Hidalgo Delgado Marco de trabajo basado en los datos enlazados para la interoperabilidad semántica en el protocolo OAI-PMH , 2015 .

[8]  Danica Zendulková The Implementation of CERIF based Data Model for Statistical Survey of Research and Development Potential within the SK CRIS , 2014, CRIS.

[9]  Sabine Schmidt,et al.  BoRIS and BIA: CRIS and Institutional Repository Integration at the Free University of Bozen-Bolzano , 2014, CRIS.

[10]  Usman Wajid,et al.  Enhancing Enterprise Collaboration Using a Protocol for Semantic Alignment , 2009, 2009 18th IEEE International Workshops on Enabling Technologies: Infrastructures for Collaborative Enterprises.

[11]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[12]  Ngoc Thanh Nguyen,et al.  A METHOD FOR ONTOLOGY ALIGNMENT BASED ON SEMANTICS OF ATTRIBUTES , 2012, Cybern. Syst..

[13]  Ya-Ning Chen A RDF-based approach to metadata crosswalk for semantic interoperability at the data element level , 2015, Libr. Hi Tech.

[14]  Valerie McCutcheon,et al.  Research Data Meets Research Information Management: Two Case Studies Using (a) Pure CERIF-CRIS and (b) EPrints Repository Platform with CERIF Extensions , 2014, CRIS.

[15]  Danica Zendulková,et al.  Electronic Theses and Dissertations in CRIS , 2014, CRIS.

[16]  David Wood,et al.  The Joy of Data - A Cookbook for Publishing Linked Government Data on the Web , 2011 .

[17]  Steffen Staab,et al.  On How to Perform a Gold Standard Based Evaluation of Ontology Learning , 2006, SEMWEB.

[18]  Andrea Bollini,et al.  Publication Metadata in CERIF: Inspiration by FRBR , 2014, CRIS.

[19]  Ashwin Machanavajjhala,et al.  Entity Resolution: Theory, Practice & Open Challenges , 2012, Proc. VLDB Endow..

[20]  Lidija Ivanović,et al.  Integration of a research management system and an OAI-PMH compatible ETDs repository at the University of Novi Sad, Republic of Serbia , 2012 .

[21]  Marc Ehrig,et al.  Ontology Alignment: Bridging the Semantic Gap , 2006 .

[22]  Stefanos D. Kollias,et al.  A String Metric for Ontology Alignment , 2005, SEMWEB.

[23]  Zongkai Yang,et al.  Research of Metadata Based Digital Educational Resource Sharing , 2008, 2008 International Conference on Computer Science and Software Engineering.

[24]  Dragan Ivanovic,et al.  User interface of web application for searching PhD dissertations of the University of Novi Sad , 2013, 2013 IEEE 11th International Symposium on Intelligent Systems and Informatics (SISY).

[25]  Michael Uschold,et al.  Creating Semantically Integrated Communities on the World Wide Web , 2002 .

[26]  Syed Ali Hassan,et al.  The semantic alignment of H-FOAF, DOMAIN and DBLP ontologies with link open data for a health social network , 2014, 2014 14th International Conference on Control, Automation and Systems (ICCAS 2014).

[27]  Sergio Luján-Mora,et al.  TRANSFORMING LIBRARY CATALOGS INTO LINKED DATA. , 2014 .

[28]  N. Joint Current research information systems, open access repositories and libraries , 2008 .

[29]  Amed Abel Leiva Mederos,et al.  BM2LOD: PLATFORM FOR PUBLISHING BIBLIOGRAPHIC DATA AS LINKED OPEN DATA , 2014 .

[30]  Avigdor Gal Uncertain entity resolution: re-evaluating entity resolution in the big data era: tutorial , 2014, VLDB 2014.

[31]  Vilas Wuwongse,et al.  An application profile for research collaboration and information management , 2015, Program.

[32]  Carlos Sousa Pinto,et al.  CERIF - Is the Standard Helping to Improve CRIS? , 2014, CRIS.

[33]  Marco Schorlemmer,et al.  A formal model for situated semantic alignment , 2007, AAMAS '07.

[34]  Jérôme David,et al.  On Fixing Semantic Alignment Evaluation Measures , 2008, OM.

[35]  Keith G. Jeffery,et al.  From Open Data to Data-intensive Science through CERIF , 2014, CRIS.