Data measurement in research information systems: metrics for the evaluation of data quality

In recent years, research information systems (RIS) have become an integral part of the university’s IT landscape. At the same time, many universities and research institutions are still working on the implementation of such information systems. Research information systems support institutions in the measurement, documentation, evaluation and communication of research activities. Implementing such integrative systems requires that institutions assure the quality of the information on research activities entered into them. Since many information and data sources are interwoven, these different data sources can have a negative impact on data quality in different research information systems. Because the topic is currently of interest to many institutions, the aim of the present paper is firstly to consider how data quality can be investigated in the context of RIS, and then to explain how various dimensions of data quality described in the literature can be measured in research information systems. Finally, a framework as a process flow according to UML activity diagram notation is developed for monitoring and improvement of the quality of these data; this framework can be implemented by technical personnel in universities and research institutions.

[1]  Carlo Batini,et al.  Data Quality at a Glance , 2005, Datenbank-Spektrum.

[2]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[3]  Kurt Geihs,et al.  Mobile Datenbanken und Systeme , 2003, Datenbank-Spektrum.

[4]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques (Data-Centric Systems and Applications) , 2006 .

[5]  Myriam C. Traub Measuring and improving data quality of media collections for professional tasks , 2014, IIiX.

[6]  Mohammad Abuosba,et al.  Improving the data quality in the research information systems , 2019, ArXiv.

[7]  Marcus Kaiser,et al.  How to Measure Data Quality? - A Metric-Based Approach , 2007, ICIS.

[8]  Helmut Krcmar,et al.  Information management , 1994 .

[9]  Richard Y. Wang,et al.  Journey to Data Quality , 2006 .

[10]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.

[11]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[12]  Barbara Ebert,et al.  Research information systems at universities and research institutions - Position Paper of DINI AG FIS , 2015 .

[13]  Larry P. English Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing Profits , 1999 .

[14]  Gunter Saake,et al.  Analyzing data quality issues in research information systems via data profiling , 2018, Int. J. Inf. Manag..

[15]  Gunter Saake,et al.  Data Quality Measures and Data Cleansing for Research Information Systems , 2019, ArXiv.

[16]  Bernd Heinrich,et al.  Die Messung der Datenqualität im Controlling , 2009 .

[17]  Algimantas Juozapavičius,et al.  Research Information Systems , 1997 .

[18]  Holger Hinrichs Datenqualitätsmanagement in Data-warehouse-Systemen , 2002 .