Investigations into data published and consumed on the Web: a systematic mapping study

The increasing interest in using the Web as a platform for data sharing has motivated research about publishing and consuming data on the Web. While this subject is gaining importance, up until now, there are not many academic papers reviewing the approaches for publishing and consuming data on the Web. Furthermore, to the best of our knowledge, there is no systematic review of the literature that analyzes this subject. In this article, we conduct a systematic mapping study that aims to provide an overview of the current literature on publishing and consuming data on the Web by conducting a systematic mapping study. This study seeks to function as a snapshot of this subject by (i) identifying and analyzing how data have been published and consumed on the Web, (ii) discovering the benefits and limitations of publishing and consuming data on the Web (iii) analyzing the evolution of research on publishing and consuming data on the Web, and (iv) classifying the studies into categories related to their contribution. Finally, we discuss the results of this study and their implications for research on data on the Web-related subjects.

[1]  Martina Stockhause,et al.  Key components of data publishing: using current best practices to develop a reference model for data publishing , 2017, International Journal on Digital Libraries.

[2]  Katleen Janssen,et al.  Legal and Institutional Challenges for Opening Data across Public Sectors: Towards Common Policy Solutions , 2014, J. Theor. Appl. Electron. Commer. Res..

[3]  Leyla Jael García Castro,et al.  Biotea: RDFizing PubMed Central in support for the paper as an interface to the Web of Data , 2013, Journal of Biomedical Semantics.

[4]  Martina Stockhause,et al.  WDS-RDA-F11 Publishing Data Workflows WG Synthesis FINAL CORRECTED , 2015 .

[5]  Viktor de Boer,et al.  Linked Data for the International Aid Transparency Initiative , 2014, Journal on Data Semantics.

[6]  Knud Möller,et al.  Lifecycle models of data-centric systems and , 2012 .

[7]  Sören Auer,et al.  Linked Open Data -- Creating Knowledge Out of Interlinked Data , 2014, Lecture Notes in Computer Science.

[8]  V. Yu. Zitserman,et al.  Publishing scientific data as linked open data , 2013, Scientific and Technical Information Processing.

[9]  M. Petticrew,et al.  Systematic Reviews in the Social Sciences: A Practical Guide , 2005 .

[10]  Valeria de Paiva,et al.  A linked open data architecture for the historical archives of the Getulio Vargas Foundation , 2015, International Journal on Digital Libraries.

[11]  Efthimios Tambouris,et al.  On publishing linked open government data , 2013, PCI '13.

[12]  Felix Naumann,et al.  Data Fusion – Resolving Data Conflicts for Integration , 2009 .

[13]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[14]  Eleni Fotopoulou,et al.  Challenges and opportunities in renovating public sector information by enabling linked data and analytics , 2016, Information Systems Frontiers.

[15]  Khaled Shaalan,et al.  A Survey of Web Information Extraction Systems , 2006, IEEE Transactions on Knowledge and Data Engineering.

[16]  Ig Ibert Bittencourt,et al.  A systematic review on the use of best practices for publishing linked data , 2018, Online Inf. Rev..

[17]  Asunción Gómez-Pérez,et al.  Publishing Linked Data - There is no One-Size-Fits-All Formula , 2012 .

[18]  Monica Scannapieco,et al.  Publishing the 15th Italian Population and Housing Census as Linked Open Data , 2014, SemStats@ISWC.

[19]  Spiros Mouzakitis,et al.  A State-of-the-Art Analysis of the Current Public Data Landscape from a Functional, Semantic and Technical Perspective , 2014, J. Theor. Appl. Electron. Commer. Res..

[20]  Barbara Dinter,et al.  A Stakeholder Lens on Metadata Management in Business Intelligence and Big Data - Results of an Empirical Investigation , 2015, AMCIS.

[21]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[22]  Karen Coyle,et al.  Comparing Methodologies: Linked Open Data and Digital Libraries , 2014, AIUCD '14.

[23]  Mohamed Lamine Mouhoub,et al.  Searching Linked Data and Services with a Single Query , 2014, ESWC.

[24]  Kiev Gama,et al.  Towards Ecosystems based on Open Data as a Service , 2014, ICEIS.

[25]  Akinori Yonezawa,et al.  Building Linked Open Data towards integration of biomedical scientific literature with DBpedia , 2013, J. Biomed. Semant..

[26]  J. Kucera Open Government Data Publication Methodology , 2015 .

[27]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[28]  Timos K. Sellis,et al.  Exploration and Visualization in the Web of Big Linked Data: A Survey of the State of the Art , 2016, EDBT/ICDT Workshops.

[29]  Bernadette Farias Lóscio,et al.  Data on the web management system: a reference model , 2018, DG.O.

[30]  Yannis Charalabidis,et al.  Benefits, Adoption Barriers and Myths of Open Data and Open Government , 2012, Inf. Syst. Manag..

[31]  Asunción Gómez-Pérez,et al.  Guidelines for Linked Data generation and publication: An example in building energy consumption , 2015 .

[32]  Berthier A. Ribeiro-Neto,et al.  A brief survey of web data extraction tools , 2002, SGMD.

[33]  Ben Goldacre,et al.  OpenTrials: towards a collaborative open database of all available information on all clinical trials , 2016, Trials.

[34]  Sören Auer,et al.  The emerging web of linked data , 2011, ISWSA '11.

[35]  Bin Chen,et al.  The ChEMBL database as linked open data , 2013, Journal of Cheminformatics.

[36]  Wolfgang Lehner,et al.  OPEN—Enabling Non-expert Users to Extract, Integrate, and Analyze Open Data , 2012, Datenbank-Spektrum.

[37]  Marcos R. Vieira,et al.  Structured Open Urban Data: Understanding the Landscape , 2014, Big Data.

[38]  Valentina Janev,et al.  Lifting Open Data Portals to the Data Web , 2014, Linked Open Data.

[39]  Greg Janée Digital Curation , 2009, Encyclopedia of Database Systems.

[40]  Kellyton dos Santos Brito,et al.  Brazilian government open data: implementation, challenges, and potential opportunities , 2014, dg.o '14.

[41]  Amit P. Sheth,et al.  Semantic Sensor Web , 2008, IEEE Internet Computing.

[42]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[43]  Amit P. Sheth,et al.  From Data to Actionable Knowledge: Big Data Challenges in the Web of Things , 2013, IEEE Intell. Syst..

[44]  Carole A. Goble,et al.  Data curation + process curation=data integration + science , 2008, Briefings Bioinform..

[45]  Michael Martin,et al.  CubeViz: Exploration and Visualization of Statistical Linked Data , 2015, WWW.

[46]  Mary Shaw,et al.  Writing good software engineering research papers , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[47]  Jun Zhao,et al.  Publishing Chinese medicine knowledge as Linked Data on the Web , 2010, Chinese medicine.

[48]  James Gallagher,et al.  Facilitating open exchange of data and information , 2015, Earth Science Informatics.

[49]  Sunil Choenni,et al.  Socio-technical Impediments of Open Data , 2012 .

[50]  Nigel Shadbolt,et al.  Linked Data in Government , 2013, IEEE Internet Computing.

[51]  Soon Ae Chun,et al.  Government 2.0: Making connections between citizens, data and government , 2010, Inf. Polity.

[52]  Sunil Choenni,et al.  On the barriers for local government releasing open data , 2014, Gov. Inf. Q..

[53]  Nelson Piedra,et al.  Consuming and producing linked open data: the case of OpenCourseWare , 2014, Program.

[54]  Katrin Braunschweig,et al.  The State of Open Data Limits of Current Open Data Platforms , 2012 .

[55]  Watchira Buranasing,et al.  Publishing Linked Open Data from Semantic Relation Extraction for Thai Cultural Archive , 2014, JIST.

[56]  Cong Wang,et al.  Toward Secure and Dependable Storage Services in Cloud Computing , 2012, IEEE Transactions on Services Computing.

[57]  Ranjeet Devarakonda,et al.  Mercury: reusable metadata management, data discovery and access system , 2010, Earth Sci. Informatics.

[58]  J. V. Lucke,et al.  Open Government and (Linked) (Open) (Government) (Data) , 2012 .

[59]  Julia Hoxha,et al.  Open Government Data on the Web: A Semantic Approach , 2011, 2011 International Conference on Emerging Intelligent Data and Web Technologies.

[60]  Jeremy G. Frey,et al.  Scientific and technical data sharing: a trading perspective , 2014, Journal of Computer-Aided Molecular Design.

[61]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[62]  Jayant Madhavan,et al.  Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.

[63]  Michael Stonebraker,et al.  Data Curation at Scale: The Data Tamer System , 2013, CIDR.

[64]  Klaus R. Dittrich,et al.  All Together Now: Towards Integrating the World's Information Systems , 2000, JISBD.

[65]  Auri Marcelo Rizzo Vincenzi,et al.  Static Analysis Techniques and Tools: A Systematic Mapping Study , 2013, ICSEA 2013.

[66]  Mohsen Kahani,et al.  Publishing Persian linked data; challenges and lessons learned , 2010, 2010 5th International Symposium on Telecommunications.

[67]  Pieter Colpaert Route Planning Using Linked Open Data , 2014, ESWC.

[68]  Peter Christen,et al.  Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface , 2008, KDD.

[69]  Leonid Stoimenov,et al.  Linked Relations Architecture for Production and Consumption of Linksets in Open Government Data , 2015, I3E.

[70]  Désirée Hilbring,et al.  Automating the web publishing process of environmental data by using semantic annotations , 2014, EMR@ICMR.

[71]  Silvia Mazzini,et al.  LodLive, exploring the web of data , 2012, I-SEMANTICS '12.

[72]  Julie McLeod,et al.  Opening research data: issues and opportunities , 2014 .

[73]  Carole A. Goble,et al.  BioCatalogue: a universal catalogue of web services for the life sciences , 2010, Nucleic Acids Res..

[74]  Peter Webster,et al.  Research Data Repositories: Review of Current Features, Gap Analysis, and Recommendations for Minimum Requirements , 2016 .

[75]  Karen Isabel Cabrera Peña Comparative analysis of public policies in open access models in Latin America. Brazil and Argentina cases , 2015, International Journal of Educational Technology in Higher Education.

[76]  Jing Zhang,et al.  Exploring stakeholders' expectations of the benefits and barriers of e-government knowledge sharing , 2005, J. Enterp. Inf. Manag..

[77]  Naira R. Matevosyan,et al.  Rediscovering Comte de Saint-Simon: From Aristocracy to Meritocracy, a Journey to Inclusion , 2018 .

[78]  Judit Dobránszki,et al.  Potential Dangers with Open Access Data Files in the Expanding Open Data Movement , 2015 .

[79]  Ian Budge,et al.  Managing ‘Big Data’ , 2019, Politics.

[80]  Alejandro Rodríguez-González,et al.  Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base , 2016, Front. Plant Sci..

[81]  Shirley Y. Crompton,et al.  Investigations as Research Objects Within Facilities Science , 2013, TPDL Workshops.

[82]  Lluís Esteve Casellas Serra,et al.  The mapping, selecting and opening of data , 2014 .

[83]  André Freitas,et al.  Big Data Curation , 2016, New Horizons for a Data-Driven Economy.

[84]  Michael L. Brodie,et al.  The meaningful use of big data: four perspectives -- four challenges , 2012, SGMD.

[85]  H. Arksey,et al.  Scoping studies: towards a methodological framework , 2005 .

[86]  Mathieu d'Aquin,et al.  On the Use of Linked Open Data in Education: Current and Future Practices , 2016, Open Data for Education.

[87]  Philipp Frischmuth,et al.  OntoWiki - An authoring, publication and visualization interface for the Data Web , 2015, Semantic Web.

[88]  Harry Halpin,et al.  Architecture of the World Wide Web , 2013 .

[89]  Hector Garcia-Molina,et al.  Extracting structured data from Web pages , 2003, SIGMOD '03.

[90]  Joann J. Ordille,et al.  Data integration: the teenage years , 2006, VLDB.

[91]  Kerry L. Taylor,et al.  Semantics for the Internet of Things: Early Progress and Back to the Future , 2019 .

[92]  Yannis Charalabidis,et al.  Analysing the Characteristics of Open Government Data Sources in Greece , 2018 .

[93]  Christoph Stasch,et al.  New Generation Sensor Web Enablement , 2011, Sensors.

[94]  Rajiv C. Shah,et al.  Lessons for Government Adoption of Open Standards: A Case Study of the Massachusetts Policy , 2008 .

[95]  Bernadette Farias Lóscio,et al.  Towards a meta-model for data ecosystems , 2018, DG.O.

[96]  Sören Auer,et al.  A systematic review of open government data initiatives , 2015, Gov. Inf. Q..

[97]  Martin Necaský,et al.  Methodologies and Best Practices for Open Data Publication , 2015, DATESO.

[98]  Krish Krishnan,et al.  Data Warehousing Revisited , 2013 .

[99]  Tung-Mou Yang,et al.  Examining the socio-technical determinants influencing government agencies' open data publication: A study in Taiwan , 2016, Gov. Inf. Q..

[100]  Paolo Bouquet,et al.  Web of Data and Web of Entities: Identity and Reference in Interlinked Data in the Semantic Web , 2012 .

[101]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[102]  Daniela Grigori,et al.  A Framework for Searching Semantic Data and Services with SPARQL , 2014, ICSOC.

[103]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[104]  Karen F. Gracy Archival description and linked data: a preliminary study of opportunities and implementation challenges , 2015 .

[105]  Olaf Hartig,et al.  A Database Perspective on Consuming Linked Data on the Web , 2010, Datenbank-Spektrum.

[106]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[107]  Eric C. Kansa,et al.  Googling the Grey: Open Data, Web Services, and Semantics , 2010 .

[108]  Bernadette Farias Lóscio,et al.  What is a data ecosystem? , 2018, DG.O.

[109]  Iain Hrynaszkiewicz,et al.  Publishing descriptions of non-public clinical datasets: proposed guidance for researchers, repositories, editors and funding organisations , 2016, Research Integrity and Peer Review.