Hydria: An Online Data Lake for Multi-Faceted Analytics in the Cultural Heritage Domain

Advancements in cultural informatics have significantly influenced the way we perceive, analyze, communicate and understand culture. New data sources, such as social media, digitized cultural content, and Internet of Things (IoT) devices, have allowed us to enrich and customize the cultural experience, but at the same time have created an avalanche of new data that needs to be stored and appropriately managed in order to be of value. Although data management plays a central role in driving forward the cultural heritage domain, the solutions applied so far are fragmented, physically distributed, require specialized IT knowledge to deploy, and entail significant IT experience to operate even for trivial tasks. In this work, we present Hydria, an online data lake that allows users without any IT background to harvest, store, organize, analyze and share heterogeneous, multi-faceted cultural heritage data. Hydria provides a zero-administration, zero-cost, integrated framework that enables researchers, museum curators and other stakeholders within the cultural heritage domain to easily (i) deploy data acquisition services (like social media scrapers, focused web crawlers, dataset imports, questionnaire forms), (ii) design and manage versatile customizable data stores, (iii) share whole datasets or horizontal/vertical data shards with other stakeholders, (iv) search, filter and analyze data via an expressive yet simple-to-use graphical query engine and visualization tools, and (v) perform user management and access control operations on the stored data. To the best of our knowledge, this is the first solution in the literature that focuses on collecting, managing, analyzing, and sharing diverse, multi-faceted data in the cultural heritage domain and targets users without an IT background.

[1]  Shao-Chun Wu,et al.  Systems integration of heterogeneous cultural heritage information systems in museums: a case study of the National Palace Museum , 2016, International Journal on Digital Libraries.

[2]  M. Balafar,et al.  The state-of-the-art in expert recommendation systems , 2019, Eng. Appl. Artif. Intell..

[3]  Angelo Chianese,et al.  CHIS: Cultural Heritage Information System , 2013, Int. J. Knowl. Soc. Res..

[4]  Francisco Ortin,et al.  Design of cultural heritage information systems based on information layers , 2013, JOCCH.

[5]  Jai E. Jung,et al.  Identifying and ranking cultural heritage resources on geotagged social media for smart cultural tourism services , 2016, Personal and Ubiquitous Computing.

[6]  Vincenzo Moscato,et al.  An Edge Intelligence Empowered Recommender System Enabling Cultural Heritage Applications , 2019, IEEE Transactions on Industrial Informatics.

[7]  Ioanna Lykourentzou,et al.  Online Sequencing of Non-Decomposable Macrotasks in Expert Crowdsourcing , 2018, ACM Trans. Soc. Comput..

[8]  Giorgos Lepouras,et al.  Modeling visitors' profiles: A study to investigate adaptation aspects for museum learning technologies , 2010, JOCCH.

[9]  Alexander V. Smirnov,et al.  Context-based infomobility system for cultural heritage recommendation: Tourist Assistant—TAIS , 2017, Personal and Ubiquitous Computing.

[10]  Panos Markopoulos,et al.  Macrotask Crowdsourcing: An Integrated Definition , 2019, Macrotask Crowdsourcing.

[11]  Carlo Meghini,et al.  Using an ontology for representing the knowledge on literary texts: The Dante Alighieri case study , 2016, Semantic Web.

[12]  Tao Li,et al.  A survey on expert finding techniques , 2017, Journal of Intelligent Information Systems.

[13]  Martin Doerr,et al.  X3ML mapping framework for information integration in cultural heritage and beyond , 2017, International Journal on Digital Libraries.

[14]  Christian Meske,et al.  How does the world connect? Exploring the global diffusion of social network sites , 2017, J. Assoc. Inf. Sci. Technol..

[15]  Giorgos Lepouras,et al.  Museum Personalization Based on Gaming and Cognitive Styles: The BLUE Experiment , 2015, Int. J. Virtual Communities Soc. Netw..

[16]  Weeraphan Chanhom,et al.  TOMS: A Linked Open Data System for Collaboration and Distribution of Cultural Heritage Artifact Collections of National Museums in Thailand , 2019, New Generation Computing.

[17]  Pierre Grussenmeyer,et al.  A web information system for the management and the dissemination of Cultural Heritage data , 2007 .

[18]  Natalia Miloslavskaya,et al.  Big Data, Fast Data and Data Lake Concepts , 2016, BICA.

[19]  Alessandro Bozzon,et al.  Choosing the right crowd: expert finding in social networks , 2013, EDBT '13.

[20]  Po-Sen Huang,et al.  Multimedia augmented reality information system for museum guidance , 2013, Personal and Ubiquitous Computing.

[21]  Stefan Stieglitz,et al.  Going Back in Time to Predict the Future - The Complex Role of the Data Collection Period in Social Media Analytics , 2018, Information Systems Frontiers.

[22]  Chern Li Liew,et al.  Participatory Cultural Heritage: A Tale of Two Institutions' Use of Social Media , 2014, D Lib Mag..

[23]  Vassilis Poulopoulos,et al.  Stimulation of reflection and discussion in museum visits through the use of social media , 2017, Social Network Analysis and Mining.

[24]  Juliana Freire,et al.  Finding seeds to bootstrap focused crawlers , 2015, World Wide Web.

[25]  Javier Nogueras-Iso,et al.  Profiling of knowledge organisation systems for the annotation of Linked Data cultural resources , 2019, Inf. Syst..

[26]  Matthias R. Hastall,et al.  "Likes" as social rewards: Their role in online social comparison and decisions to like other People's selfies , 2019, Comput. Hum. Behav..