Jupyter Notebooks for Generous Archive Interfaces

To help scholars extract meaning, knowledge and value from large volumes of archival content, such as the content available through the Dutch Common Lab Research Infrastructure for the Arts and Humanities (CLARIAH), we need to provide more ‘generous’ access to the data than generalised search and visualisation tools alone can offer. Our approach is to use Jupyter Notebooks in combination with the existing archive APIs (Application Programming Interfaces). This gives access to both the archive metadata and a wide range of analysis and visualisation techniques. We have created notebooks and modules of supporting functions that enable the overview, investigation and analysis of the archive. We demonstrate the value of our approach in preliminary tests of its use in scholarly research, and give our observations of the potential value for archivists. Finally, we show that good archive knowledge is essential to create correct and meaningful visualisations and statistics.
