Linked Humanities Data: The Next Frontier? A Case-study in Historical Census Data

This paper discusses the use of Linked Data to harmonize the Dutch censuses (1795-1971). Due to the long period they cover, census data is notoriously difficult to compare, aggregate and query in a uniform fashion. In social history, harmonization is the (manual) process of restructuring, interpreting and correcting original data sources to make a comparison possible. We describe a harmonization methodology based on standard Linked Data principles, illustrate how the size and complexity of the resulting linked data source poses new challenges for Semantic Web technology, and discuss potential solutions.

[1]  Onno Boonstra,et al.  Twee eeuwen Nederland geteld. Onderzoek met de digitale Volks-, Beroeps- en Woningtellingen 1795-2001 , 2007 .

[2]  Matthew Sobek,et al.  Challenges and Methods of International Census Harmonization , 2003 .

[3]  Peter B. Meyer,et al.  Proposed Category System for 1960-2000 Census Occupations , 2012 .

[4]  James A. Hendler,et al.  Dynamic Ontologies on the Web , 2000, AAAI/IAAI.

[5]  Robert Isele,et al.  Silk Server - Adding missing Links while consuming Linked Data , 2010, COLD.

[6]  Mark Stevenson,et al.  Words and Intelligence II: Essays in Honor of Yorick Wilks , 2007 .

[7]  Ineke Maas,et al.  Creating a Historical International Standard Classification of Occupations An Exercise in Multinational Interdisciplinary Cooperation , 2004 .

[8]  M.P.M. van Horik,et al.  Twee eeuwen Nederland geteld , 2007 .

[9]  Lisa Y. Dillon,et al.  Best Practices with Large Databases on Historical Populations , 2001, Hist. Comput..

[10]  Michel C. A. Klein,et al.  What Is Concept Drift and How to Measure It? , 2010, EKAW.

[11]  Deborah L. McGuinness,et al.  When owl: sameAs Isn't the Same: An Analysis of Identity in Linked Data , 2010, SEMWEB.

[12]  Michel C. A. Klein,et al.  Concept drift and how to identify it , 2011, J. Web Semant..

[13]  M. St-Hilaire,et al.  Geocoding and Mapping Historical Census Data: The Geographical Component of the Canadian Century Research Infrastructure , 2007 .

[14]  Michel C. A. Klein,et al.  Ontology versioning on the Semantic Web , 2001, SWWS.

[15]  Graham White Semantics, hermeneutics, statistics: some reflections on the semantic web , 2011, BCS HCI.