This paper describes the Social Networks and Archival Context (SNAC) project, which is built on a database of merged Encoded Archival Context - Corporate Bodies, Persons, and Families (EAC-CPF) records. These records are derived from Encoded Archival Description (EAD) records held by the Library of Congress, the California Digital Library, the Northwest Digital Archives, and Virginia Heritage, and are combined with information from the name authority files of the Library of Congress (Library of Congress Name Authority File), OCLC Research (the Virtual International Authority File), and the Getty Vocabulary Program (Union List of Artist Names). The database merges the information from each instance of an individual name found in the EAD resources, along with variant names, biographical notes, and their topical descriptions. The SNAC prototype interface makes this information searchable and browsable while retaining links to the various data sources.
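The merging step can be illustrated with a minimal Python sketch. It is not the project's actual matching procedure, which is not specified here: the `NameRecord` shape, the `normalize` key, and the `merge_records` helper are all hypothetical, and the sketch assumes that two name instances refer to the same entity whenever their normalized name strings are identical.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class NameRecord:
    """One occurrence of a name in a source (hypothetical, simplified shape)."""
    name: str
    source: str
    variants: list = field(default_factory=list)
    bio_note: str = ""


def normalize(name: str) -> str:
    """Crude matching key (an assumption): lowercase, drop punctuation, collapse spaces."""
    kept = "".join(ch for ch in name.lower() if ch.isalnum() or ch.isspace())
    return " ".join(kept.split())


def merge_records(records):
    """Group records that share a normalized name and pool their details."""
    merged = defaultdict(lambda: {"names": set(), "sources": set(), "bio_notes": []})
    for rec in records:
        entry = merged[normalize(rec.name)]
        entry["names"].update([rec.name, *rec.variants])  # name forms seen so far
        entry["sources"].add(rec.source)  # keep a link back to each data source
        if rec.bio_note:
            entry["bio_notes"].append(rec.bio_note)
    return dict(merged)
```

Exact-key matching of this kind would conflate distinct people who share a name and would miss variant spellings, so a real system would likely need additional evidence such as dates or related names before merging.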