A community approach to data integration: Authorship and building meaningful links across diverse archaeological data sets

The ability to link and compare diverse archaeological data sets will catalyze innovative research of great scope and analytic rigor. However, heterogeneous information, limited budgets, and limited information technology skills hinder data dissemination initiatives. This paper argues for new methods of community-based data integration pioneered by the University of Chicago’s Extensible Markup Language (XML) System for Textual and Archaeological Research project (XSTAR). With XSTAR, data integration takes place in two steps: (1) syntactic-schematic integration, in which legacy data sets are migrated for representation in the data structures described by the Archaeological Markup Language (ArchaeoML), and (2) semantic integration, in which mappings are established between related terms and classes in each source database. Because the nuances of meaning are often very subtle, human experts must classify related items in each data set. Initial syntactic-schematic mapping of data into XSTAR is simple and fast but occurs at a relatively

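The two steps described in the abstract can be illustrated with a minimal sketch. It is not XSTAR code and does not reproduce the ArchaeoML schema: the `Item` class, the function names, the site tables, and the vocabulary terms are all hypothetical, chosen only to show legacy rows being copied into one shared structure and then linked through mappings that a human expert supplies.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    """Generic item with free-form properties (a stand-in, not the real ArchaeoML structure)."""
    source: str
    item_id: str
    properties: dict = field(default_factory=dict)


def migrate(source, rows, id_column):
    """Step 1 (syntactic-schematic): copy legacy rows into the shared item structure."""
    items = []
    for row in rows:
        props = {k: v for k, v in row.items() if k != id_column}
        items.append(Item(source, str(row[id_column]), props))
    return items


def build_term_index(expert_mappings):
    """Step 2 (semantic): index expert-asserted links from (source, term) to a shared concept."""
    return {(m["source"], m["term"]): m["concept"] for m in expert_mappings}


def query(items, index, prop, concept):
    """Return items whose value for `prop` an expert has linked to `concept`."""
    return [it for it in items
            if index.get((it.source, it.properties.get(prop))) == concept]


# Hypothetical legacy tables with different ID columns and vocabularies.
site_a = [{"find_no": 1, "material": "ovicaprid"}, {"find_no": 2, "material": "flint"}]
site_b = [{"id": "B-7", "material": "sheep/goat"}]

items = migrate("site_a", site_a, "find_no") + migrate("site_b", site_b, "id")

# Mappings are authored by a person, not inferred by the code.
mappings = [
    {"source": "site_a", "term": "ovicaprid", "concept": "sheep-or-goat"},
    {"source": "site_b", "term": "sheep/goat", "concept": "sheep-or-goat"},
]
index = build_term_index(mappings)
matches = query(items, index, "material", "sheep-or-goat")
print([(m.source, m.item_id) for m in matches])
```

In this sketch the migration step needs no knowledge of what the terms mean, which is why it can be quick, while the cross-data-set query only works once someone has asserted that the two local terms refer to the same concept.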