Reconstructing Historical Populations from Genealogical Data Files

Over the past two decades, a vast number of historical documents have been digitised and made available online. At the same time, a growing range of software and websites has encouraged people to research their family trees, leading to a surge in the availability of genealogical data. From a scientific perspective, a major advantage of genealogical data is that it combines information from many sources into a format structured by family relations and descent, which is well suited to studying the dynamics of population change across generations. A critical issue for researchers who want to use genealogical data is how to assess its quality and put in place measures to correct the errors found in it. In this chapter, I present some of the methods used to filter, clean and aggregate genealogical data to create large datasets that may be used across a diverse range of academic disciplines.
