A graphical approach to relatedness inference.

The estimation of relatedness structure in natural populations using molecular marker data has become an important tool in population biology, resulting in a variety of estimation procedures for specific sampling scenarios. In this article a general approach is proposed, in which the detailed relationship structure, typically a pedigree graph or partition, is considered to be the object of inference. This makes available tools used in complex model selection theory which have demonstrated effectiveness. An important advantage of this approach is that it permits a fully Bayesian approach to the problem, providing a principled and accessible way to measure statistical error. The approach is demonstrated by applying the minimum description length principle. This technique is used in model selection to provide a rational way of comparing models of varying complexity. We show how the resulting score may be interpreted and applied as a Bayesian posterior density.

[1]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[2]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[3]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[4]  Ian Painter,et al.  Sibship Reconstruction without Parental Information , 1997 .

[5]  T. C. Marshall,et al.  Statistical confidence for likelihood‐based paternity inference in natural populations , 1998, Molecular ecology.

[6]  Dany Garant,et al.  How to use molecular marker data to measure evolutionary parameters in wild populations , 2005, Molecular ecology.

[7]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[8]  A. Almudevar Most Powerful Permutation Invariant Tests for Relatedness Hypotheses Using Genotypic Data , 2001, Biometrics.

[9]  Michael S. Blouin,et al.  DNA-based methods for pedigree reconstruction and kinship analysis in natural populations , 2003 .

[10]  Mark A. Pitt,et al.  Advances in Minimum Description Length: Theory and Applications , 2005 .

[11]  C. Field,et al.  Accuracy, efficiency and robustness of four algorithms allowing full sibship reconstruction from DNA marker data , 2004, Molecular ecology.

[12]  W. G. Hill,et al.  Estimating quantitative genetic parameters using sibships reconstructed from marker data. , 2000, Genetics.

[13]  A. Jones,et al.  Methods of parentage analysis in natural populations , 2003, Molecular ecology.

[14]  W. G. Hill,et al.  Sibship reconstruction in hierarchical population structures using Markov chain Monte Carlo techniques. , 2002, Genetical research.

[15]  E A Thompson,et al.  Parental and sib likelihoods in genealogy reconstruction. , 1987, Biometrics.

[16]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[17]  K. Roeder,et al.  Fractional paternity assignment: theoretical development and comparison to other methods , 1988, Theoretical and Applied Genetics.

[18]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[19]  A Bootstrap Assessment of Variability in Pedigree Reconstruction Based on Genetic Markers , 2001, Biometrics.

[20]  M R Gross,et al.  A bayesian framework for parentage analysis: the value of genetic and other biological data. , 2001, Theoretical population biology.

[21]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[22]  C. Field,et al.  Estimation of Single-Generation Sibling Relationships Based on DNA Markers , 1999 .

[23]  Dmitry A. Konovalov,et al.  Modified SIMPSON O(n3) algorithm for the full sibship reconstruction problem , 2005, Bioinform..

[24]  J. Rissanen A UNIVERSAL PRIOR FOR INTEGERS AND ESTIMATION BY MINIMUM DESCRIPTION LENGTH , 1983 .

[25]  L. Noble,et al.  Assignment of paternity groups without access to parental genotypes: multiple mating and developmental plasticity in squid , 2001, Molecular ecology.

[26]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[27]  Richard W. Hamming,et al.  Coding and Information Theory , 2018, Feynman Lectures on Computation.

[28]  P. Smouse,et al.  Genetic analysis of male reproductive contributions in Chamaelirium luteum (L.) gray (Liliaceae). , 1994, Genetics.

[29]  D. Cook,et al.  Family relationships and effective population size in a natural cohort of Atlantic cod (Gadus morhua) larvae , 1997 .

[30]  R. Nielsen,et al.  Statistical approaches to paternity analysis in natural populations and applications to the North Atlantic humpback whale. , 2001, Genetics.

[31]  Nir Friedman,et al.  Learning Bayesian Networks with Local Structure , 1996, UAI.

[32]  Jinliang Wang,et al.  Sibship reconstruction from genetic data with typing errors. , 2004, Genetics.

[33]  Anthony Almudevar,et al.  A simulated annealing algorithm for maximum likelihood pedigree reconstruction. , 2003, Theoretical population biology.

[34]  Stuart C Thomas,et al.  The estimation of genetic relationships using molecular markers and their efficiency in estimating heritability in natural populations , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[35]  Gregory F. Cooper,et al.  Model Averaging for Prediction with Discrete Bayesian Networks , 2004, J. Mach. Learn. Res..