Detecting autozygosity through runs of homozygosity: A comparison of three autozygosity detection algorithms

Background: A central aim for studying runs of homozygosity (ROHs) in genome-wide SNP data is to detect the effects of autozygosity (stretches of the two homologous chromosomes within the same individual that are identical by descent) on phenotypes. However, it is unknown which current ROH detection program, and which set of parameters within a given program, is optimal for differentiating ROHs that are truly autozygous from ROHs that are homozygous at the marker level but vary at unmeasured variants between the markers.

Method: We simulated 120 Mb of sequence data in order to know the true state of autozygosity. We then extracted common variants from this sequence to mimic the properties of SNP platforms and performed ROH analyses using three popular ROH detection programs, PLINK, GERMLINE, and BEAGLE. We varied detection thresholds for each program (e.g., prior probabilities, lengths of ROHs) to understand their effects on detecting known autozygosity.

Results: Within the optimal thresholds for each program, PLINK outperformed GERMLINE and BEAGLE in detecting autozygosity from distant common ancestors. PLINK's sliding window algorithm worked best when using SNP data pruned for linkage disequilibrium (LD).

Conclusion: Our results provide both general and specific recommendations for maximizing autozygosity detection in genome-wide SNP data, and should apply equally well to research on whole-genome autozygosity burden or to research on whether specific autozygous regions are predictive using association mapping methods.
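To make concrete what "detection thresholds" such as minimum ROH length mean, the following is a minimal Python sketch of a naive ROH scan. It is not the algorithm used by PLINK, GERMLINE, or BEAGLE: it simply reports maximal stretches of consecutive homozygous SNP calls and allows no heterozygous or missing calls inside a run. The function name `find_rohs`, the 0/1/2 genotype coding, and the default thresholds are assumptions made for illustration only.

```python
from typing import List, Tuple


def find_rohs(
    genotypes: List[int],
    positions: List[int],
    min_snps: int = 50,              # hypothetical default, not taken from the paper
    min_length_bp: int = 1_000_000,  # hypothetical default, not taken from the paper
) -> List[Tuple[int, int, int]]:
    """Return (start_bp, end_bp, n_snps) for each run of homozygous calls.

    genotypes: allele counts per SNP (0 or 2 = homozygous, 1 = heterozygous).
    positions: base-pair positions of the SNPs, sorted ascending.
    """
    if len(genotypes) != len(positions):
        raise ValueError("genotypes and positions must have the same length")

    runs: List[Tuple[int, int, int]] = []
    start = None  # index of the first SNP in the current homozygous run

    # Iterate one step past the end so a run reaching the last SNP is closed.
    for i in range(len(genotypes) + 1):
        is_hom = i < len(genotypes) and genotypes[i] != 1
        if is_hom:
            if start is None:
                start = i
        elif start is not None:
            end = i - 1
            n_snps = end - start + 1
            length_bp = positions[end] - positions[start]
            # Keep the run only if it passes both thresholds.
            if n_snps >= min_snps and length_bp >= min_length_bp:
                runs.append((positions[start], positions[end], n_snps))
            start = None
    return runs


# Toy example with deliberately small thresholds.
toy_genotypes = [0, 2, 2, 1, 0, 0, 0, 0, 1, 2]
toy_positions = [100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
toy_runs = find_rohs(toy_genotypes, toy_positions, min_snps=3, min_length_bp=200)
```

Raising `min_snps` or `min_length_bp` discards shorter runs. A run found this way is homozygous only at the measured markers, so it may still vary at unmeasured variants between them, which is the distinction from true autozygosity that the study evaluates.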
