Importance Sampling for Estimating p Values in Linkage Analysis

Importance sampling methods are proposed for estimating the probability that the maximum of a random process exceeds a high threshold, with particular attention to assessing genome-wide significance levels in linkage analysis. The proposed algorithm is applied to computing the conditional significance level, given observed phenotypes, of scan statistics to map quantitative traits in experimental populations and in extended pedigrees of moderate size with partially informative markers. For detecting an additive effect in an intercross, the importance sampling algorithm is roughly 100 times as efficient as direct Monte Carlo simulation when the true significance level is about .05. For pedigrees with partially informative markers, the efficiency varies greatly with intermarker distance, marker polymorphism, sample size, and missing data. In this case the importance sampling algorithm is used to efficiently explore how p values vary with these parameters. Extensions of the algorithm are discussed for the case of multidimensional statistics that arise when considering dominance effects or searching for multiple loci that may involve interactions.

[1]  Carey E Priebe,et al.  Computing Scan Statistic p Values Using Importance Sampling, With Applications to Genetics and Medical Image Analysis , 2001 .

[2]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[3]  David Siegmund,et al.  Approximate Tail Probabilities for the Maxima of Some Random Fields , 1988 .

[4]  Alan C. Evans,et al.  A Three-Dimensional Statistical Analysis for CBF Activation Studies in Human Brain , 1992, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[5]  C. Molony,et al.  Genetic analysis of genome-wide variation in human gene expression , 2004, Nature.

[6]  Daniel Q. Naiman,et al.  p-Value Simulation for Affected Sib Pair Multiple Testing , 2005, Human Heredity.

[7]  A new representation for a renewal-theoretic constant appearing in asymptotic approximations of large deviations , 1998 .

[8]  Michael Woodroofe,et al.  Frequentist properties of Bayesian sequential tests , 1976 .

[9]  Daniel Q. Naiman,et al.  A Comprehensive Method for Genome Scans , 2003, Human Heredity.

[10]  E Feingold,et al.  Gaussian models for genetic linkage analysis using complete high-resolution maps of identity by descent. , 1993, American journal of human genetics.

[11]  D Siegmund,et al.  Statistical methods for mapping quantitative trait loci from a dense set of markers. , 1999, Genetics.

[12]  G. Abecasis,et al.  Handling marker-marker linkage disequilibrium: pedigree analysis with clustered markers. , 2005, American journal of human genetics.

[13]  D Siegmund,et al.  Mapping quantitative trait loci in oligogenic models. , 2001, Biostatistics.

[14]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[15]  H. Ola,et al.  Using importance sampling to improve simulation in linkage analysis. , 2004 .

[16]  Mapping quantitative traits with random and with ascertained sibships. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Ola Hössjer,et al.  Statistical Applications in Genetics and Molecular Biology Using Importance Sampling to Improve Simulation in Linkage Analysis , 2011 .

[18]  Martin Kulldorff,et al.  Power evaluation of disease clustering tests , 2003, International journal of health geographics.

[19]  Benjamin Yakir,et al.  Tail probabilities for the null distribution of scanning statistics , 1998 .

[20]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[21]  E. Lander,et al.  Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. , 1989, Genetics.

[22]  L. Almasy,et al.  Multipoint quantitative-trait linkage analysis in general pedigrees. , 1998, American journal of human genetics.

[23]  Pablo V Gejman,et al.  Genomewide linkage scan of 409 European-ancestry and African American families with schizophrenia: suggestive evidence of linkage at 8p23.3-p21.2 and 11p13.1-q14.1 in the combined sample. , 2006, American journal of human genetics.