Scan statistics to scan markers for susceptibility genes.

Scan statistics are applied to combine information on multiple contiguous genetic markers used in a genome screen for susceptibility loci. This information may be, for example, allele sharing proportions for sib pairs or logarithm of odds (lod) scores in general small families. We focus on a dichotomous outcome variable, for example, case and control individuals or affected-affected versus affected-unaffected siblings, and suitable single-marker statistics. A significant scan statistic based on the single-marker statistics represents evidence of the presence of a susceptibility gene. For a given length of the scan statistic, we assess its significance by Monte Carlo permutation tests. Comparing P values for varying lengths of scan statistics, we treat the smallest observed P value as our statistic of interest and determine its overall significance level. We applied this method to a genome screen with autism families. The result was informative and surprising: A susceptibility region was found (genome-wide significance level, P = 0.038), which is missed with conventional approaches.

[1]  B Efron,et al.  Statistical Data Analysis in the Computer Age , 1991, Science.

[2]  S. Karlin,et al.  Chance and statistical significance in protein and DNA sequence analysis. , 1992, Science.

[3]  P. Sham,et al.  Joint linkage of multiple loci for a complex disorder. , 1993, American journal of human genetics.

[4]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[5]  Margot Albus,et al.  Evaluation of a susceptibility gene for schizophrenia on chromosome 6p by multipoint affected sib–pair linkage analysis , 1995, Nature Genetics.

[6]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[7]  G. Chase,et al.  Improvement of the power to detect complex disease genes by regional inference procedures , 1997, Genetic epidemiology.

[8]  D E Weeks,et al.  True and false positive peaks in genomewide scans: applications of length-biased sampling to linkage mapping. , 1997, American journal of human genetics.

[9]  S. Moldin Detection and replication of linkage to a complex human disease , 1997, Genetic epidemiology.

[10]  Alexander F. Wilson,et al.  Regional inference with averaged P values increases the power to detect linkage , 1999, Genetic epidemiology.

[11]  Nancy J. Cox,et al.  Loci on chromosomes 2 (NIDDM1) and 15 interact to increase susceptibility to diabetes in Mexican Americans , 1999, Nature Genetics.

[12]  N E Morton,et al.  Genetic epidemiology of single-nucleotide polymorphisms. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Daniel F. Gudbjartsson,et al.  Allegro, a new computer program for multipoint linkage analysis , 2000, Nature genetics.

[14]  M. Daly,et al.  Full-genome scan for linkage in 50 families segregating the bipolar affective disease phenotype. , 2000, American journal of human genetics.

[15]  B. Manly Randomization, Bootstrap and Monte Carlo Methods in Biology , 2018 .