Statistical power and significance testing in large-scale genetic studies

[1]  G. Kirov,et al.  Analysis of copy number variations at 15 schizophrenia-associated loci , 2014, The British journal of psychiatry : the journal of mental science.

[2]  M. Daly,et al.  Searching for missing heritability: Designing rare variant association studies , 2014, Proceedings of the National Academy of Sciences.

[3]  Iuliana Ionita-Laza,et al.  Family-based association tests for sequence data, and comparisons with population-based association tests , 2013, European Journal of Human Genetics.

[4]  C. Greenwood,et al.  Empirical power of very rare variants for common traits and disease: results from sanger sequencing 1998 individuals , 2013, European Journal of Human Genetics.

[5]  Dan-Yu Lin,et al.  Meta-analysis of gene-level associations for rare variants based on single-variant statistics. , 2013, American journal of human genetics.

[6]  Kathryn Roeder,et al.  Integrated Model of De Novo and Inherited Genetic Variants Yields Greater Power to Identify Risk Genes , 2013, PLoS genetics.

[7]  Seunggeun Lee,et al.  General framework for meta-analysis of rare variants in sequencing association studies. , 2013, American journal of human genetics.

[8]  Kathryn Roeder,et al.  Analysis of Rare, Exonic Variation amongst Subjects with Autism Spectrum Disorders and Population Controls , 2013, PLoS genetics.

[9]  Kathryn Roeder,et al.  Rare Complete Knockouts in Humans: Population Distribution and Significant Role in Autism Spectrum Disorders , 2013, Neuron.

[10]  S. Gabriel,et al.  Analysis of 6,515 exomes reveals a recent origin of most human protein-coding variants , 2012, Nature.

[11]  P. Gregersen,et al.  Immunochip analyses identify a novel risk locus for primary biliary cirrhosis at 13q14, multiple independent associations at four established risk loci and epistasis between 1p31 and 7q32 risk variants. , 2012, Human molecular genetics.

[12]  Biao Li,et al.  SimRare: a program to generate and analyze sequence-based data for association studies of quantitative and qualitative traits , 2012, Bioinform..

[13]  Xihong Lin,et al.  Optimal tests for rare variant effects in sequencing association studies. , 2012, Biostatistics.

[14]  M. Pirinen,et al.  Including known covariates can reduce power to detect genetic effects in case-control studies , 2012, Nature Genetics.

[15]  Tanya M. Teslovich,et al.  The Metabochip, a Custom Genotyping Array for Genetic Studies of Metabolic, Cardiovascular, and Anthropometric Traits , 2012, PLoS genetics.

[16]  Claudio J. Verzilli,et al.  An Abundance of Rare Functional Variants in 202 Drug Target Genes Sequenced in 14,002 People , 2012, Science.

[17]  Adam Kiezun,et al.  Exome sequencing and the genetic basis of complex traits , 2012, Nature Genetics.

[18]  Kenny Q. Ye,et al.  De Novo Gene Disruptions in Children on the Autistic Spectrum , 2012, Neuron.

[19]  Yurii S. Aulchenko,et al.  The Empirical Power of Rare Variant Association Methods: Results from Sanger Sequencing in 1,998 Individuals , 2012, PLoS genetics.

[20]  John P A Ioannidis,et al.  What should the genome-wide significance threshold be? Empirical replication of borderline genetic associations. , 2012, International journal of epidemiology.

[21]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[22]  Johnny S. H. Kwan,et al.  A comprehensive framework for prioritizing variants in exome sequencing studies of Mendelian diseases , 2012, Nucleic acids research.

[23]  G. McVean,et al.  Differential confounding of rare and common variants in spatially structured populations , 2011, Nature Genetics.

[24]  Degui Zhi,et al.  Statistical Guidance for Experimental Design and Data Analysis of Mutation Detection in Rare Monogenic Mendelian Diseases by Exome Sequencing , 2012, PloS one.

[25]  Stacey S. Cherny,et al.  Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets , 2011, Human Genetics.

[26]  D. Conti,et al.  Using extreme phenotype sampling to identify the rare causal variants of quantitative traits in association studies , 2011, Genetic epidemiology.

[27]  Alexander F. Wilson,et al.  Linkage Analysis in the Next-Generation Sequencing Era , 2011, Human Heredity.

[28]  Weiliang Qiu,et al.  Combining effects from rare and common genetic variants in an exome-wide association study of sequence data , 2011, BMC proceedings.

[29]  Mohamad Saad,et al.  Comparative study of statistical methods for detecting association with rare variants in exome-resequencing data , 2011, BMC proceedings.

[30]  J. Shendure,et al.  Exome sequencing as a tool for Mendelian disease gene discovery , 2011, Nature Reviews Genetics.

[31]  Wei Pan,et al.  Comparison of statistical tests for disease association with rare variants , 2011, Genetic epidemiology.

[32]  Adam Kiezun,et al.  Computational and statistical approaches to analyzing variants identified by exome sequencing , 2011, Genome Biology.

[33]  Dan-Yu Lin,et al.  A general framework for detecting disease associations with rare variants in sequencing studies. , 2011, American journal of human genetics.

[34]  M. Southey,et al.  Design Considerations for Massively Parallel Sequencing Studies of Complex Human Disease , 2011, PloS one.

[35]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[36]  Kathryn Roeder,et al.  Testing for an Unusual Distribution of Rare Variants , 2011, PLoS genetics.

[37]  Hon-Cheong So,et al.  Robust Association Tests Under Different Genetic Models, Allowing for Binary or Quantitative Traits and Covariates , 2011, Behavior genetics.

[38]  Kenny Q. Ye,et al.  Detecting multiple causal rare variants in exome sequence data , 2011, Genetic Epidemiology.

[39]  Garrett P. Larson,et al.  Three Ways of Combining Genotyping and Resequencing in Case-Control Association Studies , 2010, PloS one.

[40]  Dajiang J Liu,et al.  Replication strategies for rare variant complex trait association studies via next-generation sequencing. , 2010, American journal of human genetics.

[41]  V. Bansal,et al.  Statistical analysis strategies for association studies involving rare variants , 2010, Nature Reviews Genetics.

[42]  Emily H Turner,et al.  Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome , 2010, Nature Genetics.

[43]  Gary D Bader,et al.  Functional impact of global rare copy number variation in autism spectrum disorders , 2010, Nature.

[44]  Lee-Jen Wei,et al.  Pooled Association Tests for Rare Variants in Exon-Resequencing Studies , 2010 .

[45]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[46]  Ku Chee Seng,et al.  How Many Genetic Variants Remain to Be Discovered? , 2009, PloS one.

[47]  N. Galwey,et al.  A new measure of the effective number of tests, a practical tool for comparing families of non‐independent significance tests , 2009, Genetic epidemiology.

[48]  M. Stephens,et al.  Bayesian statistical methods for genetic association studies , 2009, Nature Reviews Genetics.

[49]  P. Sham,et al.  Novel Sib Pair Selection Strategy Increases Power in Quantitative Association Analysis , 2009, Behavior genetics.

[50]  Suzanne M. Leal,et al.  Discovery of Rare Variants via Sequencing: Implications for the Design of Complex Trait Association Studies , 2009, PLoS genetics.

[51]  J. Todd,et al.  Rare Variants of IFIH1, a Gene Implicated in Antiviral Responses, Protect Against Type 1 Diabetes , 2009, Science.

[52]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[53]  Jon Wakefield,et al.  Bayes factors for genome‐wide association studies: comparison with P‐values , 2009, Genetic epidemiology.

[54]  Joan E Bailey-Wilson,et al.  Establishing an adjusted p-value threshold to control the family-wide type 1 error in genome wide association studies , 2008, BMC Genomics.

[55]  R. Prentice,et al.  Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies. , 2008, Biostatistics.

[56]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[57]  V. Moskvina,et al.  On multiple‐testing correction in genome‐wide association studies , 2008, Genetic epidemiology.

[58]  J. Ioannidis Why Most Discovered True Associations Are Inflated , 2008, Epidemiology.

[59]  Fei Zou,et al.  Estimating odds ratios in genome scans: an approximate conditional likelihood approach. , 2008, American journal of human genetics.

[60]  Qizhai Li,et al.  Efficient Approximation of P‐value of the Maximum of Correlated Tests, with Applications to Genome‐Wide Association Studies , 2008, Annals of human genetics.

[61]  M. Daly,et al.  Estimation of the multiple testing burden for genomewide association studies of nearly all common variants , 2008, Genetic epidemiology.

[62]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[63]  Xavier Estivill,et al.  Maximizing association statistics over genetic models , 2008, Genetic epidemiology.

[64]  F. Dudbridge,et al.  Estimation of significance thresholds for genomewide association scans , 2008, Genetic epidemiology.

[65]  C. Hoggart,et al.  Genome‐wide significance for dense SNP and resequencing data , 2008, Genetic epidemiology.

[66]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[67]  Shamil R Sunyaev,et al.  Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. , 2007, American journal of human genetics.

[68]  J. Pritchard,et al.  Overcoming the winner's curse: estimating penetrance parameters from case-control data. , 2007, American journal of human genetics.

[69]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[70]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[71]  J. Li,et al.  Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix , 2005, Heredity.

[72]  B Müller-Myhsok,et al.  Rapid simulation of P values for product methods and multiple-testing adjustment in association studies. , 2005, American journal of human genetics.

[73]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[74]  D. Clayton,et al.  Genome-wide association studies: theoretical and practical concerns , 2005, Nature Reviews Genetics.

[75]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[76]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[77]  R. Fisher Statistical methods for research workers , 1927, Protoplasma.

[78]  Frank Dudbridge,et al.  Efficient computation of significance levels for multiple associations in large studies of correlated data, including genomewide association studies. , 2004, American journal of human genetics.

[79]  D. Nyholt A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. , 2004, American journal of human genetics.

[80]  Nathaniel Rothman,et al.  Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. , 2004, Journal of the National Cancer Institute.

[81]  John P A Ioannidis,et al.  Genetic associations: false or true? , 2003, Trends in molecular medicine.

[82]  D Curtis,et al.  A note on calculation of empirical P values from Monte Carlo procedure. , 2003, American Journal of Human Genetics.

[83]  Pak Chung Sham,et al.  Genetic Power Calculator: design of linkage and association genetic mapping studies of complex traits , 2003, Bioinform..

[84]  P. Sham,et al.  A note on the calculation of empirical P values from Monte Carlo procedures. , 2002, American journal of human genetics.

[85]  J. Hirschhorn,et al.  A comprehensive review of genetic association studies , 2002, Genetics in Medicine.

[86]  W. Gauderman Sample size requirements for association studies of gene-gene interaction. , 2002, American journal of epidemiology.

[87]  W James Gauderman,et al.  Sample size requirements for matched case‐control studies of gene–environment interaction , 2002, Statistics in medicine.

[88]  N E Day,et al.  Sample size determination for studies of gene-environment interaction. , 2001, International journal of epidemiology.

[89]  R. Nickerson,et al.  Null hypothesis significance testing: a review of an old and continuing controversy. , 2000, Psychological methods.

[90]  P. Sham,et al.  Power of linkage versus association analysis of quantitative traits, by use of variance-components models, for sibship data. , 2000, American journal of human genetics.

[91]  C. Lewis,et al.  Power comparisons of the transmission/disequilibrium test and sib-transmission/disequilibrium-test statistics. , 1999, American journal of human genetics.

[92]  J K Hewitt,et al.  Combined linkage and association sib-pair analysis for quantitative traits. , 1999, American journal of human genetics.

[93]  Anthony C. Davison,et al.  Bootstrap Methods and Their Application , 1998 .

[94]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[95]  Z. Šidák Rectangular Confidence Regions for the Means of Multivariate Normal Distributions , 1967 .

[96]  P. Patnaik Corrigenda: The Power Function of the Test for the Difference Between Two Proportions in a 2 × 2 Table , 1959 .

[97]  P. Patnaik THE NON-CENTRAL χ2- AND F-DISTRIBUTIONS AND THEIR APPLICATIONS , 1949 .

[98]  P. Patnaik The Non-central X^2- and F- distribution and Their Applications , 1949 .

[99]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .