Fisher's method of combining dependent statistics using generalizations of the gamma distribution with applications to genetic pleiotropic associations.

A classical approach to combining independent test statistics is Fisher's combination of $p$-values, whose test statistic follows a $\chi^2$ distribution under the null hypothesis. When the test statistics are dependent, the gamma distribution (GD) is commonly used to approximate the null distribution of the Fisher's combination test (FCT) statistic. We propose using two generalizations of the GD: the generalized GD and the exponentiated GD. We study the consequences of misusing the GD for the FCT to combine dependent statistics when one of the two proposed distributions is the true one. Our results show that both generalizations control type I error rates better than the GD, which tends to have inflated type I error rates in the more extreme tails. In practice, common model selection criteria (e.g., the Akaike information criterion or the Bayesian information criterion) can help select a better distribution for the FCT. A simple strategy for using the two generalizations of the GD in genome-wide association studies is discussed. Applications of the results to genetic pleiotropic associations are described, in which multiple traits are tested for association with a single marker.
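As a point of reference, the independent case and the usual gamma approximation can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the $\chi^2$ tail uses the closed form available for even degrees of freedom ($2k$ for $k$ tests), and the gamma parameters are obtained by simple moment matching to a mean and variance that the caller is assumed to supply (for example, from an estimate that accounts for the correlation among the tests). The generalized and exponentiated GDs are not fitted here.

```python
import math


def fisher_statistic(p_values):
    """Fisher's combination statistic T = -2 * sum(log p_i)."""
    return -2.0 * sum(math.log(p) for p in p_values)


def chi2_sf_even_df(t, k):
    """Upper tail P(X > t) for a chi-square variable with 2k degrees of freedom.

    For even degrees of freedom the tail has the closed form
    exp(-t/2) * sum_{j=0}^{k-1} (t/2)^j / j!.
    """
    half = t / 2.0
    term = 1.0   # (t/2)^0 / 0!
    total = 1.0
    for j in range(1, k):
        term *= half / j
        total += term
    return math.exp(-half) * total


def fisher_combined_p(p_values):
    """Combined p-value, valid when the k tests are independent."""
    return chi2_sf_even_df(fisher_statistic(p_values), len(p_values))


def gamma_moment_match(mean, variance):
    """Shape and scale of a gamma distribution with the given mean and variance.

    Assumption: 'mean' and 'variance' are the null mean and variance of T,
    supplied by the caller (hypothetical inputs for this sketch).
    """
    shape = mean ** 2 / variance
    scale = variance / mean
    return shape, scale


if __name__ == "__main__":
    p_vals = [0.05, 0.05]
    print(fisher_combined_p(p_vals))
    # Under independence, E[T] = 2k and Var[T] = 4k, so moment matching
    # recovers shape k and scale 2, i.e. the chi-square with 2k df.
    print(gamma_moment_match(2 * len(p_vals), 4 * len(p_vals)))
```

Under independence the moment-matched gamma coincides with the $\chi^2_{2k}$ distribution; dependence among the tests changes the variance of the statistic, which is what moves the shape and scale away from $(k, 2)$.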
