A FUNCTIONAL DATA ANALYSIS APPROACH FOR GENETIC ASSOCIATION STUDIES

We present a new method based on Functional Data Analysis (FDA) for detecting associations between one or more scalar covariates and a longitudinal response, while correcting for other variables. Our methods exploit the temporal structure of longitudinal data in ways that are otherwise difficult with a multivariate approach. Our procedure, from an FDA perspective, is a departure from more established methods in two key aspects. First, the raw longitudinal phenotypes are assembled into functional trajectories prior to analysis. Second, we explore an association test that is not directly based on principal components. We instead focus on quantifying the reduction in L 2 variability as a means of detecting associations. Our procedure is motivated by longitudinal genome wide association studies and, in particular, the childhood asthma management program (CAMP) which explores the long term effects of daily asthma treatments. We conduct a simulation study to better understand the advantages (and/or disadvantages) of an FDA approach compared to a traditional multivariate one. We then apply our methodology to data coming from CAMP. We find a potentially new association with a SNP negatively affecting lung function. Furthermore, this SNP seems to have an interaction effect with one of the treatments.

[1]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[2]  Piotr Kokoszka,et al.  Nonparametric inference in small data sets of spatially indexed curves with application to ionospheric trend determination , 2013, Comput. Stat. Data Anal..

[3]  Piotr Kokoszka,et al.  Determining the order of the functional autoregressive model , 2013 .

[4]  Nicolas Verzelen,et al.  Inferring stochastic dynamics from functional data , 2012 .

[5]  Kehui Chen,et al.  Conditional quantile analysis when covariates are functions, with application to growth data , 2012 .

[6]  Christos Davatzikos,et al.  Functional principal component model for high-dimensional brain imaging , 2011, NeuroImage.

[7]  Yusuke Nakamura,et al.  Genomewide association between GLCCI1 and response to glucocorticoid therapy in asthma. , 2011, The New England journal of medicine.

[8]  Lei Huang,et al.  Extracting information from functional connectivity maps via function-on-scalar regression , 2011, NeuroImage.

[9]  Philip T. Reiss,et al.  The International Journal of Biostatistics Fast Function-on-Scalar Regression with Penalized Basis Expansions , 2011 .

[10]  Pierre Lafaye de Micheaux,et al.  Computing the distribution of quadratic forms: Further comparisons between the Liu-Tang-Zhang approximation and exact methods , 2010, Comput. Stat. Data Anal..

[11]  Piotr Kokoszka,et al.  Testing for lack of dependence in the functional linear model , 2008 .

[12]  H. Müller,et al.  Time-synchronized clustering of gene expression trajectories. , 2008, Biostatistics.

[13]  Jin-Ting Zhang,et al.  Statistical inferences for functional data , 2007, 0708.2207.

[14]  Anestis Antoniadis,et al.  Estimation and inference in functional mixed-effects models , 2007, Comput. Stat. Data Anal..

[15]  P. Hall,et al.  Properties of principal component methods for functional and longitudinal data analysis , 2006, math/0608022.

[16]  R. Wu,et al.  Functional mapping — how to map and study the genetic architecture of dynamic complex traits , 2006, Nature Reviews Genetics.

[17]  H. Müller,et al.  Functional Data Analysis for Sparse Longitudinal Data , 2005 .

[18]  André Mas,et al.  Testing hypotheses in the functional linear model , 2003 .

[19]  G. Casella,et al.  Functional mapping of quantitative trait loci underlying the character process: a theoretical framework. , 2002, Genetics.

[20]  N Franklin Adkinson,et al.  Long-term effects of budesonide or nedocromil in children with asthma. , 2000, The New England journal of medicine.

[21]  D. Bosq Linear Processes in Function Spaces: Theory And Applications , 2000 .

[22]  Jianqing Fan,et al.  Two‐step estimation of functional linear models with applications to longitudinal data , 1999 .

[23]  Heather Eliassen,et al.  The Childhood Asthma Management Program (CAMP): design, rationale, and methods. Childhood Asthma Management Program Research Group. , 1999, Controlled clinical trials.

[24]  J. O. Ramsay,et al.  Functional Data Analysis (Springer Series in Statistics) , 1997 .

[25]  J. Imhof Computing the distribution of quadratic forms in normal variables , 1961 .

[26]  M. Reimherr Functional data methods for genome-wide association studies , 2013 .

[27]  Israel Gohberg,et al.  Basic Classes of Linear Operators , 2004 .

[28]  Denis Bosq,et al.  Linear Processes in Function Spaces , 2000 .