Expectile Neural Networks for Genetic Data Analysis of Complex Diseases

The genetic etiologies of common diseases are highly complex and heterogeneous. Classic methods, such as linear regression, have successfully identified numerous variants associated with complex diseases. Nonetheless, for most diseases, the identified variants only account for a small proportion of heritability. Challenges remain to discover additional variants contributing to complex diseases. Expectile regression is a generalization of linear regression and provides complete information on the conditional distribution of a phenotype of interest. While expectile regression has many nice properties, it has been rarely used in genetic research. In this paper, we develop an expectile neural network (ENN) method for genetic data analyses of complex diseases. Similar to expectile regression, ENN provides a comprehensive view of relationships between genetic variants and disease phenotypes and can be used to discover variants predisposing to sub-populations. We further integrate the idea of neural networks into ENN, making it capable of capturing non-linear and non-additive genetic effects (e.g., gene-gene interactions). Through simulations, we showed that the proposed method outperformed an existing expectile regression when there exist complex genotype-phenotype relationships. We also applied the proposed method to the data from the Study of Addiction: Genetics and Environment(SAGE), investigating the relationships of candidate genes with smoking quantity.

[1]  J. Crowley,et al.  Covariance Analysis of Heart Transplant Survival Data , 1977 .

[2]  Qifa Xu,et al.  Expectile regression neural network model with applications , 2017, Neurocomputing.

[3]  James W. Taylor A Quantile Regression Neural Network Approach to Estimating the Conditional Density of Multiperiod Returns , 2000 .

[4]  Ingo Steinwart,et al.  Learning rates for kernel-based expectile regression , 2018, Machine Learning.

[5]  Alex J. Cannon,et al.  Non-crossing nonlinear regression quantiles by monotone composite quantile regression neural network, with application to rainfall extremes , 2018, Stochastic Environmental Research and Risk Assessment.

[6]  Linda Schulze Waltrup,et al.  Expectile and quantile regression—David and Goliath? , 2015 .

[7]  Moshe Buchinsky,et al.  Quantile regression, Box-Cox transformation model, and the U.S. wage structure, 1963–1987 , 1995 .

[8]  H. Tong,et al.  Asymmetric least squares regression estimation: A nonparametric approach ∗ , 1996 .

[9]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[10]  V. Nguyen,et al.  A comparative study of regression based methods in regional flood frequency analysis , 1999 .

[11]  Alex J. Cannon Quantile regression neural networks: Implementation in R and application to precipitation downscaling , 2011, Comput. Geosci..

[12]  Sangyeol Lee,et al.  Nonlinear expectile regression with application to Value-at-Risk and expected shortfall estimation , 2016, Comput. Stat. Data Anal..

[13]  M. King,et al.  Genetic Heterogeneity in Human Disease , 2010, Cell.

[14]  Ming D. Li,et al.  Association and interaction analysis of variants in CHRNA5/CHRNA3/CHRNB4 gene cluster with nicotine dependence in African and European Americans , 2009, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[15]  W. Newey,et al.  Asymmetric Least Squares Estimation and Testing , 1987 .

[16]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[17]  R. Koenker,et al.  Regression Quantiles , 2007 .

[18]  蕭瓊瑞撰述,et al.  2009 , 2019, The Winning Cars of the Indianapolis 500.

[19]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[20]  S. Lipsitz,et al.  Quantile Regression Methods for Longitudinal Data with Drop‐outs: Application to CD4 Cell Counts of Patients Infected with the Human Immunodeficiency Virus , 1997 .

[21]  Hosik Choi,et al.  Penalized expectile regression: an alternative to penalized quantile regression , 2019 .