Trans-ethnic Polygenic Risk Scores for Body Mass Index: An International Hundred K+ Cohorts Consortium Study

Background While polygenic risk scores hold significant promise in estimating an individual's risk of developing a complex trait such as obesity, their application in the clinic has, to date, been limited by a lack of data from non-European populations. As a collaboration model of the International Hundred K+ Cohorts Consortium (IHCC), we endeavored to develop a globally applicable trans-ethnic PRS for body mass index (BMI) through this relatively new international effort. Methods The PRS model was developed trained and tested at the Center for Applied Genomics (CAG) of The Children's Hospital of Philadelphia (CHOP) based on a BMI meta-analysis from the GIANT consortium. The validated PRS models were subsequently disseminated to the participating sites. Scores were generated by each site locally on their cohorts and summary statistics returned to CAG for final analysis. Results We show that in the absence of a well powered trans-ethnic GWAS from which to derive SNPs and effect estimates, trans-ethnic scores can be generated from European ancestry GWAS using Bayesian approaches such as LDpred to adjust the summary statistics using trans-ethnic linkage disequilibrium reference panels. The ported trans-ethnic scores outperform population specific-PRS across all non-European ancestry populations investigated including East Asians and three-way admixed Brazilian cohort. Conclusions Widespread use of PRS in the clinic is hampered by a lack of genotyping data in individuals of non-European ancestry for the vast majority of traits. Here we show that for a truly polygenic trait such as BMI adjusting the summary statistics of a well powered European ancestry study using trans-ethnic LD reference results in a score that is predictive across a range of ancestries including East Asians and three-way admixed Brazilians.

[1]  John J. Connolly,et al.  Improved genetic risk scoring algorithm for type 1 diabetes prediction , 2022, Pediatric diabetes.

[2]  G. Ginsburg,et al.  The International Hundred Thousand Plus Cohort Consortium: integrating large-scale cohorts to address global scientific challenges , 2020, The Lancet Digital Health.

[3]  Bjarni J. Vilhjálmsson,et al.  LDpred2: better, faster, stronger , 2020, bioRxiv.

[4]  M. Inouye,et al.  Towards clinical utility of polygenic risk scores. , 2019, Human molecular genetics.

[5]  B. Duncan,et al.  Context-dependence of race self-classification: Results from a highly mixed and unequal middle-income country , 2019, PloS one.

[6]  Alicia R. Martin,et al.  Clinical use of current polygenic risk scores may exacerbate health disparities , 2019, Nature Genetics.

[7]  J. Danesh,et al.  Efficiency and safety of varying the frequency of whole blood donation (INTERVAL): a randomised trial of 45 000 donors , 2017, The Lancet.

[8]  D. Meyer,et al.  A genomic perspective on HLA evolution , 2017, Immunogenetics.

[9]  P. Magnus,et al.  Cohort Profile Update: The Norwegian Mother and Child Cohort Study (MoBa). , 2016, International journal of epidemiology.

[10]  P. Visscher,et al.  Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores , 2015, bioRxiv.

[11]  X. Shu,et al.  Cohort Profile: The Shanghai Men's Health Study. , 2015, International journal of epidemiology.

[12]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[13]  Moyses Szklo,et al.  American Journal of Epidemiology Practice of Epidemiology Brazilian Longitudinal Study of Adult Health (elsa-brasil): Objectives and Design , 2022 .

[14]  W. Willett,et al.  Plasma 25-hydroxyvitamin D and risk of breast cancer in the Nurses' Health Study II , 2011, Breast Cancer Research.

[15]  J. Stephenson 1000 Genomes Project , 2008 .

[16]  J. Opitz,et al.  Obesity: Genetic, molecular, and environmental aspects , 2007, American journal of medical genetics. Part A.

[17]  N. Rothman,et al.  The Shanghai Women's Health Study: rationale, study design, and baseline characteristics. , 2005, American journal of epidemiology.

[18]  F. Balloux,et al.  Pathogen-Driven Selection and Worldwide HLA Class I Diversity , 2005, Current Biology.

[19]  Graham A. Colditz,et al.  The Nurses' Health Study: lifestyle and health among women , 2005, Nature Reviews Cancer.

[20]  B. Caballero The global epidemic of obesity: an overview. , 2007, Epidemiologic reviews.

[21]  Monte Carlo Integration Markov Chain Monte Carlo and Gibbs Sampling , 2002 .