Toward a fine-scale population health monitoring system

we propose a data from electronic health (EHRs) in concert with genomic data to explore the demographic ties that can impact disease burdens. Using data from a diverse biobank in New York City, we identified 17 communities sharing recent genetic ancestry. We observed 1,177 health outcomes that were statistically associated with a specific group and demonstrated significant differences in the segregation of genetic variants contributing to Mendelian diseases. We also demonstrated that fine-scale population structure can impact the prediction of complex disease risk within groups. This work reinforces the utility of linking genomic data to EHRs and provides a framework toward monitoring of population

[1]  David S Jones,et al.  Hidden in Plain Sight - Reconsidering the Use of Race Correction in Clinical Algorithms. , 2020, The New England journal of medicine.

[2]  S. Naicker,et al.  APOL1 Genetic Variants Are Associated with Serum-Oxidized Low-Density Lipoprotein Levels and Subclinical Atherosclerosis in South African CKD Patients , 2020, Nephron.

[3]  Scott M. Williams,et al.  Association of preeclampsia with infant APOL1 genotype in African Americans , 2020, BMC Medical Genetics.

[4]  C. Winkler,et al.  Maternal variants within the apolipoprotein L1 gene are associated with preeclampsia in a South African cohort of African ancestry. , 2020, European journal of obstetrics, gynecology, and reproductive biology.

[5]  C. Winkler,et al.  APOL1 Nephropathy Risk Alleles and Mortality in African American Adults: A Cohort Study. , 2020, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[6]  B. Young,et al.  Genetics and ESKD Disparities in African Americans. , 2019, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[7]  Judy H. Cho,et al.  Exome sequencing reveals a high prevalence of BRCA1 and BRCA2 founder variants in a diverse population-based biobank , 2019, Genome Medicine.

[8]  José Luis Ambite,et al.  Rapid detection of identity-by-descent tracts for mega-scale datasets , 2019, Nature Communications.

[9]  A. Philippakis,et al.  The "All of Us" Research Program. , 2019, The New England journal of medicine.

[10]  C. Rodriguez,et al.  High blood pressure in Hispanics in the United States: a review. , 2019, Current opinion in cardiology.

[11]  S. Naicker,et al.  JC Virus and APOL1 Risk Alleles in Black South Africans With Hypertension-Attributed CKD , 2019, Kidney international reports.

[12]  Thomas S. Redding,et al.  Racial/Ethnic Disparities in BRCA Counseling and Testing: a Narrative Review , 2019, Journal of Racial and Ethnic Health Disparities.

[13]  Alicia R. Martin,et al.  Clinical use of current polygenic risk scores may exacerbate health disparities , 2019, Nature Genetics.

[14]  E. Kenny,et al.  Personalized Medicine and the Power of Electronic Health Records , 2019, Cell.

[15]  Jeffrey Braithwaite,et al.  Integrating Genomics into Healthcare: A Global Responsibility. , 2019, American journal of human genetics.

[16]  A. Kostic,et al.  Understanding the growing epidemic of type 2 diabetes in the Hispanic population living in the United States , 2018, Diabetes/metabolism research and reviews.

[17]  Christopher R. Gignoux,et al.  Worldwide Frequencies of APOL1 Renal Risk Variants. , 2018, The New England journal of medicine.

[18]  Francisco M. De La Vega,et al.  Polygenic risk scores: a biased prediction? , 2018, Genome Medicine.

[19]  Joshua C. Denny,et al.  Developing and Evaluating Mappings of ICD-10 and ICD-10-CM Codes to Phecodes , 2018, bioRxiv.

[20]  P. Donnelly,et al.  The UK Biobank resource with deep phenotyping and genomic data , 2018, Nature.

[21]  N. Risch,et al.  The Clinical Sequencing Evidence-Generating Research Consortium: Integrating Genomic Sequencing in Diverse and Medically Underserved Populations. , 2018, American journal of human genetics.

[22]  Mary E. Haas,et al.  Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations , 2018, Nature Genetics.

[23]  C. Sabatti,et al.  Understanding the Hidden Complexity of Latin American Population Isolates , 2018, bioRxiv.

[24]  N. Schneiderman,et al.  Ancestry-specific recent effective population size in the Americas , 2018, PLoS genetics.

[25]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[26]  B. Trivedi Medicine's future? , 2017, Science.

[27]  Alicia R. Martin,et al.  Haplotype sharing provides insights into fine-scale population history and disease in Finland , 2017, bioRxiv.

[28]  Alexander E. Lopez,et al.  Profiling and leveraging relatedness in a precision medicine cohort of 92,455 exomes , 2017, bioRxiv.

[29]  David Reich,et al.  The promise of disease gene discovery in South Asia , 2017, Nature Genetics.

[30]  Janina M. Jeff,et al.  Genetic identification of a common collagen disease in Puerto Ricans via identity-by-descent mapping in a health system , 2017, bioRxiv.

[31]  Christopher R. Gignoux,et al.  Human demographic history impacts genetic risk prediction across diverse populations , 2016, bioRxiv.

[32]  Kathleen F. Kerr,et al.  African Ancestry-Specific Alleles and Kidney Disease Risk in Hispanics/Latinos. , 2017, Journal of the American Society of Nephrology : JASN.

[33]  Ross E. Curtis,et al.  Clustering of 770,000 genomes reveals post-colonial population structure of North America , 2017, Nature Communications.

[34]  D. Bianchi,et al.  Noninvasive Prenatal DNA Testing: The Vanguard of Genomic Medicine. , 2017, Annual review of medicine.

[35]  Yusuke Nakamura,et al.  Cancer Precision Medicine: From Cancer Screening to Drug Selection and Personalized Immunotherapy. , 2017, Trends in pharmacological sciences.

[36]  Marylyn D. Ritchie,et al.  Distribution and clinical impact of functional variants in 50,726 whole-exome sequences from the DiscovEHR study , 2016, Science.

[37]  G. Canino,et al.  Asthma in Puerto Ricans: Lessons from a high-risk population. , 2016, The Journal of allergy and clinical immunology.

[38]  Zachary A. Szpiech,et al.  A continuum of admixture in the Western Hemisphere revealed by the African Diaspora genome , 2016, Nature Communications.

[39]  Christopher R. Gignoux,et al.  Making Precision Medicine Socially Precise. Take a Deep Breath. , 2016, American journal of respiratory and critical care medicine.

[40]  Yali Xue,et al.  BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data , 2016, Bioinform..

[41]  Brian L Browning,et al.  Accurate Non-parametric Estimation of Recent Effective Population Size from Segments of Identity by Descent. , 2015, American journal of human genetics.

[42]  M. Criqui,et al.  Cuban Americans have the highest rates of peripheral arterial disease in diverse Hispanic/Latino communities. , 2015, Journal of vascular surgery.

[43]  C. Sabatti,et al.  Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.

[44]  Urvashi Surti,et al.  Recent advances of genomic testing in perinatal medicine. , 2015, Seminars in perinatology.

[45]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[46]  Elissa V. Klinger,et al.  Accuracy of Race, Ethnicity, and Language Preference in an Electronic Health Record , 2015, Journal of General Internal Medicine.

[47]  Ross M. Fraser,et al.  A General Approach for Haplotype Phasing across the Full Spectrum of Relatedness , 2014, PLoS genetics.

[48]  E. Krishnan,et al.  Filipino Gout: A Review , 2014, Arthritis care & research.

[49]  Barry I. Freedman,et al.  APOL1 risk variants, race, and progression of chronic kidney disease. , 2013, The New England journal of medicine.

[50]  N. Arber,et al.  The APC p.I1307K polymorphism is a significant risk factor for CRC in average risk Ashkenazi Jews. , 2013, European journal of cancer.

[51]  C. Bustamante,et al.  RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference. , 2013, American journal of human genetics.

[52]  Jake K. Byrnes,et al.  Reconstructing the Population Genetic History of the Caribbean , 2013, PLoS genetics.

[53]  E. Thompson Identity by Descent: Variation in Meiosis, Across Genomes, and in Populations , 2013, Genetics.

[54]  Fan Wang,et al.  APC polymorphisms and the risk of colorectal neoplasia: a HuGE review and meta-analysis. , 2013, American journal of epidemiology.

[55]  J. Sollano,et al.  Hepatitis B infection among adults in the philippines: A national seroprevalence study. , 2013, World journal of hepatology.

[56]  Irving E. Vega,et al.  Frequency and clinicopathological characteristics of presenilin 1 Gly206Ala mutation in Puerto Rican Hispanics with dementia. , 2013, Journal of Alzheimer's disease : JAD.

[57]  Brian L Browning,et al.  Identity by descent between distant relatives: detection and applications. , 2012, Annual review of genetics.

[58]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[59]  R. Collins What makes UK Biobank special? , 2012, The Lancet.

[60]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[61]  D. Falush,et al.  Inference of Population Structure using Dense Haplotype Data , 2012, PLoS genetics.

[62]  I. Haq,et al.  ABRAHAM'S CHILDREN , 2012 .

[63]  David E Frost,et al.  All of us. , 2011, Journal of oral and maxillofacial surgery : official journal of the American Association of Oral and Maxillofacial Surgeons.

[64]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[65]  Martin Rosvall,et al.  Multilevel Compression of Random Walks on Networks Reveals Hierarchical Organization in Large Integrated Systems , 2010, PloS one.

[66]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[67]  C. Koebnick,et al.  Health plan administrative records versus birth certificate records: quality of race and ethnicity information in children , 2010, BMC health services research.

[68]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[69]  R. Desnick,et al.  Type 1 Gaucher disease: significant disease manifestations in "asymptomatic" homozygotes. , 2010, Archives of internal medicine.

[70]  Marylyn D. Ritchie,et al.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations , 2010, Bioinform..

[71]  Á. Carracedo,et al.  The Garífuna (Black Carib) people of the Atlantic coasts of Honduras: Population dynamics, structure, and phylogenetic relations inferred from genetic data, migration matrices, and isonymy , 2010, American journal of human biology : the official journal of the Human Biology Council.

[72]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[73]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[74]  Cédric Villani,et al.  The Wasserstein distances , 2009 .

[75]  Alexander Gusev,et al.  Whole population, genome-wide mapping of hidden relatedness. , 2009, Genome research.

[76]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[77]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[78]  D. Reich,et al.  MYH9 is associated with nondiabetic end-stage renal disease in African Americans , 2008, Nature Genetics.

[79]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[80]  A. Lahad,et al.  Carrier screening for Gaucher disease: lessons for low-penetrance, treatable diseases. , 2007, JAMA.

[81]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[82]  K. Marsh,et al.  Sickle cell disease in Africa: burden and research priorities , 2007, Annals of tropical medicine and parasitology.

[83]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[84]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[85]  M. Cheang,et al.  A Population-Based Case Control Study of Potential Risk Factors for IBD , 2006, The American Journal of Gastroenterology.

[86]  N. Risch,et al.  Estimation of individual admixture: Analytical and study design considerations , 2005, Genetic epidemiology.

[87]  Kenneth Offit,et al.  Functional and genomic approaches reveal an ancient CHEK2 allele associated with breast cancer in the Ashkenazi Jewish population. , 2005, Human molecular genetics.

[88]  N. Risch,et al.  The importance of race and ethnic background in biomedical research and clinical practice. , 2003, The New England journal of medicine.

[89]  R. Cooper,et al.  Race and genomics. , 2003, The New England journal of medicine.

[90]  N. Freimer,et al.  Genetic demography of Antioquia (Colombia) and the Central Valley of Costa Rica , 2003, Human Genetics.

[91]  Jonathan Scott Friedlaender,et al.  A Human Genome Diversity Cell Line Panel , 2002, Science.

[92]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[93]  D. Mannino,et al.  Asthma mortality in U.S. Hispanics of Mexican, Puerto Rican, and Cuban heritage, 1990-1995. , 2000, American journal of respiratory and critical care medicine.

[94]  M. King,et al.  Genomic views of human history. , 1999, Science.

[95]  P. Gergen,et al.  Reported asthma among Puerto Rican, Mexican-American, and Cuban children, 1982 through 1984. , 1993, American journal of public health.

[96]  P. Menozzi,et al.  Synthetic maps of human gene frequencies in Europeans. , 1978, Science.