Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest

Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and –DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes.

[1]  B S Weir,et al.  HIBAG—HLA genotype imputation with attribute bagging , 2013, The Pharmacogenomics Journal.

[2]  H. Frangoul,et al.  Cost saving associated with implementing a stepwise approach to HLA typing of related donors before hematopoietic SCT , 2014, Bone Marrow Transplantation.

[3]  Alexander T. Dilthey,et al.  HLA*IMP - an integrated framework for imputing classical HLA alleles from SNP genotypes , 2011, Bioinform..

[4]  S. Factor,et al.  Association of Parkinson disease with structural and regulatory variants in the HLA region. , 2013, American journal of human genetics.

[5]  Scott M. Williams,et al.  challenges for genome-wide association studies , 2010 .

[6]  Steven J Mack,et al.  Balancing selection and heterogeneity across the classical human leukocyte antigen loci: a meta-analytic review of 497 population studies. , 2008, Human immunology.

[7]  G Thomson,et al.  MHC microsatellite diversity and linkage disequilibrium among common HLA-A, HLA-B, DRB1 haplotypes: implications for unrelated donor hematopoietic transplantation and disease association studies. , 2005, Tissue antigens.

[8]  Gudmundur A. Thorisson,et al.  The International HapMap Project Web site. , 2005, Genome research.

[9]  James Robinson,et al.  The IMGT/HLA database , 2008, Nucleic Acids Res..

[10]  C. Moore,et al.  Association between presence of HLA-B*5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir , 2002, The Lancet.

[11]  J. Gorski The HLA-DRw8 lineage was generated by a deletion in the DR B region followed by first domain diversification. , 1989, Journal of immunology.

[12]  Robert M. Plenge,et al.  Defining the Role of the MHC in Autoimmunity: A Review and Pooled Analysis , 2008, PLoS genetics.

[13]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[14]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[15]  Pardis C Sabeti,et al.  A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC , 2006, Nature Genetics.

[16]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[17]  Y. Okada,et al.  Predicting HLA alleles from high-resolution SNP data in three Southeast Asian populations. , 2014, Human molecular genetics.

[18]  Elizabeth Phillips,et al.  HLA and pharmacogenetics of drug hypersensitivity. , 2012, Pharmacogenomics.

[19]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[20]  Lue Ping Zhao,et al.  Empirical evaluations of analytical issues arising from predicting HLA alleles using multiple SNPs , 2011, BMC Genetics.

[21]  Sophie Palmer,et al.  Genetic Analysis of Completely Sequenced Disease-Associated MHC Haplotypes Identifies Shuffling of Segments in Recent Human History , 2006, PLoS genetics.

[22]  M. Feolo,et al.  HLA Diversity in the 1000 Genomes Dataset , 2014, PloS one.

[23]  Jonathan Scott Friedlaender,et al.  A Human Genome Diversity Cell Line Panel , 2002, Science.

[24]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[25]  J. Stockman HLA-A∗3101 and Carbamazepine-Induced Hypersensitivity Reactions in Europeans , 2012 .

[26]  W. Klitz,et al.  Common and well-documented HLA alleles: 2012 update to the CWD catalogue. , 2013, Tissue antigens.

[27]  Soumya Raychaudhuri,et al.  Interrogating the major histocompatibility complex with high-throughput genomics. , 2012, Human molecular genetics.

[28]  Yuan-Tsong Chen,et al.  Predicting HLA genotypes using unphased and flanking single-nucleotide polymorphisms in Han Chinese population , 2014, BMC Genomics.

[29]  W. Bodmer,et al.  Nomenclature for factors of the HLA system, 2010 , 2010, Tissue antigens.

[30]  I. James,et al.  Predisposition to abacavir hypersensitivity conferred by HLA-B*5701 and a haplotypic Hsp70-Hom variant , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[31]  G. Andersson,et al.  Evolution of the human HLA-DR region. , 1998, Frontiers in bioscience : a journal and virtual library.

[32]  H. Inoko,et al.  Gene Map of the HLA Region, Graves' Disease and Hashimoto Thyroiditis, and Hematopoietic Stem Cell Transplantation. , 2016, Advances in immunology.

[33]  W. Klitz,et al.  Polymorphism, recombination, and linkage disequilibrium within the HLA class II region. , 1992, Journal of immunology.

[34]  A. Begovich,et al.  HLA‐DR, DO AND DP TYPING USING PCR AMPLIFICATION AND IMMOBILIZED PROBES , 1991, European journal of immunogenetics : official journal of the British Society for Histocompatibility and Immunogenetics.

[35]  Bo Zhang,et al.  Predicting multiallelic genes using unphased and flanking single nucleotide polymorphisms , 2011, Genetic epidemiology.

[36]  Buhm Han,et al.  Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens , 2013, PloS one.

[37]  M. Iannuzzi,et al.  Performance of HLA allele prediction methods in African Americans for class II genes HLA-DRB1, −DQB1, and –DPB1 , 2014, BMC Genetics.

[38]  Edward J. Louis,et al.  New reservoirs of HLA alleles: pools of rare variants enhance immune defense. , 2012, Trends in genetics : TIG.

[39]  K. Anastos,et al.  Human leucocyte antigen class I and II imputation in a multiracial population , 2016, International journal of immunogenetics.

[40]  Jason H. Moore,et al.  BIOINFORMATICS REVIEW , 2005 .

[41]  M S Nieminen,et al.  Evaluation of HLA-DRB1 imputation using a Finnish dataset. , 2014, Tissue antigens.

[42]  Alexander T. Dilthey,et al.  Multi-Population Classical HLA Type Imputation , 2013, PLoS Comput. Biol..

[43]  J. Traherne,et al.  Review Article doi: 10.1111/j.1744-313X.2008.00765.x Blackwell , 2022 .

[44]  Steven G.E. Marsh,et al.  Nomenclature for factors of the HLA system , 1975 .

[45]  Peter Donnelly,et al.  A statistical method for predicting classical HLA alleles from SNP data. , 2008, American journal of human genetics.

[46]  B. Thiers HLA-B*5801 Allele as a Genetic Marker for Severe Cutaneous Adverse Reactions Caused by Allopurinol , 2006 .

[47]  H. Erlich,et al.  HLA DNA typing: past, present, and future. , 2012, Tissue antigens.

[48]  Miles Parkes,et al.  Genetic insights into common pathways and complex relationships among immune-mediated diseases , 2013, Nature Reviews Genetics.

[49]  N. Kamatani,et al.  High-accuracy imputation for HLA class I and II genes based on high-resolution SNP data of population-specific references , 2015, The Pharmacogenomics Journal.