Measuring Ambiguity in HLA Typing Methods

In hematopoietic stem cell transplantation, donor selection is based primarily on matching donor and patient HLA genes. These genes are highly polymorphic and their typing can result in exact allele assignment at each gene (the resolution at which patients and donors are matched), but it can also result in a set of ambiguous assignments, depending on the typing methodology used. To facilitate rapid identification of matched donors, registries employ statistical algorithms to infer HLA alleles from ambiguous genotypes. Linkage disequilibrium information encapsulated in haplotype frequencies is used to facilitate prediction of the most likely haplotype assignment. An HLA typing with less ambiguity produces fewer high-probability haplotypes and a more reliable prediction. We estimated ambiguity for several HLA typing methods across four continental populations using an information theory-based measure, Shannon's entropy. We used allele and haplotype frequencies to calculate entropy for different sets of 1,000 subjects with simulated HLA typing. Using allele frequencies we calculated an average entropy in Caucasians of 1.65 for serology, 1.06 for allele family level, 0.49 for a 2002-era SSO kit, and 0.076 for single-pass SBT. When using haplotype frequencies in entropy calculations, we found average entropies of 0.72 for serology, 0.73 for allele family level, 0.05 for SSO, and 0.002 for single-pass SBT. Application of haplotype frequencies further reduces HLA typing ambiguity. We also estimated expected confirmatory typing mismatch rates for simulated subjects. In a hypothetical registry with all donors typed using the same method, the entropy values based on haplotype frequencies correspond to confirmatory typing mismatch rates of 1.31% for SSO versus only 0.08% for SBT. Intermediate-resolution single-pass SBT contains the least ambiguity of the methods we evaluated and therefore the most certainty in allele prediction. The presented measure objectively evaluates HLA typing methods and can help define acceptable HLA typing for donor recruitment.

[1]  Nabil El-Kadhi,et al.  Inferred HLA haplotype information for donors from hematopoietic stem cells donor registries. , 2005, Human immunology.

[2]  W R Mayr,et al.  Nomenclature for factors of the HLA system, 2004 , 2005, Tissue antigens.

[3]  Xin Huang,et al.  Sequencing genes in silico using single nucleotide polymorphisms , 2011, BMC Genetics.

[4]  Joshua S. Paul,et al.  Genotype and SNP calling from next-generation sequencing data , 2011, Nature Reviews Genetics.

[5]  Jerzy K. Kulski,et al.  The HLA genomic loci map: expression, interaction, diversity and disease , 2009, Journal of Human Genetics.

[6]  Tomer Hertz,et al.  Identifying HLA supertypes by learning distance functions , 2007, Bioinform..

[7]  David Heckerman,et al.  Statistical Resolution of Ambiguous HLA Typing Data , 2008, PLoS Comput. Biol..

[8]  E. Thorsby,et al.  HLA associated genetic predisposition to autoimmune diseases: Genes involved and possible mechanisms. , 2005, Transplant immunology.

[9]  Takato O. Yoshida,et al.  EFFECT OF MATCHING OF CLASS I HLA ALLELES ON CLINICAL OUTCOME AFTER TRANSPLANTATION OF HEMATOPOIETIC STEM CELLS FROM AN UNRELATED DONOR , 1998 .

[10]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[11]  S G Marsh,et al.  Going back to the roots: effective utilisation of HLA typing information for bone marrow registries requires full knowledge of the DNA sequences of the oligonucleotide reagents used in the testing. , 2000, Tissue antigens.

[12]  James Robinson,et al.  The IMGT/HLA database , 2008, Nucleic Acids Res..

[13]  Michael Cullen,et al.  An integrated haplotype map of the human major histocompatibility complex. , 2003, American journal of human genetics.

[14]  N Morling,et al.  HLA and disease. , 1977, Transplantation proceedings.

[15]  Effie W Petersdorf,et al.  MHC Haplotype Matching for Unrelated Hematopoietic Cell Transplantation , 2007, PLoS medicine.

[16]  S G Marsh,et al.  The HLA dictionary 2001: a summary of HLA-A, -B, -C, -DRB1/3/4/5, -DQB1 alleles and their association with serologically defined HLA-A, -B, -C, -DR, and -DQ antigens. , 2001, Human immunology.

[17]  Stephan Beck,et al.  A high-resolution linkage-disequilibrium map of the human major histocompatibility complex and first generation of tag single-nucleotide polymorphisms. , 2005, American journal of human genetics.

[18]  M Setterholm,et al.  The HLA dictionary 2008: a summary of HLA-A, -B, -C, -DRB1/3/4/5, and -DQB1 alleles and their association with serologically defined HLA-A, -B, -C, -DR, and -DQ antigens. , 2009, Tissue antigens.

[19]  Sankar K. Pal,et al.  Generalized Rough Sets, Entropy, and Image Ambiguity Measures , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[20]  C. Skibola,et al.  Multi-locus HLA class I and II allele and haplotype associations with follicular lymphoma. , 2012, Tissue antigens.

[21]  Loren Gragert,et al.  High-resolution HLA alleles and haplotypes in the United States population. , 2007, Human immunology.

[22]  Loren Gragert,et al.  Estimation of HLA-A, -B, -DRB1 haplotype frequencies using mixed resolution data from a National Registry with selective retyping of volunteers. , 2007, Human immunology.

[23]  Effie W Petersdorf,et al.  Nomenclature for Factors of the HLA System, 2004. , 2005, Human immunology.

[24]  D. Monos,et al.  Large-scale DRB and DQB1 oligonucleotide typing for the NMDP registry: progress report from year 2. , 1996, Tissue antigens.

[25]  Charles F. Hockett,et al.  A mathematical theory of communication , 1948, MOCO.

[26]  C. Hurley,et al.  Definitions of histocompatibility typing terms. , 2011, Blood.

[27]  M Setterholm,et al.  Hematopoietic stem cell donor registry strategies for assigning search determinants and matching relationships , 2004, Bone Marrow Transplantation.

[28]  H. Kim,et al.  HLA Haplotyping from RNA-seq Data Using Hierarchical Read Weighting , 2013, PloS one.

[29]  M. Tilanus,et al.  A computerized method to predict the discriminatory properties for class II sequencing based typing. , 1996, Human immunology.

[30]  H. Inoko,et al.  The clinical significance of human leukocyte antigen (HLA) allele compatibility in patients receiving a marrow transplant from serologically HLA-A, HLA-B, and HLA-DR matched unrelated donors. , 2002, Blood.

[31]  D. Monos,et al.  Large-scale DNA-based typing of HLA-A and HLA-B at low resolution is highly accurate specific and reliable. , 2000, Tissue antigens.

[32]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[33]  D. Monos,et al.  Large-scale oligonucleotide typing for HLA-DRB1/3/4 and HLA-DQB1 is highly accurate, specific, and reliable. , 1993, Tissue antigens.

[34]  Robert M. Plenge,et al.  Defining the Role of the MHC in Autoimmunity: A Review and Pooled Analysis , 2008, PLoS genetics.

[35]  Pardis C Sabeti,et al.  A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC , 2006, Nature Genetics.

[36]  Thomas D. Schneider,et al.  A brief review of molecular information theory , 2010, Nano Commun. Networks.

[37]  R Higuchi,et al.  High-resolution, high-throughput HLA genotyping by next-generation sequencing. , 2009, Tissue antigens.

[38]  M Maiers,et al.  World Marrow Donor Association guidelines for use of HLA nomenclature and its validation in the data exchange among hematopoietic stem cell donor registries and cord blood banks , 2007, Bone Marrow Transplantation.

[39]  M Setterholm,et al.  Maintaining updated DNA-based HLA assignments in the National Marrow Donor Program Bone Marrow Registry. , 2000, Reviews in immunogenetics.

[40]  M. Torres,et al.  Nomenclature for factors of the HLA system. , 2011, Bulletin of the World Health Organization.

[41]  G. Longton,et al.  The significance of HLA-DRB1 matching on clinical outcome after HLA-A, B, DR identical unrelated donor marrow transplantation. , 1995, Blood.

[42]  J. Bodmer,et al.  IMGT/HLA Database - a sequence database for the human major histocompatibility complex , 2000, Nucleic Acids Res..

[43]  A. Maritan,et al.  Using the principle of entropy maximization to infer genetic interaction networks from gene expression patterns , 2006, Proceedings of the National Academy of Sciences.

[44]  S. Schuster Next-generation sequencing transforms today's biology , 2008, Nature Methods.

[45]  S. Krishnakumar,et al.  High-throughput, high-fidelity HLA genotyping with deep sequencing , 2012, Proceedings of the National Academy of Sciences.