HLAscan: genotyping of the HLA region using next-generation sequencing data

BackgroundSeveral recent studies showed that next-generation sequencing (NGS)-based human leukocyte antigen (HLA) typing is a feasible and promising technique for variant calling of highly polymorphic regions. To date, however, no method with sufficient read depth has completely solved the allele phasing issue. In this study, we developed a new method (HLAscan) for HLA genotyping using NGS data.ResultsHLAscan performs alignment of reads to HLA sequences from the international ImMunoGeneTics project/human leukocyte antigen (IMGT/HLA) database. The distribution of aligned reads was used to calculate a score function to determine correctly phased alleles by progressively removing false-positive alleles. Comparative HLA typing tests using public datasets from the 1000 Genomes Project and the International HapMap Project demonstrated that HLAscan could perform HLA typing more accurately than previously reported NGS-based methods such as HLAreporter and PHLAT. In addition, the results of HLA-A, −B, and -DRB1 typing by HLAscan using data generated by NextGen were identical to those obtained using a Sanger sequencing–based method. We also applied HLAscan to a family dataset with various coverage depths generated on the Illumina HiSeq X-TEN platform. HLAscan identified allele types of HLA-A, −B, −C, −DQB1, and -DRB1 with 100% accuracy for sequences at ≥ 90× depth, and the overall accuracy was 96.9%.ConclusionsHLAscan, an alignment-based program that takes read distribution into account to determine true allele types, outperformed previously developed HLA typing tools. Therefore, HLAscan can be reliably applied for determination of HLA type across the whole-genome, exome, and target sequences.

[1]  H. Erlich,et al.  HLA DNA typing: past, present, and future. , 2012, Tissue antigens.

[2]  James Robinson,et al.  The IPD and IMGT/HLA database: allele variant databases , 2014, Nucleic Acids Res..

[3]  Richard A. Moore,et al.  Derivation of HLA types from shotgun sequence datasets , 2012, Genome Medicine.

[4]  P. Sham,et al.  HLAreporter: a tool for HLA typing from next generation sequencing data , 2015, Genome Medicine.

[5]  J. Knight,et al.  Major histocompatibility complex genomics and human disease. , 2013, Annual review of genomics and human genetics.

[6]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[7]  M. Carrington,et al.  High-resolution patterns of meiotic recombination across the human major histocompatibility complex. , 2002, American journal of human genetics.

[8]  Helene Polin,et al.  Rapid, scalable and highly automated HLA genotyping using next-generation sequencing: a transition from research to diagnostics , 2013, BMC Genomics.

[9]  M. Ni,et al.  Inference of high resolution HLA types using genome-wide RNA or DNA sequencing reads , 2014, BMC Genomics.

[10]  F. Christiansen,et al.  The genetic basis for the association of the 8.1 ancestral haplotype (A1, B8, DR3) with multiple immunopathological diseases , 1999, Immunological reviews.

[11]  Szilveszter Juhos,et al.  HLA Typing from 1000 Genomes Whole Genome and Whole Exome Illumina Data , 2013, PloS one.

[12]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[13]  M. H. Park,et al.  HLA‐A, ‐B and ‐DRB1 polymorphism in Koreans defined by sequence‐based typing of 4128 cord blood units , 2013, International journal of immunogenetics.

[14]  M. Zody,et al.  ATHLATES: accurate typing of human leukocyte antigen through exome sequencing , 2013, Nucleic acids research.

[15]  N. Lennon,et al.  Next-generation sequencing for HLA typing of class I loci , 2011, BMC Genomics.

[16]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[17]  Ituro Inoue,et al.  The impact of next-generation sequencing technologies on HLA research , 2015, Journal of Human Genetics.

[18]  P. Dunn Human leucocyte antigen typing: techniques and technology, a critical appraisal , 2011, International journal of immunogenetics.

[19]  Simon C. Potter,et al.  Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia , 2013, Nature Genetics.

[20]  C. Harding,et al.  Pathways of antigen processing. , 1991, Current opinion in immunology.

[21]  Diogo Meyer,et al.  The Relevance of HLA Sequencing in Population Genetics Studies , 2014, Journal of immunology research.

[22]  S. Krishnakumar,et al.  High-throughput, high-fidelity HLA genotyping with deep sequencing , 2012, Proceedings of the National Academy of Sciences.

[23]  Pardis C Sabeti,et al.  A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC , 2006, Nature Genetics.