Next-generation sequencing can reveal in vitro-generated PCR crossover products: some artifactual sequences correspond to HLA alleles in the IMGT/HLA database.

The high-resolution human leukocyte antigen (HLA) genotyping assay that we developed using 454 sequencing and Conexio software uses generic polymerase chain reaction (PCR) primers for DRB exon 2. Occasionally, we observed low-abundance DRB amplicon sequences that resulted from in vitro PCR 'crossing over' between DRB1 and DRB3/4/5. These hybrid sequences, revealed by the clonal sequencing property of the 454 system, were generally observed at a read depth of 5%-10% of that of the true alleles. They usually contained at least one mismatch with the IMGT/HLA database and were therefore easily recognizable and did not cause a problem for HLA genotyping. Sometimes, however, these artifactual sequences matched a rare allele, and the automatic genotype assignment was incorrect. These observations raised two issues: (1) could PCR conditions be modified to reduce such artifacts, and (2) could some of the rare alleles listed in the IMGT/HLA database be artifacts rather than true alleles? Because PCR crossing over occurs during the late cycles of PCR, we compared DRB genotypes resulting from 28 cycles and from our standard 35 cycles of PCR. Crossover products were detected in all 21 cell line DNAs amplified for 35 cycles. In 33% of the cases, these hybrid sequences corresponded to named alleles. With amplification for only 28 cycles, these artifactual sequences were not detectable. To investigate whether some rare alleles in the IMGT/HLA database might be due to PCR artifacts, we analyzed four samples obtained from the investigators who submitted the sequences. In three cases, the sequences were generated from true alleles. In one case, our 454 sequencing revealed an error in the previously submitted sequence.
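As an illustration of what a crossover product looks like at the sequence level, the following minimal sketch flags a read that matches one parental sequence up to some breakpoint and a second parental sequence after it. It is not the authors' pipeline or the Conexio software: the function name `find_crossover`, the assumption of equal-length aligned sequences, and the toy sequences (which are not real HLA alleles) are all hypothetical.

```python
from itertools import permutations


def find_crossover(read, parents):
    """Return (left_name, right_name, breakpoint) if `read` is the start of
    one parent joined to the end of another parent at the same position.

    Returns None if the read equals a parent or cannot be explained as a
    single crossover. Assumes all sequences are aligned and of equal length
    (a simplifying assumption for this sketch).
    """
    if read in parents.values():
        return None  # exact match to a parental sequence, not a hybrid
    for (left_name, left), (right_name, right) in permutations(parents.items(), 2):
        if not (len(left) == len(right) == len(read)):
            continue
        # Try every internal breakpoint; report the first consistent one.
        for bp in range(1, len(read)):
            if read[:bp] == left[:bp] and read[bp:] == right[bp:]:
                return left_name, right_name, bp
    return None


# Toy sequences, NOT real HLA alleles (hypothetical illustration only).
parents = {
    "toy_DRB1": "ACGTACGTAAAA",
    "toy_DRB3": "ACGTTCGTCCCC",
}
hybrid = parents["toy_DRB1"][:6] + parents["toy_DRB3"][6:]
unrelated = "ACGTACGTCCCG"  # one extra mismatch, so no single crossover fits

print(find_crossover(hybrid, parents))
print(find_crossover(unrelated, parents))
```

Because the parental sequences can be identical around the point of template switching, the true breakpoint is only determined up to that shared stretch; the sketch simply reports the first position that is consistent with both parents.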
