A Short‐Read Multiplex Sequencing Method for Reliable, Cost‐Effective and High‐Throughput Genotyping in Large‐Scale Studies

Accurate genotyping is important for genetic testing. Sanger sequencing‐based typing is the gold standard for genotyping, but it has been underused, due to its high cost and low throughput. In contrast, short‐read sequencing provides inexpensive and high‐throughput sequencing, holding great promise for reaching the goal of cost‐effective and high‐throughput genotyping. However, the short‐read length and the paucity of appropriate genotyping methods, pose a major challenge. Here, we present RCHSBT—reliable, cost‐effective and high‐throughput sequence based typing pipeline—which takes short sequence reads as input, but uses a unique variant calling, haploid sequence assembling algorithm, can accurately genotype with greater effective length per amplicon than even Sanger sequencing reads. The RCHSBT method was tested for the human MHC loci HLA‐A, HLA‐B, HLA‐C, HLA‐DQB1, and HLA‐DRB1, upon 96 samples using Illumina PE 150 reads. Amplicons as long as 950 bp were readily genotyped, achieving 100% typing concordance between RCHSBT‐called genotypes and genotypes previously called by Sanger sequence. Genotyping throughput was increased over 10 times, and cost was reduced over five times, for RCHSBT as compared with Sanger sequence genotyping. We thus demonstrate RCHSBT to be a genotyping method comparable to Sanger sequencing‐based typing in quality, while being more cost‐effective, and higher throughput.

[1]  S. Hayashi,et al.  Rapid and Specific Genotyping System for Hepatitis B Virus Corresponding to Six Major Genotypes by PCR Using Type-Specific Primers , 2001, Journal of Clinical Microbiology.

[2]  Helene Polin,et al.  Rapid, scalable and highly automated HLA genotyping using next-generation sequencing: a transition from research to diagnostics , 2013, BMC Genomics.

[3]  P. Terasaki,et al.  Comprehensive Method for the Typing of HLA‐A, B, and C Alleles by Direct Sequencing of PCR Products Obtained from Genomic DNA , 1997, Journal of immunotherapy.

[4]  S. Priori,et al.  Genetic testing in the long QT syndrome: development and validation of an efficient approach to genotyping in clinical practice. , 2005, JAMA.

[5]  S. Lank,et al.  A novel single cDNA amplicon pyrosequencing method for high-throughput, cost-effective sequence-based HLA class I genotyping. , 2010, Human immunology.

[6]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[7]  Xiang-min Xu,et al.  Carrier screening for alpha- and beta-thalassemia in pregnancy: the results of an 11-year prospective program in Guangzhou Maternal and Neonatal hospital. , 2005, Prenatal diagnosis.

[8]  J. Shendure,et al.  Exome sequencing as a tool for Mendelian disease gene discovery , 2011, Nature Reviews Genetics.

[9]  S. Krishnakumar,et al.  High-throughput, high-fidelity HLA genotyping with deep sequencing , 2012, Proceedings of the National Academy of Sciences.

[10]  Xiang-min Xu,et al.  Carrier screening for α‐ and β‐thalassemia in pregnancy: the results of an 11‐year prospective program in Guangzhou Maternal and Neonatal hospital , 2005 .

[11]  Ryan W. Kim,et al.  Carrier Testing for Severe Childhood Recessive Diseases by Next-Generation Sequencing , 2011, Science Translational Medicine.

[12]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[13]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[14]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[15]  R Higuchi,et al.  High-resolution, high-throughput HLA genotyping by next-generation sequencing. , 2009, Tissue antigens.

[16]  Gilles Caraux,et al.  A 454 multiplex sequencing method for rapid and reliable genotyping of highly polymorphic genes in large-scale studies , 2010, BMC Genomics.

[17]  J. Dungan Association Between Carrier Screening and Incidence of Cystic Fibrosis , 2010 .

[18]  C. Peyton,et al.  Genotyping of 27 Human Papillomavirus Types by Using L1 Consensus PCR Products by a Single-Hybridization, Reverse Line Blot Detection Method , 1998, Journal of Clinical Microbiology.

[19]  Helene Polin,et al.  Rapid high-throughput human leukocyte antigen typing by massively parallel pyrosequencing for high-resolution allele identification. , 2009, Human immunology.

[20]  Stephen J O'Brien,et al.  The influence of HLA genotype on AIDS. , 2003, Annual review of medicine.

[21]  M. Danzer,et al.  Characterisation of two novel HLA alleles, HLA-Cw*0429 and HLA-DRB3*0223. , 2008, Tissue antigens.

[22]  K. Voelkerding,et al.  Next-generation sequencing: from basic research to diagnostics. , 2009, Clinical chemistry.

[23]  D. Monos,et al.  Large-scale oligonucleotide typing for HLA-DRB1/3/4 and HLA-DQB1 is highly accurate, specific, and reliable. , 1993, Tissue antigens.

[24]  S. Odelberg,et al.  Template-switching during DNA synthesis by Thermus aquaticus DNA polymerase I. , 1995, Nucleic acids research.

[25]  Matthew W. Anderson,et al.  A multi-site study using high-resolution HLA genotyping by next generation sequencing. , 2011, Tissue antigens.

[26]  M. Danzer,et al.  High-throughput sequence-based typing strategy for HLA-DRB1 based on real-time polymerase chain reaction. , 2007, Human immunology.