An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar

How viruses evolve within hosts can dictate infection outcomes; however, reconstructing this process is challenging. We evaluate our multiplexed amplicon approach, PrimalSeq, to demonstrate how virus concentration, sequencing coverage, primer mismatches, and replicates influence the accuracy of measuring intrahost virus diversity. We develop an experimental protocol and a computational tool, iVar, for measuring virus diversity with PrimalSeq on the Illumina platform, and we compare the results to Oxford Nanopore sequencing. We demonstrate the utility of PrimalSeq by measuring Zika and West Nile virus diversity from varied sample types and show that the accumulation of genetic diversity is influenced by experimental and biological systems.
