ViDiT-CACTUS: an inexpensive and versatile library preparation and sequence analysis method for virus discovery and other microbiology applications.

High-throughput sequencing (HTS) technologies are becoming increasingly important in microbiology research, but aspects of library preparation, such as high cost per sample or strict input requirements, make HTS difficult to implement in some niche applications and for research groups on a budget. To address these needs, we developed ViDiT, a customizable, PCR-based, extremely low-cost (less than US$5 per sample), and versatile library preparation method, and CACTUS, an analysis pipeline designed to rely on cloud computing power to generate high-quality data from ViDiT-based experiments without the need for expensive servers. We demonstrate here the versatility and utility of these methods in three fields of microbiology: virus discovery, amplicon-based viral genome sequencing, and microbiome profiling. ViDiT-CACTUS allowed the identification of viral fragments from 25 different viral families in 36 oropharyngeal-cloacal swabs collected from wild birds, the sequencing of three nearly complete genomes of avian influenza A viruses (>90% coverage), and the characterization and functional profiling of the complete microbial diversity (bacteria, archaea, viruses) within a deep-sea carnivorous sponge. ViDiT-CACTUS proved valid across a wide range of microbiology applications, and its simplicity and modularity make it easy to implement in any molecular biology laboratory for a variety of research goals.
