Application of Nanopore Sequencing for High Throughput Genotyping in Horses

Simple Summary Detection and genotyping of genetic variants across genomes has several applications that include, e.g., identification of genetic background of phenotypic traits, detection of disease-related variation, and analysis of population genetic structure. In this study, we attempt to develop a Nanopore sequencing-based genotyping strategy (with MinION system from Oxford Nanopore) that allows simple and cost-efficient genome-wide analysis in horse species. With this method, we generated 28,426 polymorphisms that were genotyped with high accuracy, with a level of error not exceeding 3%. The method can be further improved to increase the number of detected variants and improve their reliability by increasing the sequencing depth. Abstract Nanopore sequencing is a third-generation biopolymer sequencing technique that relies on monitoring the changes in an electrical current that occur as nucleic acids are passed through a protein nanopore. Increasing quality of reads generated by nanopore sequencing systems encourages their application in genome-wide polymorphism detection and genotyping. In this study, we employed nanopore sequencing to identify genome-wide polymorphisms in the horse genome. To reduce the size and complexity of genome fragments for sequencing in a simple and cost-efficient manner, we amplified random DNA fragments using a modified DOP-PCR and sequenced the resulting products using the MinION system. After initial filtering, this generated 28,426 polymorphisms, which were validated at a 3% error rate. Upon further filtering for polymorphism and reproducibility, we identified 9495 SNPs that reflected the horse population structure. To conclude, the use of nanopore sequencing, in conjunction with a genome enrichment step, is a promising tool that can be practical in a variety of applications, including genotyping, population genomics, association studies, linkage mapping, and potentially genomic selection.

[1]  T. Ott,et al.  Long‐read genotyping with SLANG (Simple Long‐read loci Assembly of Nanopore data for Genotyping) , 2022, Applications in plant sciences.

[2]  Kiichi Hirota,et al.  Rapid detection of single nucleotide polymorphisms using the MinION nanopore sequencer: a feasibility study for perioperative precision medicine , 2022, JA Clinical Reports.

[3]  A. Scorilas,et al.  Third-Generation Sequencing: The Spearhead towards the Radical Transformation of Modern Genomics , 2021, Life.

[4]  K. Au,et al.  Nanopore sequencing technology, bioinformatics and applications , 2021, Nature Biotechnology.

[5]  J. Nicolas,et al.  Sequencing DNA with nanopores: Troubles and biases , 2021, PloS one.

[6]  Kristoffer Sahlin,et al.  Error correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis , 2021, Nature Communications.

[7]  T. Szmatoła,et al.  Linkage disequilibrium, haplotype blocks and historical effective population size in Arabian horses and selected Polish native horse breeds , 2020 .

[8]  G. Spangenberg,et al.  Assessment of low-coverage nanopore long read sequencing for SNP genotyping in doubled haploid canola (Brassica napus L.) , 2019, Scientific Reports.

[9]  Nobuaki Kono,et al.  Nanopore sequencing: Review of potential applications in functional genomics , 2019, Development, growth & differentiation.

[10]  Reidar Andreson,et al.  Primer3_masker: integrating masking of template sequence with primer design software , 2018, Bioinform..

[11]  Barton E Slatko,et al.  Overview of Next‐Generation Sequencing Technologies , 2018, Current protocols in molecular biology.

[12]  Jia Gu,et al.  fastp: an ultra-fast all-in-one FASTQ preprocessor , 2018, bioRxiv.

[13]  T. Szmatoła,et al.  Genotyping-by-sequencing performance in selected livestock species. , 2018, Genomics.

[14]  Jin-Xian Liu,et al.  StructureSelector: A web‐based software to select and visualize the optimal number of clusters using multiple methods , 2018, Molecular ecology resources.

[15]  M. A. Machado,et al.  Parentage assignment using SNP markers, inbreeding and population size for the Brazilian Red Sindhi cattle , 2017 .

[16]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[17]  Jean-Simon Brouard,et al.  Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation , 2017, BMC Genetics.

[18]  K. Ropka-Molik,et al.  Characteristics of runs of homozygosity in selected cattle breeds maintained in Poland , 2016 .

[19]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, bioRxiv.

[20]  M. Jakobsson,et al.  Clumpak: a program for identifying clustering modes and packaging population structure inferences across K , 2015, Molecular ecology resources.

[21]  M. Maćkowski,et al.  Genetic diversity in Hucul and Polish primitive horse breeds , 2015 .

[22]  G. M. Macbeth,et al.  NeEstimator v2: re‐implementation of software for the estimation of contemporary effective population size (Ne) from genetic data , 2014, Molecular ecology resources.

[23]  Sunday O. Peters,et al.  Genotyping-by-Sequencing (GBS): A Novel, Efficient and Cost-Effective Genotyping Method for Cattle Using Next-Generation Sequencing , 2013, PloS one.

[24]  P. Sachdev,et al.  Gray matter atrophy patterns of mild cognitive impairment subtypes , 2012, Journal of the Neurological Sciences.

[25]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[26]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[27]  Dawn H. Nagel,et al.  The B73 Maize Genome: Complexity, Diversity, and Dynamics , 2009, Science.

[28]  M. Stephens,et al.  Inferring weak population structure with the assistance of sample group information , 2009, Molecular ecology resources.

[29]  M. Goddard,et al.  Mapping genes for complex traits in domestic animals and their use in breeding programmes , 2009, Nature Reviews Genetics.

[30]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[31]  G. Evanno,et al.  Detecting the number of clusters of individuals using the software structure: a simulation study , 2005, Molecular ecology.

[32]  N. Carter,et al.  Degenerate oligonucleotide-primed PCR: general amplification of target DNA by a single degenerate primer. , 1992, Genomics.

[33]  M. Goddard,et al.  Genomic selection: A paradigm shift in animal breeding , 2016 .

[34]  Khalid Rehman Hakeem,et al.  Plant Omics: Trends and Applications , 2016, Springer International Publishing.