SUMMARY
Next-generation sequencing technologies produce short reads that are either de novo assembled or mapped to a reference genome. Genotypes and/or single-nucleotide polymorphisms are then determined from the read composition at each site, which become the basis for many downstream analyses. However, for low sequencing depths, e.g. , there is considerable statistical uncertainty in the assignment of genotypes because of random sampling of homologous base pairs in heterozygotes and sequencing or alignment errors. Recently, several probabilistic methods have been proposed to account for this uncertainty and make accurate inferences from low quality and/or coverage sequencing data. We present ngsTools, a collection of programs to perform population genetics analyses from next-generation sequencing data. The methods implemented in these programs do not rely on single-nucleotide polymorphism or genotype calling and are particularly suitable for low sequencing depth data.
AVAILABILITY
Programs included in ngsTools are implemented in C/C++ and are freely available for noncommercial use at https://github.com/mfumagalli/ngsTools.
CONTACT
mfumagalli82@gmail.com
SUPPLEMENTARY INFORMATION
Supplementary materials are available at Bioinformatics online.
[1]
R. Nielsen,et al.
Quantifying Population Genetic Differentiation from Next-Generation Sequencing Data
,
2013,
Genetics.
[2]
A. Fujiyama,et al.
A map of rice genome variation reveals the origin of cultivated rice
,
2012,
Nature.
[3]
R. Nielsen,et al.
Estimating inbreeding coefficients from NGS data: Impact on genotype calling and allele frequency estimation
,
2013,
Genome research.
[4]
Jun Wang,et al.
SNP Calling, Genotype Calling, and Sample Allele Frequency Estimation from New-Generation Sequencing Data
,
2012,
PloS one.