Sequana is a Python-based software dedicated to the development of Next Generation Sequencing (NGS) pipelines. We use the Snakemake (Köster and Rahmann 2012) framework to design our pipelines, which eases the decomposition of pipelines into modular sub-units. We currently have 7 pipelines covering quality control, variant calling, long-reads quality, de-novo and RNA-seq analysis (see https://sequana.readthedocs.io for details). Our pipelines are associated with HTML reports based on JINJA templating and Javascript. The reports are used to store the results of a pipelines but also materials required to reproduce the results. Sequana is also a Python library that provides tools to perform various analysis tasks (e.g., variant call filtering). Some of the library components provide original tools that are also available as standalone applications. For instance a fast taxonomic analysis based on Kraken (Wood and Salzberg 2014) as well as a tool to perform exhaustive coverage analysis (Desvillechabrol et al. 2016) (bottom right panel in the image here below).
[1]
Thomas Cokelaer,et al.
Detection and characterization of low and high genome coverage regions using an efficient running median and a double threshold approach
,
2016,
bioRxiv.
[2]
Jacques van Helden,et al.
Sequanix: a dynamic graphical interface for Snakemake workflows
,
2017,
bioRxiv.
[3]
Derrick E. Wood,et al.
Kraken: ultrafast metagenomic sequence classification using exact alignments
,
2014,
Genome Biology.
[4]
Sven Rahmann,et al.
Genome analysis
,
2022
.