Recent advances in high-throughput sequencing technology made it possible to probe the cell transcriptomes by generating hundreds of millions of short reads which represent the fragments of the transcribed RNA molecules. The first and the most crucial task in the RNA-seq data analysis is mapping of the reads to the reference genome. STAR (Spliced Transcripts Alignment to a Reference) is an RNA-seq mapper that performs highly accurate spliced sequence alignment at an ultrafast speed. STAR alignment algorithm can be controlled by many user-defined parameters. Here, we describe the most important STAR options and parameters, as well as best practices for achieving the maximum mapping accuracy and speed.
[1]
Paul Theodor Pyl,et al.
HTSeq—a Python framework to work with high-throughput sequencing data
,
2014,
bioRxiv.
[2]
Thomas R. Gingeras,et al.
STAR: ultrafast universal RNA-seq aligner
,
2013,
Bioinform..
[3]
Colin N. Dewey,et al.
RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome
,
2011,
BMC Bioinformatics.
[4]
Gonçalo R. Abecasis,et al.
The Sequence Alignment/Map format and SAMtools
,
2009,
Bioinform..
[5]
L. Pachter,et al.
Streaming fragment assignment for real-time analysis of sequencing experiments
,
2012,
Nature Methods.