A Simple Process of RNA-Sequence Analyses by Hisat2, Htseq and DESeq2

High throughput RNA sequencing is now a commonly used technique. And how to process and analysis the mass data has been the trend in Bioscience. Recently many researchers have to concentrate on more accurate and efficient method. The data in this paper are published by Stephen T. Smale's team. This article we introduce a general workflow to analysis the raw data. And we also show some basic and useful analysis tools for each step to visualize the analysis results. The differential expression analysis can be used in many different research directions. Many other tools have different features and advantages, choosing the right tools can help us get the more effective results.

[1]  Charlotte Soneson,et al.  A comparison of methods for differential expression analysis of RNA-seq data , 2013, BMC Bioinformatics.

[2]  Thomas E. Royce,et al.  Toward a universal microarray: prediction of gene expression through nearest-neighbor probe sequence identification , 2007, Nucleic acids research.

[3]  C. Clemens,et al.  Into the unknown: expression profiling without genome sequence information in CHO by next generation sequencing , 2010, Nucleic acids research.

[4]  Eun Ji Kim,et al.  Simulation-based comprehensive benchmarking of RNA-seq aligners , 2016, Nature Methods.

[5]  Paul A. Bates,et al.  Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis , 2006, BMC Bioinformatics.

[6]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[7]  Paul Theodor Pyl,et al.  HTSeq—a Python framework to work with high-throughput sequencing data , 2014, bioRxiv.

[8]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[9]  Brandon J Thomas,et al.  A Stringent Systems Approach Uncovers Gene-Specific Mechanisms Regulating Inflammation , 2016, Cell.

[10]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[11]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[12]  David R. Kelley,et al.  Corrigendum: Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2014 .

[13]  D. J. Wheeler,et al.  A Block-sorting Lossless Data Compression Algorithm , 1994 .

[14]  Michal J. Okoniewski,et al.  Hybridization interactions between probesets in short oligo microarrays lead to spurious correlations , 2006, BMC Bioinformatics.

[15]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[16]  Simon Anders,et al.  Analysing RNA-Seq data with the DESeq package , 2011 .

[17]  Giovanni Manzini,et al.  Opportunistic data structures with applications , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.