论文信息 - Workshop: Transcriptome assembly from RNA-Seq data: Objectives, algorithms and challenges

Workshop: Transcriptome assembly from RNA-Seq data: Objectives, algorithms and challenges

The second generation sequencing technology revolutionizes many biology related research fields, and posts various computational biology challenges. One of them is transcriptome assembly based on RNA-Seq data, which aims at reconstructing all full-length mRNA transcripts (i.e., isoforms) simultaneously from millions of short reads. We propose three objectives in transcriptome assembly: the maximization of prediction accuracy, minimization of interpretation, and maximization of completeness. The first objective, the maximization of prediction accuracy, requires that the estimated expression levels based on assembled transcripts should be as close as possible to the observed ones for every expressed regions of the genome. The minimization of interpretation follows the parsimony principle to seek as few transcripts in the prediction as possible. The third objective, the maximization of completeness, requires that the maximum number of mapped reads (or “expressed segments” in gene models) be explained by (i.e., contained in) the predicted transcripts in the solution.

Tao Jiang | Wei Li | Jianxing Feng

[1] Tao Jiang,et al. Inference of Isoforms from Short Sequence Reads , 2010, RECOMB.

[2] C. Ponting,et al. Transcribed dark matter: meaning or myth? , 2010, Human molecular genetics.

[3] J. Rinn,et al. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature Biotechnology.

[4] Cole Trapnell,et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[5] Tao Jiang,et al. IsoLasso: A LASSO Regression Approach to RNA-Seq Based Transcriptome Assembly - (Extended Abstract) , 2011, RECOMB.

[6] J. Rinn,et al. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature biotechnology.