Messenger RNA Polyadenylation Site Recognition in Green Alga Chlamydomonas Reinhardtii

Recognition of polyadenylation [poly(A)] sites for messenger RNA is important in genome annotation and gene expression regulation analysis In the paper, poly(A) sites of Chlamydomonas reinhardtii were identified using an updated version of poly(A) site recognition software PASS_VAR based on generalized hidden Markov model First, we analyzed the characteristics of the poly(A) sites and their surrounding sequence patterns, and used an entropy-based feature selection method to select important poly(A) signal patterns in conservative signal states Then we improved the existing poly(A) sites recognition software PASS that was initially designed only for Arabidopsis to make it suitable for different species Next, Chlamydomonas sequences were grouped according to their signal patterns and used to train the model parameters through mathematical statistics methods Finally, poly(A) sites were identified using PASS_VAR The efficacy of our model is showed up to 93% confidence with strong signals.

[1]  Shu-Kun Lin Diversity and Entropy , 1999, Entropy.

[2]  J. A. Buchheim,et al.  Regulation of flagellar length in Chlamydomonas. , 2008, Seminars in cell & developmental biology.

[3]  Guoli Ji,et al.  Genome level analysis of rice mRNA 3′-end processing signals and alternative polyadenylation , 2008, Nucleic acids research.

[4]  Xiaohui Wu,et al.  Modeling Plant mRNA Poly(A) Sites: Software Design and Implementation , 2007 .

[5]  B. Tian,et al.  Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation. , 2005, RNA.

[6]  D. Hovorun,et al.  Hypothetical Double‐Helical Poly(A) Formation in a Cell and Its Possible Biological Significance , 1999, IUBMB life.

[7]  Richard Durbin,et al.  A probabilistic model of 3' end formation in Caenorhabditis elegans. , 2004, Nucleic acids research.

[8]  W. Pernice,et al.  Finite-Difference Time-Domain Methods and Material Models for the Simulation of Metallic and Plasmonic Structures , 2010 .

[9]  Qingshun Quinn Li,et al.  Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary Structures1[w] , 2005, Plant Physiology.

[10]  D. Gautheret,et al.  Sequence determinants in human polyadenylation site selection , 2003, BMC Genomics.

[11]  W. Gish,et al.  Gene structure prediction and alternative splicing analysis using genomically aligned ESTs. , 2001, Genome research.

[12]  Xiaohui Wu,et al.  Predictive modeling of plant messenger RNA polyadenylation sites , 2007, BMC Bioinformatics.

[13]  Nick Proudfoot,et al.  New perspectives on connecting messenger RNA 3' end formation to transcription. , 2004, Current opinion in cell biology.

[14]  Stevo K. Jaćimovski,et al.  Statistical and Dynamical Equivalence of Different Elementary Cells , 2007 .

[15]  Jens Rupprecht,et al.  From systems biology to fuel--Chlamydomonas reinhardtii as a model for a systems biology approach to improve biohydrogen production. , 2009, Journal of biotechnology.

[16]  Miller Tran,et al.  Chlamydomonas reinhardtii chloroplasts as protein factories. , 2007, Current opinion in biotechnology.

[17]  G. Edwalds-Gilbert,et al.  Alternative poly(A) site selection in complex transcription units: means to an end? , 1997, Nucleic acids research.

[18]  Jack E. Tabaska,et al.  Detection of polyadenylation signals in human DNA sequences. , 1999, Gene.

[19]  Chun Liang,et al.  Unique Features of Nuclear mRNA Poly(A) Signals and Alternative Polyadenylation in Chlamydomonas reinhardtii , 2008, Genetics.