A Novel ncRNA Gene Prediction Approach Based on Fuzzy Neural Networks with Structure Learning

Discovering ncRNA genes is a challenging problem, which has attracted much attention recently. The accuracy of computational ncRNA prediction methods still needs to be improved, however, due to the diversity and the lack of consensus patterns of ncRNA genes. In this paper, we propose an effective computational approach based on fuzzy neural networks with structure learning (FNNSL) for novel ncRNA gene prediction. It has advantages such as explicit physical meanings of nodes and parameters in the network, and effective incorporation of prior knowledge by the fuzzy sets theory. Specifically, a structure learning algorithm is presented to decrease parameter dimensions, enhance the computational efficiency, and avoid the over-learning. In addition, a fuzzy c-means clustering method is adopted for fuzzy partitioning of input feature variables, and the corresponding implementations are compared to the other ncRNA gene prediction tools. The improved prediction accuracy demonstrates the effectiveness of the proposed approach.

[1]  Z. D. Deng,et al.  A fully dynamical fuzzy neural network , 1997, 1997 IEEE International Conference on Intelligent Processing Systems (Cat. No.97TH8335).

[2]  Sun Zengqi,et al.  A fuzzy neural network and its application to controls , 1996 .

[3]  Masaru Tomita,et al.  Identification and expression analysis of putative mRNA‐like non‐coding RNA in Drosophila , 2005, Genes to cells : devoted to molecular & cellular mechanisms.

[4]  Saman K. Halgamuge,et al.  Protein motif extraction with neuro-fuzzy optimization , 2002, Bioinform..

[5]  I. Hofacker,et al.  Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics. , 2004, Journal of molecular biology.

[6]  B. Barrell,et al.  Genome-wide discovery and verification of novel structured RNAs in Plasmodium falciparum. , 2008, Genome research.

[7]  James S. Albus,et al.  New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)1 , 1975 .

[8]  Peter F Stadler,et al.  Fast and reliable prediction of noncoding RNAs , 2005, Proc. Natl. Acad. Sci. USA.

[9]  R. Durbin,et al.  RNA sequence analysis using covariance models. , 1994, Nucleic acids research.

[10]  Kay Nieselt,et al.  Comparative analysis of structured RNAs in S. cerevisiae indicates a multitude of different functions , 2007, BMC Biology.

[11]  Howard Y. Chang,et al.  Functional Demarcation of Active and Silent Chromatin Domains in Human HOX Loci by Noncoding RNAs , 2007, Cell.

[12]  Csaba Pal,et al.  Differential impact of simultaneous migration on coevolving hosts and parasites , 2007, BMC Evolutionary Biology.

[13]  M. Sugeno,et al.  Derivation of Fuzzy Control Rules from Human Operator's Control Actions , 1983 .

[14]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[15]  D. Ecker,et al.  RNAMotif, an RNA secondary structure definition and search algorithm. , 2001, Nucleic acids research.

[16]  Elena Rivas,et al.  Secondary structure alone is generally not statistically significant for the detection of noncoding RNAs , 2000, Bioinform..

[17]  G. Storz An Expanding Universe of Noncoding RNAs , 2002, Science.

[18]  M. Gerstein,et al.  Structured Rnas in the Encode Selected Regions of the Human Genome , 2022 .

[19]  Tao Jiang,et al.  Computational prediction of novel non-coding RNAs in Arabidopsis thaliana , 2009, BMC Bioinformatics.

[20]  J. Steitz,et al.  The expanding universe of noncoding RNAs. , 2006, Cold Spring Harbor symposia on quantitative biology.

[21]  Mark Gerstein,et al.  Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human , 2007, BMC Evolutionary Biology.

[22]  Zhidong Deng,et al.  A fuzzy neural network and its application to controls , 1996, Artif. Intell. Eng..

[23]  Diego di Bernardo,et al.  ddbRNA: detection of conserved secondary structures in multiple alignments , 2003, Bioinform..

[24]  Peifa Jia,et al.  A neural-fuzzy BOXES control system with reinforcement learning and its applications to inverted pendulum , 1995, 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century.

[25]  Elena Rivas,et al.  Noncoding RNA gene detection using comparative sequence analysis , 2001, BMC Bioinformatics.

[26]  Byung-Jun Yoon,et al.  An overview of the role of context-sensitive HMMS in the prediction of NCRNA genes , 2005, IEEE/SP 13th Workshop on Statistical Signal Processing, 2005.

[27]  Rolf Backofen,et al.  Inferring Noncoding RNA Families and Classes by Means of Genome-Scale Structure-Based Clustering , 2007, PLoS Comput. Biol..

[28]  S. Eddy,et al.  Noncoding RNA genes identified in AT-rich hyperthermophiles , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[29]  I-Min A. Dubchak,et al.  A computational approach to identify genes for functional RNAs in genomic sequences. , 2001, Nucleic acids research.