Comparing Bitmapped microRNA Structure Images using Mutual Symmetry

We present a high-throughput method for analyzing large-scale bitmapped bio-data: processing of elongated molecular structures by 2D images and analyzing their shapes for chemoinformatics databases. Two-dimensional structure databases are transfered to bitmap images — a commonly used visualization widely spread online. Then, an efficient clustering of the molecular structures is achieved by a mutual symmetry-based binary matrix representation of the shapes. We present a method to compute the difference between two of such representations and evaluate its performance with respect to time and quality of matching. In our tests we use two bitmap databases, one containing true human microRNA folded 2D structures and one with claimed human microRNA folded 2D structures. We show the stability of the matching with respect to parameterization and orientation of the shapes. Our method enables a good automatic clustering of structures with high visual similarity.

[1]  Peter F Stadler,et al.  Fast and reliable prediction of noncoding RNAs , 2005, Proc. Natl. Acad. Sci. USA.

[2]  Robert Giegerich,et al.  Pure multiple RNA secondary structure alignments: a progressive profile approach , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  N. Kasabov,et al.  Classifying MicroRNAs by Gabor Filter Features from 2D Structure Bitmap Images on a Case Study of Human MicroRNAs , 2005 .

[4]  H. Blum Biological shape and visual science (part I) , 1973 .

[5]  R. Unger,et al.  RNAMAT: an efficient method to detect classes of RNA molecules and their structural features , 2004, The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[6]  Kaleem Siddiqi,et al.  A shock grammar for recognition , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[8]  PaperNo Recognition of shapes by editing shock graphs , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9]  Arjan Kuijper,et al.  Describing and Matching 2D Shapes by Their Points of Mutual Symmetry , 2006, ECCV.

[10]  Philip N. Klein,et al.  Shock-Based Indexing into Large Shape Databases , 2002, ECCV.

[11]  Ali Shokoufandeh,et al.  Shock Graphs and Shape Matching , 1998, International Journal of Computer Vision.

[12]  Arjan Kuijper,et al.  Computationally efficient matching of MicroRNA shapes using mutual symmetry , 2007, SIP.

[13]  Robert Giegerich,et al.  A comprehensive comparison of comparative RNA structure prediction approaches , 2004, BMC Bioinformatics.

[14]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[15]  Martial Mermillod,et al.  Coarse scales are sufficient for efficient categorization of emotional facial expressions: Evidence from neural computation , 2010, Neurocomputing.

[16]  D. Banville Mining chemical structural information from the drug literature. , 2006, Drug discovery today.

[17]  Philip Bille,et al.  From a 2D Shape to a String Structure Using the Symmetry Set , 2004, ECCV.

[18]  Arjan Kuijper,et al.  Alternative 2D Shape Representations using the Symmetry Set , 2006, Journal of Mathematical Imaging and Vision.

[19]  Shaoning Pang,et al.  Image and Fractal Information Processing for Large-Scale Chemoinformatics, Genomics Analyses and Pattern Discovery , 2006, PRIB.

[20]  Arjan Kuijper,et al.  An automatic cell segmentation method for differential interference contrast microscopy , 2008, 2008 19th International Conference on Pattern Recognition.

[21]  Arjan Kuijper,et al.  Deriving the Medial Axis with geometrical arguments for planar shapes , 2007, Pattern Recognit. Lett..

[22]  Arjan Kuijper,et al.  On the Local Form and Transitions of Pre-symmetry Sets , 2012, Journal of Mathematical Imaging and Vision.

[23]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[24]  Nikola K. Kasabov,et al.  Soft computing methods to predict gene regulatory networks: An integrative approach on time-series gene expression data , 2008, Appl. Soft Comput..

[25]  L. Johann Cell Segmentation Using the Level Set Method , 2007 .

[26]  Bart M. ter Haar Romeny,et al.  Front-End Vision and Multi-Scale Image Analysis , 2003, Computational Imaging and Vision.

[27]  P. Hogeweg,et al.  Pattern analysis of RNA secondary structure similarity and consensus of minimal-energy folding. , 1989, Journal of molecular biology.

[28]  Kaleem Siddiqi,et al.  Matching Hierarchical Structures Using Association Graphs , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Benjamin B. Kimia,et al.  On the role of medial geometry in human vision , 2003, Journal of Physiology-Paris.

[30]  Guillermo Sapiro,et al.  An image processing approach to computing distances between RNA secondary structures dot plots , 2009, Algorithms for Molecular Biology.

[31]  Philip Bille,et al.  A survey on tree edit distance and related problems , 2005, Theor. Comput. Sci..

[32]  O. F. Olsen,et al.  Transitions of the pre-symmetry set , 2004, ICPR 2004.

[33]  R. Ravi,et al.  Computing Similarity between RNA Strings , 1996, CPM.

[34]  Kaizhong Zhang,et al.  Comparing multiple RNA secondary structures using tree comparisons , 1990, Comput. Appl. Biosci..

[35]  Frédéric Cao,et al.  Geometric curve evolution and image processing , 2003, Lecture notes in mathematics.

[36]  P. Stadler,et al.  Mapping of conserved RNA secondary structures predicts thousands of functional noncoding RNAs in the human genome , 2005, Nature Biotechnology.

[37]  Miroslaw Bober,et al.  Curvature Scale Space Representation: Theory, Applications, and MPEG-7 Standardization , 2011, Computational Imaging and Vision.

[38]  Mohammed J. Zaki Efficiently mining frequent trees in a forest: algorithms and applications , 2005, IEEE Transactions on Knowledge and Data Engineering.

[39]  A. Vinogradov Noncoding DNA, isochores and gene expression: nucleosome formation potential , 2005, Nucleic acids research.

[40]  Arjan Kuijper,et al.  On Extending Symmetry Sets for 2D Shapes , 2004, SSPR/SPR.

[41]  J. Mattick,et al.  Small regulatory RNAs in mammals. , 2005, Human molecular genetics.

[42]  Guojun Lu,et al.  Review of shape representation and description techniques , 2004, Pattern Recognit..

[43]  Oliviero Carugo,et al.  CX, DPX and PRIDE: WWW servers for the analysis and comparison of protein 3D structures , 2005, Nucleic Acids Res..

[44]  Arjan Kuijper,et al.  Clustered Cell Segmentation - Based on Iterative Voting and the Level Set Method , 2008, VISAPP.