Fuzzy Multilevel Graph Embedding: for Recognition, Indexing and Retrieval of Graphic Document Images

This thesis addresses the problem of lack of efficient computational tools for graph based structural pattern recognition approaches and proposes to exploit computational strength of statistical pattern recognition. It has two fold contributions. The first contribution is a new method of explicit graph embedding. The proposed graph embedding method exploits multilevel analysis of graph for extracting graph level information, structural level information and elementary level information from graphs. It embeds this information into a numeric feature vector. The method employs fuzzy overlapping trapezoidal intervals for addressing the noise sensitivity of graph representations and for minimizing the information loss while mapping from continuous graph space to discrete vector space. The method has unsupervised learning abilities and is capable of automatically adapting its parameters to underlying graph dataset. The second contribution is a framework for automatic indexing of graph repositories for graph retrieval and subgraph spotting. This framework exploits explicit graph embedding for representing the cliques of order 2 by numeric feature vectors, together with classification and clustering tools for automatically indexing a graph repository. It does not require a labeled learning set and can be easily deployed to a range of application domains, offering ease of query by example (QBE) and granularity of focused retrieval.

[1]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[2]  Jean-Yves Ramel,et al.  Employing fuzzy intervals and loop-based methodology for designing structural signature: an application to symbol recognition , 2010, ArXiv.

[3]  Tomomasa Nagashima,et al.  Detection of Cluster Boundary in Microarray Data by Reference to MIPS Functional Catalogue Database , 2005 .

[4]  Jean-Yves Ramel,et al.  Vector Representation of Graphs: Application to the Classification of Symbols and Letters , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[5]  Robert P. W. Duin,et al.  The Dissimilarity Representation for Pattern Recognition - Foundations and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[6]  Edwin R. Hancock,et al.  Pattern Vectors from Algebraic Graph Theory , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Tony Jebara,et al.  Structure preserving embedding , 2009, ICML '09.

[8]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[9]  Jean-Yves Ramel,et al.  Vers une approche floue d'encapsulation de graphes: application à la reconnaissance de symboles , 2010 .

[10]  Ali Shokoufandeh,et al.  Indexing hierarchical structures using graph spectra , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Edwin R. Hancock,et al.  A Riemannian approach to graph embedding , 2007, Pattern Recognit..

[12]  Jean-Yves Ramel,et al.  A Vectorial Representation for the Indexation of Structural Informations , 2008, SSPR/SPR.

[13]  George Lee,et al.  Semi-Supervised Graph Embedding Scheme with Active Learning (SSGEAL): Classifying High Dimensional Biomedical Data , 2010, PRIB.

[14]  Jean-Yves Ramel,et al.  Dimensionality Reduction for Fuzzy-Interval Based Explicit Graph Embedding , 2011 .

[15]  Jean-Yves Ramel,et al.  Subgraph Spotting through Explicit Graph Embedding: An Application to Content Spotting in Graphic Document Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[16]  Jean-Yves Ramel,et al.  Fuzzy multilevel graph embedding , 2013, Pattern Recognit..

[17]  Jean-Yves Ramel,et al.  Graphic Symbol Recognition Using Graph Based Signature and Bayesian Network Classifier , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[18]  Kaspar Riesen,et al.  Graph Classification and Clustering Based on Vector Space Embedding , 2010, Series in Machine Perception and Artificial Intelligence.

[19]  Edwin R. Hancock,et al.  Spectral embedding of graphs , 2003, Pattern Recognit..

[20]  Daniel P. Lopresti,et al.  A fast technique for comparing graph representations with applications to performance evaluation , 2003, Document Analysis and Recognition.

[21]  Jean-Yves Ramel,et al.  Embedding labeled graphs into occurrence matrix , 2009 .

[22]  Kaspar Riesen,et al.  Graph Embedding in Vector Spaces by Means of Prototype Selection , 2007, GbRPR.

[23]  Horst Bunke,et al.  Edit distance-based kernel functions for structural pattern classification , 2006, Pattern Recognit..

[24]  Kaspar Riesen,et al.  Graph Classification Based on Vector Space Embedding , 2009, Int. J. Pattern Recognit. Artif. Intell..

[25]  Horst Bunke,et al.  Error-Correcting Graph Isomorphism Using Decision Trees , 1998, Int. J. Pattern Recognit. Artif. Intell..

[26]  Ernest Valveny,et al.  Symbol Recognition: Current Advances and Perspectives , 2001, GREC.

[27]  E. Reingold,et al.  Combinatorial Algorithms: Theory and Practice , 1977 .

[28]  Yannis Manolopoulos,et al.  Structure-based similarity search with graph histograms , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[29]  Salvatore Tabbone,et al.  Musings on Symbol Recognition , 2005, GREC.

[30]  Kaspar Riesen,et al.  IAM Graph Database Repository for Graph Based Pattern Recognition and Machine Learning , 2008, SSPR/SPR.

[31]  Anna Maria Radzikowska,et al.  A comparative study of fuzzy rough sets , 2002, Fuzzy Sets Syst..

[32]  Jean-Yves Ramel,et al.  A Content Spotting System for Line Drawing Graphic Document Images , 2010, 2010 20th International Conference on Pattern Recognition.

[33]  Jean-Yves Ramel,et al.  Design of Evolutionary Methods Applied to the Learning of Bayesian Network Structures , 2010 .

[34]  Prachi Mukherji,et al.  Combination of Symbolic and Statistical Features for Symbols Recognition , 2007, 2007 International Conference on Signal Processing, Communications and Networking.

[35]  Luc De Raedt,et al.  Feature Construction with Version Spaces for Biochemical Applications , 2001, ICML.

[36]  Horst Bunke,et al.  Automatic learning of cost functions for graph edit distance , 2007, Inf. Sci..

[37]  Joachim M. Buhmann,et al.  Optimal Cluster Preserving Embedding of Nonmetric Proximity Data , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  Jean-Yves Ramel,et al.  A Fuzzy-Interval Based Approach for Explicit Graph Embedding , 2010, ICPR Contests.

[39]  King-Sun Fu,et al.  Error-Correcting Isomorphisms of Attributed Relational Graphs for Pattern Analysis , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[40]  Jean-Yves Ramel,et al.  Fuzzy Intervals for Designing Structural Signature: An Application to Graphic Symbol Recognition , 2009, GREC.

[41]  Derek G. Corneil,et al.  The graph isomorphism disease , 1977, J. Graph Theory.

[42]  Josep Lladós,et al.  Symbol recognition using graphs , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[43]  H. Bunke,et al.  A probabilistic approach to learning costs for graph edit distance , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[44]  Mohamed A. Ismail,et al.  A new algorithm for subgraph optimal isomorphism , 1998, Pattern Recognit..