Learning patterns from images by combining soft decisions and hard decisions

We present a novel approach for learning patterns (sub-images) shared by multiple images without prior knowledge about the number and the positions of the patterns in the images. The patterns may undergo kinds of rigid and non-rigid transformations. To reduce the searching space, the images are pre-segmented and represented by attribute relation graphs (ARGs). The problem is then formulated as learning the isomorphic subgraph, called pattern ARG (PARG) from multiple sample ARGs (SARG) with regard to the attribute similarity and the relation similarity. An inexact graph-matching algorithm is proposed to establish the correspondence between each SARG and the PARG. Inexact graph matching and model editing based on Bayes' decision rule are incorporated into Generalized Expectation and Maximization (GEM) algorithm. The modified GEM algorithm combines soft decisions and hard decisions together to learn both the appearance and the structure of the PARG. In the experiments, the learned PARG successfully captures the appearance and spatial information of the concept shared by the images.

[1]  King-Sun Fu,et al.  Subgraph error-correcting isomorphisms for syntactic pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Robert M. Haralick,et al.  Structural Descriptions and Inexact Matching , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Shinji Umeyama,et al.  An Eigendecomposition Approach to Weighted Graph Matching Problems , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  B. Huet,et al.  Inexact graph retrieval , 1999, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL'99).

[5]  Thomas S. Huang,et al.  Extracting the Recurring Patterns from Image , 2001 .

[6]  Salih O. Duffuaa,et al.  A Linear Programming Approach for the Weighted Graph Matching Problem , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  King-Sun Fu,et al.  A Step Towards Unification of Syntactic and Statistical Pattern Recognition , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  M. Krcmár,et al.  Application of genetic algorithms in graph matching , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[9]  William J. Christmas,et al.  Structural Matching in Computer Vision Using Probabilistic Relaxation , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Ali N. Akansu,et al.  A Graph-Based Object Description for Information Retrieval in Digital Image and Video Libraries , 2002, J. Vis. Commun. Image Represent..

[11]  Stan Z. Li,et al.  Matching: Invariant to translations, rotations and scale changes , 1992, Pattern Recognit..

[12]  Daniel P. Huttenlocher,et al.  Image segmentation using local variation , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[13]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[14]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[15]  Birgit Ueberreiter,et al.  Pattern Recognition by Graph Matching - Combinatorial versus continuous Optimization , 1988, Int. J. Pattern Recognit. Artif. Intell..

[16]  Edwin R. Hancock,et al.  Structural Matching by Discrete Relaxation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[18]  Olivier D. Faugeras,et al.  Shape Matching of Two-Dimensional Objects , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Brendan J. Frey,et al.  Transformed component analysis: joint estimation of spatial transformations and image components , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[20]  W. Eric L. Grimson,et al.  A framework for learning query concepts in image classification , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[21]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.