Trainable context model for multiscale segmentation

Most previous approaches to Bayesian segmentation have used simple prior models, such as Markov random fields (MRF), to enforce regularity in the segmentation. While these methods improve classification accuracy, they are not well suited to modeling complex contextual structure. In this paper, we propose a context model for multiscale segmentation which can capture very complex behaviors on both local and global scales. Our method works by using binary classification trees to model the transition probabilities between segmentations at adjacent scales. The classification trees can be efficiently trained to model essential aspects of contextual behavior. In addition, the data model in our approach is novel in the sense that it can incorporate the correlation among the wavelet feature vectors across scales. We apply our method to the problem of document segmentation to illustrate its usefulness.

[1]  J. Allebach,et al.  Multiscale Document Segmentation , 1997 .

[2]  Edward J. Delp,et al.  Multiresolution image segmentation , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Patrick Pérez,et al.  Hierarchical statistical models for the fusion of multiresolution image data , 1995, Proceedings of IEEE International Conference on Computer Vision.

[4]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[5]  Robert D. Nowak,et al.  Wavelet-based statistical signal processing using hidden Markov models , 1998, IEEE Trans. Signal Process..

[6]  Robert M. Haralick,et al.  Document image understanding: geometric and logical layout , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Edward J. Delp,et al.  An Iterative Growing and Pruning Algorithm for Classification Tree Design , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Charles A. Bouman,et al.  A multiscale random field model for Bayesian image segmentation , 1994, IEEE Trans. Image Process..

[9]  Fabrice Heitz,et al.  Hierarchical statistical models for the fusion of multiresolution image data , 1995, Optics + Photonics.