A Robust Regression Approach for Background/Foreground Segmentation

Background/foreground segmentation has a lot of applications in image and video processing. In this paper, a segmentation algorithm is proposed which is mainly designed for text and line extraction in screen content. The proposed method makes use of the fact that the background in each block is usually smoothly varying and can be modeled well by a linear combination of a few smoothly varying basis functions, while the foreground text and graphics create sharp discontinuity. The algorithm separates the background and foreground pixels by trying to fit pixel values in the block into a smooth function using a robust regression method. The inlier pixels that can fit well will be considered as background, while remaining outlier pixels will be considered foreground. This algorithm has been extensively tested on several images from HEVC standard test sequences for screen content coding, and is shown to have superior performance over other methods, such as the k-means clustering based segmentation algorithm in DjVu. This background/foreground segmentation can be used in different applications such as: text extraction, separate coding of background and foreground for compression of screen content and mixed content documents, principle line extraction from palmprint and crease detection in fingerprint images.

[1]  Alexander Shapiro,et al.  On the asymptotics of constrained local M-estimators , 2000 .

[2]  Edward K. Wong,et al.  Check image compression using a layered coding method , 1998, J. Electronic Imaging.

[3]  Michael Lindenbaum,et al.  Sequential Karhunen-Loeve basis extraction and its application to images , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[4]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[5]  Yoshua Bengio,et al.  High quality document image compression with "DjVu" , 1998, J. Electronic Imaging.

[6]  Pengwei Hao,et al.  Compound image compression for real-time computer screen image transmission , 2005, IEEE Transactions on Image Processing.

[7]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, SPIE Optics + Photonics.

[8]  Christian Jutten,et al.  Outlier-aware dictionary learning for sparse representation , 2014, 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP).

[9]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[10]  Wei Wang,et al.  Advanced Screen Content Coding Using Color Table and Index Map , 2014, IEEE Transactions on Image Processing.

[11]  Ming Xu,et al.  Mixed raster content (MRC) model for compound image compression , 1998, Electronic Imaging.

[12]  Christian Jutten,et al.  Sequential subspace finding: A new algorithm for learning low-dimensional linear subspaces , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[13]  George Atia,et al.  Randomized Robust Subspace Recovery for High Dimensional Data Matrices , 2015, ArXiv.

[14]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[15]  Shervin Minaee,et al.  Multispectral Palmprint Recognition Using a Hybrid Feature , 2011, ArXiv.

[16]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Jan-Michael Frahm,et al.  A Comparative Analysis of RANSAC Techniques Leading to Adaptive Real-Time Random Sample Consensus , 2008, ECCV.

[18]  Touradj Ebrahimi,et al.  Christopoulos: Thc Jpeg2000 Still Image Coding System: an Overview the Jpeg2000 Still Image Coding System: an Overview , 2022 .