Screen content image segmentation using sparse decomposition and total variation minimization

Sparse decomposition has been widely used for different applications, such as source separation, image classification, image denoising and more. This paper presents a new algorithm for segmentation of an image into background and foreground text and graphics using sparse decomposition and total variation minimization. The proposed method is designed based on the assumption that the background part of the image is smoothly varying and can be represented by a linear combination of a few smoothly varying basis functions, while the foreground text and graphics can be modeled with a sparse component overlaid on the smooth background. The background and foreground are separated using a sparse decomposition framework regularized with a few suitable regularization terms which promotes the sparsity and connectivity of foreground pixels. This algorithm has been tested on a dataset of images extracted from HEVC standard test sequences for screen content coding, and is shown to have superior performance over some prior methods, including least absolute deviation fitting, k-means clustering based segmentation in DjVu and shape primitive extraction and coding (SPEC) algorithm.

[1]  Shervin Minaee,et al.  Iris recognition using scattering transform and textural features , 2015, 2015 IEEE Signal Processing and Signal Processing Education Workshop (SP/SPE).

[2]  Babak Hossein Khalaj,et al.  A geometric approach to fully automatic chromosome segmentation , 2011, 2014 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[3]  Ali Taalimi,et al.  Online multi-modal task-driven dictionary learning and robust joint sparse representation for visual tracking , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[4]  Wotao Yin,et al.  An Iterative Regularization Method for Total Variation-Based Image Restoration , 2005, Multiscale Model. Simul..

[5]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[6]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[7]  Patrick L. Combettes,et al.  Signal Recovery by Proximal Forward-Backward Splitting , 2005, Multiscale Model. Simul..

[8]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[9]  Thomas S. Huang,et al.  Supervised translation-invariant sparse coding , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Jing Zhang,et al.  Extraction of Text Objects in Video Documents: Recent Progress , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[11]  Shervin Minaee,et al.  Screen content image segmentation using sparse-smooth decomposition , 2015, 2015 49th Asilomar Conference on Signals, Systems and Computers.

[12]  Hairong Qi,et al.  Non-intrusive load monitoring of HVAC components using signal unmixing , 2015, 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[13]  Pengwei Hao,et al.  Compound image compression for real-time computer screen image transmission , 2005, IEEE Transactions on Image Processing.

[14]  Baocai Yin,et al.  Screen Content Coding Based on HEVC Framework , 2014, IEEE Transactions on Multimedia.

[15]  D. Donoho,et al.  Redundant Multiscale Transforms and Their Application for Morphological Component Separation , 2004 .

[16]  Michael Lindenbaum,et al.  Sequential Karhunen-Loeve basis extraction and its application to images , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[17]  Tong Zhang,et al.  Transient Artifact Reduction Algorithm (TARA) Based on Sparse Optimization , 2014, IEEE Transactions on Signal Processing.

[18]  Michael Elad,et al.  Submitted to Ieee Transactions on Image Processing Image Decomposition via the Combination of Sparse Representations and a Variational Approach , 2022 .

[19]  Shervin Minaee,et al.  Screen content image segmentation using least absolute deviation fitting , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[20]  Ming Xu,et al.  Mixed raster content (MRC) model for compound image compression , 1998, Electronic Imaging.

[21]  Andrew B. Watson,et al.  Image Compression Using the Discrete Cosine Transform , 1994 .

[22]  Yoshua Bengio,et al.  High quality document image compression with "DjVu" , 1998, J. Electronic Imaging.

[23]  Zhifei Zhang,et al.  Robust coupling in space of sparse codes for multi-view recognition , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[24]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Shervin Minaee,et al.  Highly Accurate Multispectral Palmprint Recognition Using Statistical and Wavelet Features , 2014, ArXiv.

[26]  Vishal Monga,et al.  Simultaneous Sparsity Model for Histopathological Image Representation and Classification , 2014, IEEE Transactions on Medical Imaging.

[27]  Christian Jutten,et al.  Outlier-aware dictionary learning for sparse representation , 2014, 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP).

[28]  Shervin Minaee,et al.  Fingerprint recognition using translation invariant scattering network , 2015, 2015 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).