Novel video stabilization for real-time optical character recognition applications

This paper presents a novel video stabilization algorithm for real-time optical character recognition (OCR) applications. The proposed method generates output frames in order to stabilize the position of a target word that will be recognized by the OCR application. Unlike in conventional algorithms, in the proposed algorithm, a causal low pass filter is not applied to the trajectory of the target word for reducing the high frequency component of camera motion. The proposed algorithm directly calculates the stable position of the word using two forces: the force used to pull the target word to the center of an output frame and a back force used to return the center of an output frame to the center of an input frame. Hence, the proposed algorithm significantly minimizes the time take to respond to sudden camera movement. Although the proposed method may not outperform state-of-the-art video stabilization in terms of video stability, the proposed technique is much more appropriate for real-time OCR applications than the conventional techniques in terms of accuracy, computational cost, processing delay, and the time taken to respond to sudden camera movement. Simulation results prove the superiority of the proposed method over conventional techniques for real-time OCR applications. 2017 Elsevier Inc. All rights reserved.

[1]  Dimosthenis Karatzas,et al.  MSER-Based Real-Time Text Detection and Tracking , 2014, 2014 22nd International Conference on Pattern Recognition.

[2]  Feng Liu,et al.  Spatially and Temporally Optimized Video Stabilization , 2013, IEEE Transactions on Visualization and Computer Graphics.

[3]  Subhadip Basu,et al.  A Hough Transform based Technique for Text Segmentation , 2010, ArXiv.

[4]  Yun Gu Lee Video stabilization based on human visual system , 2014, J. Electronic Imaging.

[5]  Palaiahnakote Shivakumara,et al.  Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images , 2015, IEEE Transactions on Image Processing.

[6]  Guo Kai,et al.  Fast-rolling shutter compensation based on piecewise quadratic approximation of a camera trajectory , 2014 .

[7]  Ah Chung Tsoi,et al.  Scale Invariant Feature Transform Flow trajectory approach with applications to human action recognition , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[8]  Harry Shum,et al.  Full-frame video stabilization with motion inpainting , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Weisi Lin,et al.  Perceptual Quality Assessment of Screen Content Images , 2015, IEEE Transactions on Image Processing.

[10]  Richard Szeliski,et al.  Removing rolling shutter wobble , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Chong-Min Kyung,et al.  Suppressing rolling-shutter distortion of CMOS image sensors by motion vector detection , 2008, IEEE Transactions on Consumer Electronics.

[12]  Wen Gao,et al.  Just Noticeable Difference Estimation for Screen Content Images , 2016, IEEE Transactions on Image Processing.

[13]  Homer H. Chen,et al.  Analysis and Compensation of Rolling Shutter Effect , 2008, IEEE Transactions on Image Processing.

[14]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[15]  Guangyou Xu,et al.  A stable vision system for moving vehicles , 2000, IEEE Trans. Intell. Transp. Syst..

[16]  Weisi Lin,et al.  Saliency-Guided Quality Assessment of Screen Content Images , 2016, IEEE Transactions on Multimedia.

[17]  Ki-Sang Hong,et al.  CMOS Digital Image Stabilization , 2007, IEEE Transactions on Consumer Electronics.

[18]  Irfan A. Essa,et al.  Calibration-free rolling shutter removal , 2012, 2012 IEEE International Conference on Computational Photography (ICCP).

[19]  Michael Gleicher,et al.  Content-preserving warps for 3D video stabilization , 2009, ACM Trans. Graph..