Census transform-based static caption detection for frame rate up-conversion

This paper presents a new static caption detection method that uses the census transform (CT) and motion vector (MV) for frame rate up-conversion. The proposed method splits a frame into several blocks and detects the static regions using CT and MV. CT is used to consider the spatio-temporal consistency of the texture and MV is used to remove the non-static regions containing moving objects. Next, it corrects the falsely detected regions by performing the outlier removal and inward filling. Finally, it detects the static caption based on the existence of global motion. In the experimental results, the average F1 score of the proposed method was up to 0.704 higher than those of the benchmark methods.

[1]  Kwanghoon Sohn,et al.  Static text region detection in video sequences using color and orientation consistencies , 2008, 2008 19th International Conference on Pattern Recognition.

[2]  Ramin Zabih,et al.  Non-parametric Local Transforms for Computing Visual Correspondence , 1994, ECCV.

[3]  Andrew J. Patti,et al.  High resolution standards conversion of low resolution video , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Young Hwan Kim,et al.  Block-based static caption detection using intensity range , 2016, 2016 IEEE 5th Global Conference on Consumer Electronics.

[5]  Mohammad Ghanbari,et al.  General approach to block-matching motion estimation , 1993 .

[6]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[7]  Yuxin Peng,et al.  Using Multiple Frame Integration for the Text Recognition of Video , 2009, 2009 10th International Conference on Document Analysis and Recognition.