Retrieving Chinese Captions in Video Images by Wavelet Transform and Color Clustering

Text information retrieval is an important field in content-based video retrieval. This article integrates wavelet transform and color clustering to retrieve Chinese captions from video images, and performs denoising according to the characteristics of printed Chinese characters. For color clustering, it proposes an 8-neighbor clustering method that uses spatial correlation to achieve robust performance. Experiments demonstrate that the method is effective for retrieving Chinese captions.
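
The abstract does not spell out the 8-neighbor clustering procedure, so the following is only a minimal sketch of one plausible reading: pixels are grouped by region growing over their 8 neighbors, and two adjacent pixels join the same cluster when their colors are close. The function names, the squared RGB distance, and the `threshold` parameter are assumptions made for illustration, not details taken from the article.

```python
from collections import deque

# Offsets (row, column) of the 8 neighbors of a pixel.
NEIGHBORS_8 = [(-1, -1), (-1, 0), (-1, 1),
               (0, -1),           (0, 1),
               (1, -1),  (1, 0),  (1, 1)]


def color_distance(a, b):
    """Squared Euclidean distance between two RGB tuples (assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cluster_8_neighbors(image, threshold):
    """Give 8-connected pixels of similar color the same cluster label.

    image: list of rows, each row a list of (r, g, b) tuples.
    threshold: hypothetical maximum squared color distance allowed
               between two adjacent pixels of the same cluster.
    Returns (labels, cluster_count).
    """
    height, width = len(image), len(image[0])
    labels = [[-1] * width for _ in range(height)]
    next_label = 0
    for r0 in range(height):
        for c0 in range(width):
            if labels[r0][c0] != -1:
                continue  # pixel already belongs to a cluster
            # Start a new cluster and grow it breadth-first.
            labels[r0][c0] = next_label
            queue = deque([(r0, c0)])
            while queue:
                r, c = queue.popleft()
                for dr, dc in NEIGHBORS_8:
                    nr, nc = r + dr, c + dc
                    if (0 <= nr < height and 0 <= nc < width
                            and labels[nr][nc] == -1
                            and color_distance(image[r][c],
                                               image[nr][nc]) <= threshold):
                        labels[nr][nc] = next_label
                        queue.append((nr, nc))
            next_label += 1
    return labels, next_label


# A white diagonal stroke on a black background.
W, K = (255, 255, 255), (0, 0, 0)
image = [[W, K, K],
         [K, W, K],
         [K, K, W]]
labels, count = cluster_8_neighbors(image, threshold=100)
```

In this toy image the three white pixels touch only at their corners, so a 4-neighbor scheme would split them into separate clusters, whereas the 8-neighbor scheme keeps the diagonal stroke together. That is one way spatial correlation between adjacent pixels could help keep thin character strokes intact.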