Extracting multidimensional signal features for content-based visual query

Future large visual information systems (such as image databases and video servers) require effective and efficient methods for indexing, accessing, and manipulating images based on visual content. This paper focuses on the automatic extraction of low-level visual features such as texture, color, and shape. Continuing our prior work on compressed video manipulation, we also propose to derive visual features directly in the compressed domain, such as the DCT and wavelet transform domains. By focusing on low-level features, we hope to obtain generic techniques that apply across a wide range of applications. By extracting content directly from compressed data, we hope to reduce computational complexity. We also propose a quad-tree based data structure to bind the various signal features together, and integrated feature maps to improve the overall effectiveness of the feature-based image query system. Current technical progress and system prototypes are also described. Part of the prototype work has been integrated into the Multimedia/VOD testbed in the Advanced Image Lab of Columbia University.
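As a rough illustration of how a quad-tree might bind signal features to image regions, the following Python sketch splits a square grayscale image recursively and stores a small feature record at every node. The abstract does not specify the split criterion or the feature set, so everything here is an assumption made for illustration: the names `QuadNode`, `block_features`, `build_quadtree`, and `leaves` are hypothetical, the mean and variance of intensity stand in for real texture or color features, and the variance threshold is arbitrary. The image side is assumed to be a power of two.

```python
from dataclasses import dataclass, field
from statistics import fmean, pvariance
from typing import Dict, List, Optional

Image = List[List[float]]  # square grayscale image, side assumed to be a power of two


@dataclass
class QuadNode:
    """One quad-tree node: a square region plus the features bound to it."""
    x: int
    y: int
    size: int
    features: Dict[str, float]
    children: List["QuadNode"] = field(default_factory=list)


def block_features(img: Image, x: int, y: int, size: int) -> Dict[str, float]:
    # Placeholder features (assumption): intensity mean and variance.
    pixels = [img[r][c] for r in range(y, y + size) for c in range(x, x + size)]
    return {"mean": fmean(pixels), "variance": pvariance(pixels)}


def build_quadtree(img: Image, x: int = 0, y: int = 0,
                   size: Optional[int] = None,
                   var_threshold: float = 25.0,
                   min_size: int = 2) -> QuadNode:
    """Split a block into four quadrants while it is not homogeneous."""
    if size is None:
        size = len(img)
    node = QuadNode(x, y, size, block_features(img, x, y, size))
    # Assumed homogeneity test: stop when variance is low or the block is small.
    if size > min_size and node.features["variance"] > var_threshold:
        half = size // 2
        for dy in (0, half):
            for dx in (0, half):
                node.children.append(
                    build_quadtree(img, x + dx, y + dy, half,
                                   var_threshold, min_size))
    return node


def leaves(node: QuadNode) -> List[QuadNode]:
    """Collect the homogeneous regions (leaf nodes) in traversal order."""
    if not node.children:
        return [node]
    return [leaf for child in node.children for leaf in leaves(child)]


if __name__ == "__main__":
    # Left half dark, right half bright: the root splits once into four blocks.
    image = [[0.0, 0.0, 100.0, 100.0] for _ in range(4)]
    for leaf in leaves(build_quadtree(image)):
        print(leaf.x, leaf.y, leaf.size, leaf.features)
```

Because every node carries a dictionary, further features (for example, ones computed from DCT or wavelet coefficients) could be attached to the same region under additional keys without changing the tree structure.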
