P-1 Extracting Multi-Dimensional Signal Features for Content-Based Visual Query

Future large visual information systems, such as image databases and video servers, will require effective and efficient methods for indexing, accessing, and manipulating images based on their visual content. This paper focuses on the automatic extraction of low-level visual features such as texture, color, and shape. Continuing our prior work on compressed video manipulation, we also propose to explore deriving visual features directly from the compressed domain, such as the DCT and wavelet transform domains. By focusing on low-level features, we hope to obtain generic techniques that apply across a broad range of applications. By exploiting the extractability of content in the compressed domain, we hope to reduce computational complexity. We also propose a quad-tree-based data structure to bind the various signal features together, and integrated feature maps to improve the overall effectiveness of the feature-based image query system. Current technical progress and system prototypes are also described. Part of the prototype work has been integrated into the Multimedia/VOD testbed in the Advanced Image Lab at Columbia University.
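As a rough illustration of how a quad-tree might bind per-region signal features, the following is a minimal sketch and not the authors' implementation. It assumes a square grayscale image whose side is a power of two, and it uses block mean and variance as stand-in features; the paper's actual features (texture, color, shape, possibly computed from DCT or wavelet coefficients) would take their place. The names `QuadNode`, `block_features`, `build_quadtree`, `leaves`, and the `var_threshold` parameter are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Image = List[List[float]]  # assumed: square, side length a power of two


@dataclass
class QuadNode:
    """One square region of the image plus the features bound to it."""
    x: int
    y: int
    size: int
    features: Dict[str, float]
    children: List["QuadNode"] = field(default_factory=list)

    def is_leaf(self) -> bool:
        return not self.children


def block_features(img: Image, x: int, y: int, size: int) -> Dict[str, float]:
    """Stand-in features (mean, variance) for the block at (x, y)."""
    vals = [img[r][c] for r in range(y, y + size) for c in range(x, x + size)]
    mean = sum(vals) / len(vals)
    variance = sum((v - mean) ** 2 for v in vals) / len(vals)
    return {"mean": mean, "variance": variance}


def build_quadtree(img: Image, x: int = 0, y: int = 0, size: int = 0,
                   var_threshold: float = 1.0, min_size: int = 1) -> QuadNode:
    """Split a block into four quadrants while it looks non-homogeneous."""
    if size == 0:
        size = len(img)
    node = QuadNode(x, y, size, block_features(img, x, y, size))
    if size > min_size and node.features["variance"] > var_threshold:
        half = size // 2
        for dy in (0, half):          # top row of quadrants, then bottom row
            for dx in (0, half):      # left quadrant, then right quadrant
                node.children.append(
                    build_quadtree(img, x + dx, y + dy, half,
                                   var_threshold, min_size))
    return node


def leaves(node: QuadNode) -> List[QuadNode]:
    """Collect the homogeneous regions (leaf nodes) in traversal order."""
    if node.is_leaf():
        return [node]
    out: List[QuadNode] = []
    for child in node.children:
        out.extend(leaves(child))
    return out


if __name__ == "__main__":
    # Left half dark, right half bright: the root splits once into 4 leaves.
    img = [[0.0, 0.0, 10.0, 10.0] for _ in range(4)]
    for leaf in leaves(build_quadtree(img)):
        print(leaf.x, leaf.y, leaf.size, leaf.features)
```

In this sketch each node carries a dictionary of features, so additional feature types can be attached to the same region without changing the tree structure, which is one plausible reading of "binding" features in a quad-tree.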
