Illumination Invariance and Object Model in Content-Based Image and Video Retrieval

With huge amounts of multimedia information now connected to the Internet, efficient and effective image retrieval from large image and video repositories has become a pressing research issue. This article presents our research in the C-BIRD (Content-Based Image Retrieval in Digital Libraries) project. In addition to the use of common features such as color, texture, shape, and their conjunctions, and the combination of content-based and description-based techniques, we show that (a) color-channel normalization enables illumination-invariant search, and (b) feature localization and a three-step matching algorithm (color hypothesis, texture support, shape verification) facilitate search by object model in image and video databases.
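To illustrate point (a), the following is a minimal sketch of how color-channel normalization can make a chromaticity histogram insensitive to a change in illuminant color. It is not the C-BIRD implementation. It assumes a diagonal illumination model (each channel is scaled by its own constant), unit-mean normalization per channel, and histogram intersection as the similarity measure; the function names, the bin count, and the normalization constant are hypothetical choices made for this example.

```python
def normalize_channels(pixels):
    """Divide each color channel by its mean over the image.

    pixels: non-empty list of (r, g, b) tuples of non-negative floats.
    Under a diagonal illumination model, a per-channel scale factor
    cancels out in this division. (Assumption: unit-mean normalization.)
    """
    if not pixels:
        raise ValueError("image must contain at least one pixel")
    n = len(pixels)
    means = [sum(p[c] for p in pixels) / n for c in range(3)]
    return [
        tuple(p[c] / means[c] if means[c] > 0 else 0.0 for c in range(3))
        for p in pixels
    ]


def chromaticity_histogram(pixels, bins=16):
    """2D histogram over chromaticity (r, g) = (R, G) / (R + G + B),
    normalized so that all entries sum to 1."""
    hist = [[0.0] * bins for _ in range(bins)]
    count = 0
    for r, g, b in pixels:
        s = r + g + b
        if s <= 0:
            continue  # black pixel: chromaticity is undefined
        i = min(int(r / s * bins), bins - 1)
        j = min(int(g / s * bins), bins - 1)
        hist[i][j] += 1.0
        count += 1
    if count:
        hist = [[v / count for v in row] for row in hist]
    return hist


def histogram_intersection(h1, h2):
    """Similarity in [0, 1]: sum of bin-wise minima of two normalized histograms."""
    return sum(min(a, b) for r1, r2 in zip(h1, h2) for a, b in zip(r1, r2))
```

A possible use: compute `chromaticity_histogram(normalize_channels(image))` for the query and for each database image, then rank database images by `histogram_intersection`. Under the stated assumptions, two views of the same scene that differ only in illuminant color yield matching histograms, whereas histograms built from the raw pixel values generally do not.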
