Neural-fuzzy approach for content-based retrieval of digital video

As digital video databases become more and more pervasive, finding video in large databases becomes a major problem. Because of the nature of video (streamed objects), accessing the content of such databases is inherently a time-consuming operation. The paper proposes a novel neural-fuzzy based approach for retrieving a specific video clip from a video database. Fuzzy logic is used for expressing queries in terms of natural language and a neural network is designed to learn the meaning of these queries. The queries are designed based on features such as colour and texture of shots, scenes and objects in video clips. An error backpropagation algorithm is proposed to learn the meaning of queries in fuzzy terms such as "very similar", "similar" and "some-what similar". Preliminary experiments were conducted on a small video database and different combinations of queries using colour and texture features along with a visual video clip; very promising results were achieved.

[1]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[2]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[3]  Yücel Altunbasak,et al.  Content-based video retrieval and compression: a unified solution , 1997, Proceedings of International Conference on Image Processing.

[4]  Vito Di Gesù,et al.  Content-based indexing of image and video databases by global and shape features , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[5]  John R. Smith,et al.  Sequential processing for content-based retrieval of composite objects , 1997, Electronic Imaging.

[6]  M. La Cascia,et al.  Motion and color-based video indexing and retrieval , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  Shih-Fu Chang,et al.  VideoQ: an automated content based video search system using visual cues , 1997, MULTIMEDIA '97.

[8]  Christos Faloutsos,et al.  Compressed-domain video indexing techniques using DCT and motion vector information in MPEG video , 1997, Electronic Imaging.

[9]  B. S. Manjunath,et al.  NeTra-V: toward an object-based video representation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[10]  Jie Wei,et al.  Illumination-invariant video segmentation by hierarchical robust thresholding , 1997, Electronic Imaging.

[11]  Shih-Fu Chang,et al.  Video object model and segmentation for content-based video indexing , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.