Content-based indexing of images and video.

By representing image content using probabilistic models of an object's appearance we can obtain semantics-preserving compression of the image data. Such compact representations of an image's salient features allow rapid computer searches of even large image databases. Examples are shown for databases of face images, a video of American sign language (ASL), and a video of facial expressions.

[1]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[2]  Alex Pentland,et al.  A vision system for observing and extracting facial action parameters , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Alex Pentland,et al.  Probabilistic visual learning for object detection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[4]  A. Pentland Smart rooms, smart clothes , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[5]  Alex Pentland,et al.  Space-time gestures , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[6]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[7]  Tomaso A. Poggio,et al.  Model-based matching of line drawings by linear combinations of prototypes , 1995, Proceedings of IEEE International Conference on Computer Vision.

[8]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[9]  Alex Pentland,et al.  Facial expression recognition using a dynamic model and motion energy , 1995, Proceedings of IEEE International Conference on Computer Vision.

[10]  Fang Liu,et al.  A new Wold ordering for image similarity , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  Thad Starner,et al.  Visual Recognition of American Sign Language Using Hidden Markov Models. , 1995 .

[12]  Rosalind W. Picard,et al.  Finding similar patterns in large image databases , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Hyeonjoon Moon,et al.  The FERET September 1996 Database and Evaluation Procedure , 1997, AVBPA.

[15]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[16]  Alex Pentland,et al.  Video and Image Semantics: Advanced Tools for Telecommunications , 1994, IEEE Multim..

[17]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.