Exploration of Visual Data

1: Introduction. 1.1. Challenges. 1.2. Research Scope. 1.3. State-of-the-Art. 1.4. Outline of Book.
2: Overview of Visual Information Representation. 2.1. Color. 2.2. Texture. 2.3. Shape. 2.4. Spatial Layout. 2.5. Interest Points. 2.6. Image Segmentation. 2.7. Summary.
3: Edge-based Structural Features. 3.1. Visual Feature Representation. 3.2. Edge-Based Structural Features. 3.3. Experiments and Analysis.
4: Probabilistic Local Structure Models. 4.1. Introduction. 4.2. The Proposed Modeling Scheme. 4.3. Implementation Issues. 4.4. Experiments and Discussion. 4.5. Summary and Discussion.
5: Constructing Table-of-Content for Videos. 5.1. Introduction. 5.2. Related Work. 5.3. The Proposed Approach. 5.4. Determination of the Parameters. 5.5. Experimental Results. 5.6. Conclusions.
6: Nonlinearly Sampled Video Streaming. 6.1. Introduction. 6.2. Problem Statement. 6.3. Frame Saliency Scoring. 6.4. Scenario and Assumptions. 6.5. Minimum Buffer Formulation. 6.6. Limited-Buffer Formulation. 6.7. Extensions and Analysis. 6.8. Experimental Evaluation. 6.9. Discussion.
7: Relevance Feedback for Visual Data Retrieval. 7.1. The Need for User-in-the-Loop. 7.2. Problem Statement. 7.3. Overview of Existing Techniques. 7.4. Learning from Positive Feedbacks. 7.5. Adding Negative Feedbacks: Discriminant Analysis? 7.6. Biased Discriminant Analysis. 7.7. Nonlinear Extensions Using Kernel and Boosting. 7.8. Comparisons and Analysis. 7.9. Relevance Feedback on Image Tiles.
8: Toward Unification of Keywords and Low-Level Contents. 8.1. Introduction. 8.2. Joint Querying and Relevance Feedback. 8.3. Learning Semantic Relations between Keywords. 8.4. Discussion.
9: Future Research Directions. 9.1. Low-level and intermediate-level visual descriptors. 9.2. Learning from user interactions. 9.3. Unsupervised detection of patterns/events. 9.4. Domain-specific applications.
References. Index.

[1]  Keinosuke Fukunaga,et al.  Three-Dimensional Shape Analysis Using Local Shape Descriptors , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  Xiang Sean Zhou,et al.  Optimal nonlinear sampling for video streaming at low bit rates , 2002, IEEE Trans. Circuits Syst. Video Technol..

[4]  Hans P. Moravec.  Towards Automatic Visual Obstacle Avoidance , 1977, IJCAI.

[5]  Shih-Fu Chang,et al.  Automated binary texture feature sets for image retrieval , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[6]  Marco La Cascia,et al.  Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web , 1999, Comput. Vis. Image Underst..

[7]  Thomas S. Huang,et al.  Unifying Keywords and Contents in Image Retrieval: Joint Querying, Relevance Feedback, and Pseudoclassification , 2002 .

[8]  Shih-Fu Chang,et al.  Single color extraction and image query , 1995, Proceedings., International Conference on Image Processing.

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[10]  Boon-Lock Yeo,et al.  Video browsing using clustering and scene transitions on compressed sequences , 1995, Electronic Imaging.

[11]  Simone Santini,et al.  Integrated browsing and querying for image databases , 2000, IEEE MultiMedia.

[12]  Brendan J. Frey,et al.  Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[13]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Thomas S. Huang,et al.  Constructing table-of-content for videos , 1999, Multimedia Systems.

[15]  Marcel Worring,et al.  Multimodal Video Indexing: A Review of the State-of-the-art , 2001 .

[16]  Bernhard Schölkopf,et al.  Support vector learning , 1997 .

[17]  Milind R. Naphade,et al.  Extracting semantics from audio-visual content: the final frontier in multimedia retrieval , 2002, IEEE Trans. Neural Networks.

[18]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[19]  Shih-Fu Chang,et al.  Tools and techniques for color image retrieval , 1996, Electronic Imaging.

[20]  Richard C. Dubes,et al.  Performance evaluation for four classes of textural features , 1992, Pattern Recognit..

[21]  Cordelia Schmid,et al.  Learning to Parse Pictures of People , 2002, ECCV.

[22]  Nuno Vasconcelos,et al.  Statistical models of video structure for content analysis and characterization , 2000, IEEE Trans. Image Process..

[23]  Andreas Paepcke,et al.  Beyond document similarity: understanding value-based search and browsing technologies , 2000, SGMD.

[24]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Narendra Ahuja,et al.  Multiscale image segmentation by integrated edge and region detection , 1997, IEEE Trans. Image Process..

[26]  Fabio Roli,et al.  Bayesian relevance feedback for content-based image retrieval , 2004, Pattern Recognit..

[27]  Thomas S. Huang,et al.  Optimizing learning in image retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[28]  Thomas S. Huang,et al.  Automated region segmentation using attraction-based grouping in spatial-color-texture space , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[29]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[30]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[31]  Wu-chi Feng,et al.  A Survey of Application Layer Techniques for Adaptive Streaming of Multimedia , 2001, Real Time Imaging.

[32]  Wayne H. Wolf,et al.  Key frame selection by motion analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[33]  Thomas S. Huang,et al.  Segmentation of road scenes using color and fractal-based texture classification , 1994, Proceedings of 1st International Conference on Image Processing.

[34]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  P. Wintz,et al.  An efficient three-dimensional aircraft recognition algorithm using normalized Fourier descriptors , 1980 .

[36]  Alex Pentland,et al.  Fractal-Based Description of Natural Scenes , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Ralph Roskies,et al.  Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[38]  Kun Tan,et al.  Content-sensitive video streaming over low bitrate and lossy wireless network , 2001, MULTIMEDIA '01.

[39]  Min Wu,et al.  Dynamic resource allocation via video content and short-term traffic statistics , 2001, IEEE Trans. Multim..

[40]  David B. Cooper,et al.  Recognition and positioning of rigid objects using algebraic moment invariants , 1991, Optics & Photonics.

[41]  Paul A. Viola,et al.  Boosting Image Retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[42]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  S. M. Smith,et al.  SUSAN - a new approach to low level image processing , 1997 .

[44]  Gerald Salton,et al.  Automatic text processing , 1988 .

[45]  Raimondo Schettini,et al.  Content-based color image retrieval with relevance feedback , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[46]  Shih-Fu Chang,et al.  Transform features for texture classification and discrimination in large image databases , 1994, Proceedings of 1st International Conference on Image Processing.

[47]  Thomas S. Huang,et al.  Efficient access to video content in a unified framework , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[48]  Jing Xiao,et al.  Content-Based Video Indexing and Retrieval , 2004 .

[49]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[50]  A. Murat Tekalp,et al.  A high-performance shot boundary detection algorithm using multiple cues , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[51]  King-Sun Fu,et al.  Error-Correcting Isomorphisms of Attributed Relational Graphs for Pattern Analysis , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[52]  Ning Xu,et al.  Object segmentation using graph cuts based active contours , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[53]  Thomas S. Huang,et al.  Modified Fourier Descriptors for Shape Representation - A Practical Approach , 1996 .

[54]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[55]  Jean-Michel Jolion,et al.  Content based image retrieval using interest points and texture features , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[56]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[57]  Chong-Wah Ngo,et al.  Camera break detection by partitioning of 2D spatio-temporal images in MPEG domain , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[58]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[59]  Thomas S. Huang,et al.  Supporting similarity queries in MARS , 1997, MULTIMEDIA '97.

[60]  Cyrus Shahabi,et al.  Image retrieval by shape: a comparative study , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[61]  Luc Van Gool,et al.  Content-Based Image Retrieval Based on Local Affinely Invariant Regions , 1999, VISUAL.

[62]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[63]  Boon-Lock Yeo.  Efficient processing of compressed images and video , 1996 .