Real-Time Computerized Annotation of Pictures

Developing effective methods for automated annotation of digital pictures continues to challenge computer scientists. The capability of annotating pictures by computers can lead to breakthroughs in a wide range of applications, including Web image search, online picture-sharing communities, and scientific experiments. In this work, the authors developed new optimization and estimation techniques to address two fundamental problems in machine learning. These new techniques serve as the basis for the automatic linguistic indexing of pictures - real time (ALIPR) system of fully automatic and high-speed annotation for online pictures. In particular, the D2-clustering method, in the same spirit as K-Means for vectors, is developed to group objects represented by bags of weighted vectors. Moreover, a generalized mixture modeling technique (kernel smoothing as a special case) for nonvector data is developed using the novel concept of hypothetical local mapping (HLM). ALIPR has been tested by thousands of pictures from an Internet photo-sharing site, unrelated to the source of those pictures used in the training process. Its performance has also been studied at an online demonstration site, where arbitrary users provide pictures of their choices and indicate the correctness of each annotation word. The experimental results show that a single computer processor can suggest annotation terms in real time and with good accuracy.

[1]  C. Mallows A Note on Asymptotic Joint Normality , 1972 .

[2]  D. Freedman,et al.  Some Asymptotic Theory for the Bootstrap , 1981 .

[3]  R. E. Wheeler Statistical distributions , 1983, APLQ.

[4]  S. Rachev The Monge–Kantorovich Mass Transference Problem and Its Stochastic Applications , 1985 .

[5]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[6]  Tomaso Poggio,et al.  Image Representations for Visual Learning , 1996, Science.

[7]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[8]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Leonidas J. Guibas,et al.  A metric for distributions with applications to image databases , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[10]  Shih-Fu Chang,et al.  Semantic visual templates: linking visual features to semantics , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[11]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  James Ze Wang,et al.  Classification of textured and non-textured images using region segmentation , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[13]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  James Ze Wang,et al.  IRM: integrated region matching for image retrieval , 2000, ACM Multimedia.

[15]  James Z. Wang Region-based retrieval of biomedical images , 2000, MM 2000.

[16]  James Zijun Wang,et al.  Pathfinder: multiresolution region-based searching of pathology images using IRM , 2000, AMIA.

[17]  M. Evans Statistical Distributions , 2000 .

[18]  James Z. Wang SIMPLIcity: a region-based retrieval system for picture libraries and biomedical image databases , 2000, MM 2000.

[19]  James Ze Wang,et al.  RF/sup */IPF: a weighting scheme for multimedia information retrieval , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[20]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2001, Springer Series in Statistics.

[21]  Peter J. Bickel,et al.  The Earth Mover's distance is the Mallows distance: some insights from statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[22]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[23]  James Ze Wang,et al.  A scalable integrated region-based image retrieval system , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[24]  James Ze Wang,et al.  Unsupervised Multiresolution Segmentation for Images with Low Depth of Field , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  James Ze Wang,et al.  Scalable integrated region-based image retrieval using IRM and statistical clustering , 2001, JCDL '01.

[26]  Yixin Chen,et al.  FIRM: fuzzily integrated region matching for content-based image retrieval , 2001, MULTIMEDIA '01.

[27]  Tsuhan Chen,et al.  An active learning framework for content-based information retrieval , 2002, IEEE Trans. Multim..

[28]  James Ze Wang,et al.  LARGE-SCALE EMPEROR DIGITAL LIBRARY AND SEMANTICS-SENSITIVE REGION-BASED RETRIEVAL , 2002 .

[29]  Yixin Chen,et al.  A Region-Based Fuzzy Feature Matching Approach to Content-Based Image Retrieval , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  James Ze Wang,et al.  Learning-based linguistic indexing of pictures with 2--d MHMMs , 2002, MULTIMEDIA '02.

[31]  Jiebo Luo,et al.  Probabilistic spatial context models for scene content understanding , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[32]  Yixin Chen,et al.  Content-based image retrieval by clustering , 2003, MIR '03.

[33]  Daniel Gatica-Perez,et al.  On image auto-annotation with latent space models , 2003, ACM Multimedia.

[34]  James Ze Wang,et al.  Evaluation strategies for automatic linguistic indexing of pictures , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[35]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[36]  Yixin Chen,et al.  An unsupervised learning approach to content-based image retrieval , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[37]  Yixin Chen,et al.  Kernel machines and additive fuzzy systems: classification and function approximation , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[38]  Yixin Chen,et al.  Support vector learning for fuzzy rule-based classification systems , 2003, IEEE Trans. Fuzzy Syst..

[39]  Yixin Chen,et al.  Looking beyond region boundaries: a robust image similarity measure using fuzzified region features , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[40]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Wei-Ying Ma,et al.  Learning an image manifold for retrieval , 2004, MULTIMEDIA '04.

[42]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[43]  Yixin Chen,et al.  Image Categorization by Learning and Reasoning with Regions , 2004, J. Mach. Learn. Res..

[44]  B. S. Manjunath,et al.  Cortina: a system for large-scale, content-based web image retrieval , 2004, MULTIMEDIA '04.

[45]  Jingrui He,et al.  Mean version space: a new active learning method for content-based image retrieval , 2004, MIR '04.

[46]  James Ze Wang,et al.  Studying digital imagery of ancient paintings by mixtures of stochastic models , 2004, IEEE Transactions on Image Processing.

[47]  Paul A. Viola,et al.  Boosting Image Retrieval , 2004, International Journal of Computer Vision.

[48]  James Ze Wang,et al.  Content-based image retrieval: approaches and trends of the new age , 2005, MIR '05.

[49]  Nuno Vasconcelos,et al.  A multiresolution manifold distance for invariant image similarity , 2005, IEEE Transactions on Multimedia.

[50]  Alberto Del Bimbo,et al.  Trademark matching and retrieval in sports video databases , 2007, MIR '07.

[51]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[53]  P. Deb Finite Mixture Models , 2008 .

[54]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.