论文信息 - Active Learning for Information Retrieval : Using 3 D Models As An Example

Active Learning for Information Retrieval : Using 3 D Models As An Example

In this paper, we propose a general approach to content-based information retrieval. We use active learning to produce hidden annotations to improve the retrieval performance. We apply the proposed algorithm to designing a 3D model retrieval system. Our database is composed of approximately 1750 3D models having among 53 attributes that form a complex attribute tree. For each model in the database, we maintain a list of probabilities, each indicating the probability of this model having one of the attributes. During training, the learning algorithm samples models in the database and presents them to the annotator to assign attributes to. For each sampled model, each probability is set to be one or zero depending on whether or not the corresponding attribute is assigned by the annotator. For models that have not been annotated, the learning algorithm estimates their probabilities based on a potential function. Specifically, models close to an annotated model are likely to have the same attributes as the annotated model, so similar probabilities from the annotated model are given to these models. At each stage of the learning, the algorithm is able to determine, among the models that have not been annotated, which model the system is the most uncertain of, and presents it as the next sample to the annotator to assign attributes to. During retrieval, the list of probabilities works as a feature vector for us to calculate the semantic distance between two models, or between the user query and a model in the database. The overall distance between two models is determined by a weighted sum of the semantic distance and the low-level feature distance. We show that with the proposed algorithm, the retrieval performance of the system improves rapidly with the number of annotated samples. Furthermore, we show that active learning outperforms learning based on random sampling.

Tsuhan Chen | Cha Zhang

[1] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[2] Ernest L. Hall,et al. Three-Dimensional Moment Invariants , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Hon-Son Don,et al. 3-D Moment Forms: Their Construction and Application to Object Identification and Positioning , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[4] Gerard Salton,et al. Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[5] Donna K. Harman,et al. Relevance feedback revisited , 1992, SIGIR '92.

[6] H. Sebastian Seung,et al. Information, Prediction, and Query by Committee , 1992, NIPS.

[7] H. Sebastian Seung,et al. Query by committee , 1992, COLT '92.

[8] David A. Cohn,et al. Active Learning with Statistical Models , 1996, NIPS.

[9] William A. Gale,et al. A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[10] Anders Krogh,et al. Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[11] David D. Lewis,et al. A sequential algorithm for training text classifiers: corrigendum and additional data , 1995, SIGF.

[12] Gordon Bell,et al. The virtual reality modeling language , 1995 .

[13] B. S. Manjunath,et al. Texture features and learning similarity , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14] Rosalind W. Picard,et al. Interactive Learning Using a "Society of Models" , 2017, CVPR 1996.

[15] Marc Rioux,et al. Nefertiti: a query by content software for three-dimensional models databases management , 1997, Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134).

[16] Christos Faloutsos,et al. MindReader: Querying Databases Through Multiple Examples , 1998, VLDB.

[17] Thomas S. Huang,et al. Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[18] Andrew McCallum,et al. Pool-Based Active Learning for Text Classification , 1999 .

[19] M. Hasenjäger,et al. Active Learning in Self-Organizing Maps , 1999 .

[20] Stefanos D. Kollias,et al. Nonlinear relevance feedback: improving the performance of content-based retrieval systems , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[21] Qi Tian,et al. Update relevant image weights for content-based image retrieval using support vector machines , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[22] Samuel Moon-Ho Song,et al. Relevance graph-based image retrieval , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[23] Toshikazu Kato,et al. A similarity retrieval of 3D polygonal models using rotation invariant shape descriptors , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.

[24] Andrew W. Moore,et al. 'N-Body' Problems in Statistical Learning , 2000, NIPS.

[25] Ingemar J. Cox,et al. The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments , 2000, IEEE Trans. Image Process..

[26] Jobst Löffler. Content-based Retrieval of 3D Models in Distributed Web Databases by Visual Shape Information , 2000, IV.

[27] Qiang Yang,et al. A unified framework for semantics and feature based relevance feedback in image retrieval systems , 2000, ACM Multimedia.

[28] Thomas S. Huang,et al. Optimizing learning in image retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[29] Tsuhan Chen,et al. Efficient feature extraction for 2D/3D objects in mesh representation , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).