Paper information - Image retrieval using image context vectors: first results

Image retrieval using image context vectors: first results

The key question for any image retrieval approach is how to represent the images. We are exploring a new image context vector representation that avoids the need for full image understanding. This representation: (1) is invariant with respect to translation and scaling of (whole) images, (2) is robust with respect to translation, scaling, small rotations, and partial occlusions of objects within images, (3) avoids explicit segmentation into objects, and (4) allows computation of image-query similarity using only about 300 multiplications and additions. A context vector is a high-dimensional vector (approximately 300 dimensions) that can represent images, subimages, or image queries. Image context vectors are an extension of previous work in document retrieval, where context vectors were used to represent documents, terms, and queries. The image is first represented as a collection of pairs of features. Each feature pair is then transformed into a 300-dimensional context vector that encodes the feature pair and its orientation. The vectors for all pairs are added together to form the context vector for the entire image. Retrieval order is determined by taking dot products of image context vectors with a query context vector, which is a fast operation. Results from a first prototype look promising.
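The pipeline described in the abstract (feature pairs, one vector per pair, summation, dot-product ranking) can be sketched roughly as follows. This is a minimal illustration, not the authors' method: the abstract does not say how a feature pair and its orientation are mapped to a vector, so the sketch assumes a deterministic pseudo-random unit vector per (feature, feature, orientation bin) triple. The names `pair_vector`, `image_vector`, `rank`, the feature labels, and the orientation bins are all hypothetical.

```python
import hashlib
import math
import random

DIM = 300  # dimensionality stated in the abstract


def pair_vector(feat_a, feat_b, orientation_bin):
    """ASSUMPTION: encode a feature pair plus its orientation as a
    deterministic pseudo-random unit vector. The paper's actual
    encoding is not given in the abstract."""
    key = f"{feat_a}|{feat_b}|{orientation_bin}".encode()
    seed = int.from_bytes(hashlib.sha256(key).digest()[:8], "big")
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(DIM)]
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def image_vector(feature_pairs):
    """Add the vectors of all feature pairs to get one image context vector."""
    total = [0.0] * DIM
    for feat_a, feat_b, orientation_bin in feature_pairs:
        for i, x in enumerate(pair_vector(feat_a, feat_b, orientation_bin)):
            total[i] += x
    return total


def dot(u, v):
    """Image-query similarity: DIM multiplications and additions."""
    return sum(x * y for x, y in zip(u, v))


def rank(query_vec, image_vecs):
    """Return image names ordered by decreasing dot product with the query."""
    return sorted(image_vecs, key=lambda name: dot(query_vec, image_vecs[name]),
                  reverse=True)


# Hypothetical toy data: each image is a list of (feature, feature, orientation bin).
images = {
    "street": [("line", "line", 0), ("line", "corner", 1), ("corner", "corner", 2)],
    "forest": [("blob", "blob", 0), ("blob", "line", 3)],
}
image_vecs = {name: image_vector(pairs) for name, pairs in images.items()}
query = image_vector([("line", "line", 0), ("line", "corner", 1)])
ranking = rank(query, image_vecs)
print(ranking)
```

In this toy setup the "street" image shares both query pairs, so its dot product with the query is close to 2, while unrelated pairs contribute values near 0 because random directions in 300 dimensions are nearly orthogonal. Since the image vector is a plain sum over pairs, it does not depend on where the pairs occur in the image, and dropping a few pairs (as with partial occlusion) only lowers the score gradually.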

Stephen I. Gallant | Michael F. Johnston
