Psychophysical study of image orientation perception.

The experiment reported here investigates the perception of orientation of color photographic images. A collection of 1000 images (mix of professional photos and consumer snapshots) was used in this study. Each image was examined by at least five observers and shown at varying resolutions. At each resolution, observers were asked to indicate the image orientation, the level of confidence, and the cues they used to make the decision. The results show that for typical images, accuracy is close to 98% when using all available semantic cues from high-resolution images, and 84% when using only low-level vision features and coarse semantics from thumbnails. The accuracy by human observers suggests an upper bound for the performance of an automatic system. In addition, the use of a large, carefully chosen image set that spans the 'photo space' (in terms of occasions and subject matter) and extensive interaction with the human observers reveals cues used by humans at various image resolutions: sky and people are the most useful and reliable among a number of important semantic cues.

[1]  Heinrich H Bülthoff,et al.  Image-based object recognition in man, monkey and machine , 1998, Cognition.

[2]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Jake K. Aggarwal,et al.  Retrieval by classification of images containing large manmade objects using perceptual grouping , 2002, Pattern Recognit..

[4]  Neill W. Campbell,et al.  Automatic Segmentation and Classification of Outdoor Images Using Neural Networks , 1997, Int. J. Neural Syst..

[5]  Anil K. Jain,et al.  Automatic image orientation detection , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[6]  M. Corballis,et al.  Decisions about identity and orientation of rotated letters and digits , 1978, Memory & cognition.

[7]  Takeo Kanade,et al.  Probabilistic modeling of local appearance and spatial relationships for object recognition , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[8]  Takeo Kanade,et al.  A statistical approach to 3d object detection applied to faces and cars , 2000 .

[9]  Jiebo Luo,et al.  A physical model-based approach to detecting sky in photographic images , 2002, IEEE Trans. Image Process..

[10]  Pierre Jolicoeur,et al.  Recognition thresholds for plane-rotated pictures of familiar objects. , 2003, Acta psychologica.

[11]  Adam Reeves,et al.  The use of word-picture verification to study entry-level object recognition: Further support for view-invariant mechanisms , 2002, Memory & cognition.

[12]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[13]  R. Hess,et al.  The interaction of first- and second-order cues to orientation , 1999, Vision Research.

[14]  M J Tarr,et al.  Is human object recognition better described by geon structural descriptions or by multiple views? Comment on Biederman and Gerhardstein (1993). , 1995, Journal of experimental psychology. Human perception and performance.

[15]  A. Murat Tekalp,et al.  Automatic Image Annotation Using Adaptive Color Classification , 1996, CVGIP Graph. Model. Image Process..

[16]  Yongmei Wang,et al.  Content-based image orientation detection with support vector machines , 2001, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL 2001).

[17]  Michael J. Tarr Is human object recognition better described by geon structural description or by multiple views , 1995 .

[18]  Martin Szummer,et al.  Indoor-outdoor image classification , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[19]  Jiebo Luo,et al.  Indoor vs outdoor classification of consumer photographs using low-level and semantic features , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[20]  Takeo Kanade,et al.  A statistical method for 3D object detection applied to faces and cars , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[21]  Elinor McKone,et al.  Orientation invariance in naming rotated objects: Individual differences and repetition priming , 1999, Perception & psychophysics.

[22]  P. Jolicoeur The time to name disoriented natural objects , 1985, Memory & cognition.

[23]  G K Humphrey,et al.  Surface Cues Reduce the Latency to Name Rotated Images of Objects , 2001, Perception.

[24]  L. Braine,et al.  On how adults identify the orientation of a shape , 1981, Perception & psychophysics.

[25]  R. Shapley,et al.  Contextual influences on orientation discrimination: binding local and global cues , 2001, Vision Research.

[26]  Anil K. Jain,et al.  On image classification: city images vs. landscapes , 1998, Pattern Recognit..

[27]  Milind R. Naphade,et al.  A probabilistic framework for semantic indexing and retrieval in video , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[28]  Toby J. Lloyd-Jones,et al.  Effects of plane rotation, task, and complexity on recognition of familiar and chimeric objects , 2002, Memory & cognition.

[29]  Adam Reeves,et al.  Rotating objects to determine orientation, not identity: Evidence from a backward-masking/dual-task procedure , 2000, Perception & psychophysics.

[30]  R. Maki Naming and locating the tops of rotated pictures. , 1986, Canadian journal of psychology.

[31]  P Jolicoeur,et al.  Orientation congruency effects in visual search. , 1992, Canadian journal of psychology.

[32]  Pierre Jolicoeur,et al.  The influence of perceived rotary motion on the recognition of rotated objects , 1998 .

[33]  Manila Vannucci,et al.  Category Effects on the Processing of Plane-Rotated Objects , 2000, Perception.

[34]  Stefano A. De Caro On the perception of objects and their orientations. , 1998 .

[35]  Jake K. Aggarwal,et al.  Combining structure, color and texture for image retrieval: A performance evaluation , 2002, Object recognition supported by user interaction for service robots.