Horizon estimation: perceptual and computational experiments

The human visual system can quickly and robustly infer a wealth of scene information, the scene "gist", after as little as 100 milliseconds of image presentation. Here, we investigated the ability to estimate the position of the horizon in briefly shown images. Judging the horizon position quickly and accurately helps in inferring viewer orientation and scene structure in general, and thus might be an important component of scene gist. In the first, perceptual study, we investigated participants' horizon estimates after a 150-millisecond masked presentation of typical outdoor scenes from different scene categories. All images were shown in upright, blurred, inverted, and cropped conditions to investigate the influence of different information types on the perceptual decision. We found that, despite individual variations, horizon estimates were fairly consistent across participants and conformed well to annotated data. In addition, inversion resulted in significant differences in performance, whereas blurring did not change the results, highlighting the importance of global, low-frequency information for judgments about horizon position. In the second, computational experiment, we correlated the performance of several horizon estimation algorithms with the human data. The algorithms ranged from simple estimates of bright-dark transitions to more sophisticated frequency spectrum analyses motivated by previous computational modeling of scene classification results. Surprisingly, the best fits to human data were obtained with one very simple gradient method and with the most complex, trained method. Overall, global frequency spectrum analysis provided the best fit to human estimates, which together with the perceptual data suggests that the human visual system might use similar mechanisms to quickly judge horizon position as part of the scene gist.
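To make the simplest class of algorithm mentioned above more concrete, the following is a minimal sketch of a bright-dark transition estimator. It is an illustration under stated assumptions, not the method used in the study: the function name `estimate_horizon_row`, the grayscale list-of-rows input format, and the choice of the single strongest row-to-row luminance change are all hypothetical.

```python
from typing import Sequence


def estimate_horizon_row(image: Sequence[Sequence[float]]) -> float:
    """Estimate horizon height as a fraction of image height (0 = top, 1 = bottom).

    Assumption (not from the paper): the horizon lies at the strongest
    change in mean luminance between two vertically adjacent rows.
    `image` is a grayscale image given as rows of luminance values.
    """
    if len(image) < 2 or any(len(row) == 0 for row in image):
        raise ValueError("image needs at least two non-empty rows")

    # Mean luminance of each row: a crude global, low-frequency summary.
    row_means = [sum(row) / len(row) for row in image]

    # Absolute luminance change between each pair of adjacent rows.
    steps = [abs(row_means[i + 1] - row_means[i])
             for i in range(len(row_means) - 1)]

    # Index of the strongest transition; the boundary sits just below that row.
    boundary = max(range(len(steps)), key=steps.__getitem__)
    return (boundary + 1) / len(image)
```

For a synthetic image with three bright "sky" rows above five dark "ground" rows, `estimate_horizon_row([[0.9] * 4] * 3 + [[0.2] * 4] * 5)` returns `0.375`, that is, a horizon three eighths of the way down the image. Estimates of this kind could then be correlated with human responses, as described for the computational experiment.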
