Beyond blur

To peripheral vision, a pair of physically different images can look the same. Such pairs are metamers relative to each other, just as physically different spectra of light can be perceived as the same color. We propose a real-time method to compute such ventral metamers for foveated rendering where, in particular for near-eye displays, the largest part of the framebuffer maps to the periphery. This improves on the quality of state-of-the-art foveation methods, which blur the periphery. Work in vision science has established that peripheral stimuli are ventral metamers if their statistics are similar. Existing methods, however, require a costly optimization process to find such metamers. To this end, we propose a novel type of statistics particularly well-suited for practical real-time rendering: smooth moments of steerable filter responses. These can be extracted from images in time that is constant in the number of pixels and in parallel over all pixels using a GPU. Further, we show that they can be compressed effectively and transmitted at low bandwidth. Finally, computing realizations of those statistics can again be performed in constant time and in parallel. This enables a new level of quality for foveated applications such as remote rendering, level-of-detail and Monte-Carlo denoising. In a user study, we show that human task performance increases and foveation artifacts are less suspicious when using our method compared to common blurring.
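To make the phrase "smooth moments of steerable filter responses" more concrete, the following is a minimal CPU sketch in Python with NumPy. It is not the authors' GPU implementation. It assumes a first-order derivative-of-Gaussian as the steerable filter (steered by the standard cosine/sine combination of the two axis derivatives) and a Gaussian window as the "smooth" pooling region. The function names (`blur`, `oriented_response`, `smooth_moments`) and the parameters `sigma` and `pool_sigma` are hypothetical and chosen only for illustration.

```python
import numpy as np

def gaussian_kernel(sigma):
    """Normalized 1D Gaussian kernel truncated at about 3 sigma."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    k = np.exp(-0.5 * (x / sigma) ** 2)
    return k / k.sum()

def blur(img, sigma):
    """Separable Gaussian blur with edge padding; output has the input's shape."""
    k = gaussian_kernel(sigma)
    r = len(k) // 2
    padded = np.pad(img, r, mode="edge")
    rows = np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 0, rows)

def oriented_response(img, theta, sigma=1.0):
    """First-order derivative-of-Gaussian response steered to angle theta.

    Assumption: a first-order filter stands in for whatever steerable
    filter bank a real system would use.
    """
    smooth = blur(img, sigma)
    gy, gx = np.gradient(smooth)  # derivatives along rows (y), then columns (x)
    return np.cos(theta) * gx + np.sin(theta) * gy

def smooth_moments(response, pool_sigma):
    """Local mean and standard deviation of a response under a Gaussian window."""
    mean = blur(response, pool_sigma)
    second = blur(response ** 2, pool_sigma)
    var = np.maximum(second - mean ** 2, 0.0)  # clamp tiny negative round-off
    return mean, np.sqrt(var)

# Usage: vertical stripes on the left half of the image, flat on the right half.
h, w = 64, 64
xs = np.arange(w, dtype=np.float64)
row = np.where(xs < w // 2, np.sin(0.8 * xs), 0.0)
image = np.tile(row, (h, 1))

# Horizontal derivative (theta = 0) sees the stripes; vertical derivative does not.
mean_h, std_h = smooth_moments(oriented_response(image, 0.0), pool_sigma=3.0)
mean_v, std_v = smooth_moments(oriented_response(image, np.pi / 2), pool_sigma=3.0)
```

In this sketch, two image regions with similar local moments per orientation would be treated as statistically similar, which is the sense in which the abstract speaks of statistics. A real-time system would presumably evaluate the pooling on the GPU and let the pooling size grow with eccentricity; neither is modeled here.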
