
CUDA-Optimized real-time rendering of a Foveated Visual System

The spatially varying field of the human visual system has recently received a resurgence of interest with the development of virtual reality (VR) and neural networks. The computational demands of the high-resolution rendering desired for VR can be offset by savings in the periphery, while neural networks trained with foveated input have shown perceptual gains in i.i.d. and o.o.d. generalization. In this paper, we present a technique that exploits the CUDA GPU architecture to efficiently generate Gaussian-based foveated images at high definition (1920x1080 px) in real time (165 Hz). It uses several orders of magnitude more pooling regions than previous Gaussian-based foveation algorithms, produces a smoothly foveated image that requires no further blending or stitching, and can be fit to any contrast sensitivity function. The approach can be adapted from Gaussian blurring to any eccentricity-dependent image processing. Our algorithm can meet the demand for experiments that evaluate the role of spatially varying processing across biological and artificial agents, because foveation can be added on top of existing systems rather than forcing their redesign (an emulated foveated renderer). Altogether, this paper demonstrates how a GPU with a CUDA block-wise architecture can be employed for radially variant rendering, with opportunities for more complex post-processing to ensure a metameric foveation scheme. Code is provided.
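To make the idea of eccentricity-dependent Gaussian blurring concrete, here is a minimal CPU sketch in pure Python. It is not the authors' CUDA implementation. The linear blur schedule and the names `sigma_at`, `foveate`, `fovea_radius`, and `slope` are assumptions made for illustration only. Each output pixel is computed independently of the others, which is the kind of property that makes this computation a natural fit for a block-wise GPU kernel.

```python
import math


def sigma_at(x, y, fx, fy, fovea_radius=2.0, slope=0.25):
    """Hypothetical blur schedule (an assumption, not taken from the paper):
    no blur inside the fovea, then sigma grows linearly with eccentricity,
    measured in pixels from the fixation point (fx, fy)."""
    eccentricity = math.hypot(x - fx, y - fy)
    return max(0.0, slope * (eccentricity - fovea_radius))


def foveate(image, fx, fy, fovea_radius=2.0, slope=0.25):
    """Blur a grayscale image (list of rows of floats) with a Gaussian whose
    width depends on each pixel's distance from the fixation point."""
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sigma = sigma_at(x, y, fx, fy, fovea_radius, slope)
            if sigma < 1e-6:
                # Inside the fovea: copy the pixel unchanged.
                out[y][x] = float(image[y][x])
                continue
            radius = max(1, math.ceil(3 * sigma))  # truncate kernel at ~3 sigma
            acc = 0.0
            norm = 0.0
            for dy in range(-radius, radius + 1):
                yy = min(max(y + dy, 0), h - 1)  # clamp to the image border
                for dx in range(-radius, radius + 1):
                    xx = min(max(x + dx, 0), w - 1)
                    weight = math.exp(-(dx * dx + dy * dy) / (2.0 * sigma * sigma))
                    acc += weight * image[yy][xx]
                    norm += weight
            # Normalize so that flat regions keep their brightness.
            out[y][x] = acc / norm
    return out


# Usage: foveate a 16x16 checkerboard while fixating its center.
checker = [[float((x + y) % 2) for x in range(16)] for y in range(16)]
foveated = foveate(checker, fx=8, fy=8)
```

Because every pixel gets its own kernel width, the blur varies continuously across the image and no seams appear between regions. In this sketch that smoothness comes from the per-pixel schedule itself, not from any blending step.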

Tomaso Poggio | Arturo Deza | Elian Malkin
