Conditional Neural Processes (CNP; Garnelo et al., 2018a) are an attractive family of metalearning models which produce well-calibrated predictions, enable fast inference at test time, and are trainable via a simple maximum likelihood procedure. A limitation of CNPs is their inability to model dependencies in the outputs. This significantly hurts predictive performance and renders it impossible to draw coherent function samples, which limits the applicability of CNPs in downstream applications and decision making. Neural Processes (NPs; Garnelo et al., 2018b) attempt to alleviate this issue by using latent variables, relying on these to model output dependencies, but introduces difficulties stemming from approximate inference. One recent alternative (Bruinsma et al., 2021), which we refer to as the FullConvGNP, models dependencies in the predictions while still being trainable via exact maximum-likelihood. Unfortunately, the FullConvGNP relies on expensive 2D-dimensional convolutions, which limit its applicability to only one-dimensional data. In this work, we present an alternative way to model output dependencies which also lends itself maximum likelihood training but, unlike the FullConvGNP, can be scaled to twoand threedimensional data. The proposed models exhibit good performance in synthetic experiments.
[1]
Tuan Anh Le,et al.
Empirical Evaluation of Neural Process Objectives
,
2018
.
[2]
Y. Teh,et al.
Equivariant Conditional Neural Processes
,
2020,
ArXiv.
[3]
B. Øksendal.
Stochastic differential equations : an introduction with applications
,
1987
.
[4]
Y. Matsuo,et al.
Group Equivariant Conditional Neural Processes
,
2021,
ICLR.
[5]
Carl E. Rasmussen,et al.
Gaussian processes for machine learning
,
2005,
Adaptive computation and machine learning.
[6]
Richard E. Turner,et al.
Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes
,
2020,
NeurIPS.
[7]
Max Welling,et al.
E(n) Equivariant Graph Neural Networks
,
2021,
ICML.