Poincaré ResNet

This paper introduces an end-to-end residual network that operates entirely on the Poincar\'e ball model of hyperbolic space. Hyperbolic learning has recently shown great potential for visual understanding, but is currently only performed in the penultimate layer(s) of deep networks. All visual representations are still learned through standard Euclidean networks. In this paper we investigate how to learn hyperbolic representations of visual data directly from the pixel-level. We propose Poincar\'e ResNet, a hyperbolic counterpart of the celebrated residual network, starting from Poincar\'e 2D convolutions up to Poincar\'e residual connections. We identify three roadblocks for training convolutional networks entirely in hyperbolic space and propose a solution for each: (i) Current hyperbolic network initializations collapse to the origin, limiting their applicability in deeper networks. We provide an identity-based initialization that preserves norms over many layers. (ii) Residual networks rely heavily on batch normalization, which comes with expensive Fr\'echet mean calculations in hyperbolic space. We introduce Poincar\'e midpoint batch normalization as a faster and equally effective alternative. (iii) Due to the many intermediate operations in Poincar\'e layers, we lastly find that the computation graphs of deep learning libraries blow up, limiting our ability to train on deep hyperbolic networks. We provide manual backward derivations of core hyperbolic operations to maintain manageable computation graphs.

[1]  P. Mettes,et al.  HypLL: The Hyperbolic Learning Library , 2023, ArXiv.

[2]  Mina Ghadimi Atigh,et al.  Hyperbolic Deep Learning in Computer Vision: A Survey , 2023, International Journal of Computer Vision.

[3]  Fabio Galasso,et al.  HYperbolic Self-Paced Learning for Self-Supervised Skeleton-based Action Representations , 2023, ICLR.

[4]  Albert K Lee,et al.  Hippocampal spatial representations exhibit a hyperbolic geometry that expands with experience , 2022, Nature Neuroscience.

[5]  Toan N. Nguyen,et al.  Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings , 2022, MICCAI.

[6]  Hao Jiang,et al.  Hyperbolic Knowledge Transfer with Class Hierarchy for Few-Shot Learning , 2022, IJCAI.

[7]  N. Sebe,et al.  Hyperbolic Vision Transformers: Combining Improvements in Metric Learning , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  J. Röning,et al.  Hyperbolic Uncertainty Aware Semantic Segmentation , 2022, ArXiv.

[9]  Mina Ghadimi Atigh,et al.  Hyperbolic Image Segmentation , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  J. Han,et al.  Meta hyperbolic networks for zero-shot learning , 2022, Neurocomputing.

[11]  Tom Drummond,et al.  Adaptive Poincaré Point to Set Distance for Few-Shot Classification , 2021, AAAI.

[12]  Mehrtash Harandi,et al.  Kernel Methods in Hyperbolic Spaces , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[13]  Heng Huang,et al.  Learning Better Visual Data Similarities via New Grouplet Non-Euclidean Embedding , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[14]  M. Harandi,et al.  Curvature Generation in Curved Spaces for Few-Shot Learning , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Stella X. Yu,et al.  Clipped Hyperbolic Classifiers Are Super-Hyperbolic Classifiers , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Pascal Mettes,et al.  Hyperbolic Busemann Learning with Ideal Prototypes , 2021, NeurIPS.

[17]  Heng Huang,et al.  Unsupervised Hyperbolic Metric Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Peng Li,et al.  Fully Hyperbolic Neural Networks , 2021, ACL.

[19]  Yunde Jia,et al.  A Hyperbolic-to-Hyperbolic Graph Convolutional Network , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Werner Creixell,et al.  HGAN: Hyperbolic Generative Adversarial Network , 2021, IEEE Access.

[21]  Guoying Zhao,et al.  Hyperbolic Deep Neural Networks: A Survey , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Carl Vondrick,et al.  Learning the Predictability of the Future , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Joy Hsu,et al.  Capturing implicit hierarchical structure in 3D biomedical images with self-supervised hyperbolic representations , 2020, NeurIPS.

[24]  Yixuan Li,et al.  Energy-based Out-of-distribution Detection , 2020, NeurIPS.

[25]  Albert Gu,et al.  From Trees to Continuous Embeddings and Back: Hyperbolic Hierarchical Clustering , 2020, NeurIPS.

[26]  Maximilian Nickel,et al.  Riemannian Continuous Normalizing Flows , 2020, NeurIPS.

[27]  Yu-Gang Jiang,et al.  Hyperbolic Visual Embedding Learning for Zero-Shot Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Heng Tao Shen,et al.  Searching for Actions on the Hyperbole , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Dario Pavllo,et al.  Hierarchical Image Classification using Entailment Cone Embeddings , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[30]  Christopher De Sa,et al.  Differentiating through the Fréchet Mean , 2020, ICML.

[31]  Renjie Liao,et al.  Latent Variable Modelling with Hyperbolic Normalizing Flows , 2020, ICML.

[32]  Timothy M. Hospedales,et al.  Multi-relational Poincaré Graph Embeddings , 2019, NeurIPS.

[33]  Yanfang Ye,et al.  Hyperbolic Graph Attention Network , 2019, IEEE Transactions on Big Data.

[34]  Douwe Kiela,et al.  Hyperbolic Graph Neural Networks , 2019, NeurIPS.

[35]  Jure Leskovec,et al.  Hyperbolic Graph Convolutional Neural Networks , 2019, NeurIPS.

[36]  Andrew McCallum,et al.  Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space , 2019, KDD.

[37]  David Lopez-Paz,et al.  Poincaré maps for analyzing complex hierarchies in single-cell data , 2019, Nature Communications.

[38]  Renjie Liao,et al.  Lorentzian Distance Learning for Hyperbolic Representations , 2019, ICML.

[39]  Timothy M. Hospedales,et al.  Multi-relational Poincar\'e Graph Embeddings , 2019, 1905.09791.

[40]  Valentin Khrulkov,et al.  Hyperbolic Image Embeddings , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Shoichiro Yamaguchi,et al.  A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning , 2019, ICML.

[42]  Charline Le Lan,et al.  Continuous Hierarchical Representations with Poincaré Variational Auto-Encoders , 2019, NeurIPS.

[43]  Gary Bécigneul,et al.  Poincaré GloVe: Hyperbolic Word Embeddings , 2018, ICLR.

[44]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Andrew M. Dai,et al.  Embedding Text in Hyperbolic Spaces , 2018, TextGraphs@NAACL-HLT.

[46]  Thomas Hofmann,et al.  Hyperbolic Neural Networks , 2018, NeurIPS.

[47]  Christopher De Sa,et al.  Representation Tradeoffs for Hyperbolic Embeddings , 2018, ICML.

[48]  Thomas Hofmann,et al.  Hyperbolic Entailment Cones for Learning Hierarchical Embeddings , 2018, ICML.

[49]  Douwe Kiela,et al.  Poincaré Embeddings for Learning Hierarchical Representations , 2017, NIPS.

[50]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[51]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[53]  Jonathon Shlens,et al.  Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[54]  Iasonas Kokkinos,et al.  Describing Textures in the Wild , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[55]  Silvere Bonnabel,et al.  Stochastic Gradient Descent on Riemannian Manifolds , 2011, IEEE Transactions on Automatic Control.

[56]  Rik Sarkar,et al.  Low Distortion Delaunay Embedding of Trees in Hyperbolic Plane , 2011, GD.

[57]  Abraham Albert Ungar,et al.  A Gyrovector Space Approach to Hyperbolic Geometry , 2009, A Gyrovector Space Approach to Hyperbolic Geometry.

[58]  Abhinav Valada,et al.  On Hyperbolic Embeddings in Object Detection , 2022, GCPR.

[59]  Yunhui Guo,et al.  Supplementary for Clipped Hyperbolic Classifiers Are Super-Hyperbolic Classifiers , 2022 .

[60]  Mehmet Giray Ogut,et al.  Supplementary Material for Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision , 2021 .

[61]  Andrew Y. Ng,et al.  Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .