Hyperbolic Busemann Learning with Ideal Prototypes

Hyperbolic space has become a popular choice of manifold for representation learning of arbitrary data, from tree-like structures and text to graphs. Building on the success of deep learning with prototypes in Euclidean and hyperspherical spaces, a few recent works have proposed hyperbolic prototypes for classification. Such approaches enable effective learning in low-dimensional output spaces and can exploit hierarchical relations amongst classes, but require privileged information about class labels to position the hyperbolic prototypes. In this work, we propose Hyperbolic Busemann Learning. The main idea behind our approach is to position prototypes on the ideal boundary of the Poincaré ball, which does not require prior label knowledge. To be able to compute proximities to ideal prototypes, we introduce the penalised Busemann loss. We provide theory supporting the use of ideal prototypes and the proposed loss by proving its equivalence to logistic regression in the one-dimensional case. Empirically, we show that our approach provides a natural interpretation of classification confidence, while outperforming recent hyperspherical and hyperbolic prototype approaches.

[1]  Douwe Kiela,et al.  Learning Continuous Hierarchies in the Lorentz Model of Hyperbolic Geometry , 2018, ICML.

[2]  Octavian-Eugen Ganea,et al.  Constant Curvature Graph Convolutional Networks , 2019, ICML.

[3]  Bernt Schiele,et al.  Attribute Prototype Network for Zero-Shot Learning , 2020, NeurIPS.

[4]  Huan Liu,et al.  Graph Prototypical Networks for Few-shot Learning on Attributed Networks , 2020, CIKM.

[5]  Renjie Liao,et al.  Lorentzian Distance Learning for Hyperbolic Representations , 2019, ICML.

[6]  Pietro Perona,et al.  The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[7]  Christopher De Sa,et al.  Representation Tradeoffs for Hyperbolic Embeddings , 2018, ICML.

[8]  Douwe Kiela,et al.  Poincaré Embeddings for Learning Hierarchical Representations , 2017, NIPS.

[9]  Lei Zhang,et al.  Spherical Zero-Shot Learning , 2022, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Chuan Shi,et al.  Lorentzian Graph Convolutional Networks , 2021, WWW.

[11]  Valentin Khrulkov,et al.  Hyperbolic Image Embeddings , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  M. Bridson,et al.  Metric Spaces of Non-Positive Curvature , 1999 .

[13]  Chong-Wah Ngo,et al.  Transferrable Prototypical Networks for Unsupervised Domain Adaptation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Bernard Ghanem,et al.  ActivityNet: A large-scale video benchmark for human activity understanding , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Thomas Hofmann,et al.  Hyperbolic Entailment Cones for Learning Hierarchical Embeddings , 2018, ICML.

[16]  J. Ratcliffe Foundations of Hyperbolic Manifolds , 2019, Graduate Texts in Mathematics.

[17]  B. Caputo,et al.  DEEP NEAREST CLASS MEAN CLASSIFIERS , 2018 .

[18]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[19]  Tassilo Klein,et al.  Multimodal Prototypical Networks for Few-shot Learning , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[20]  Heng Tao Shen,et al.  Searching for Actions on the Hyperbole , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Bonnie Berger,et al.  Large-Margin Classification in Hyperbolic Space , 2018, AISTATS.

[22]  Yunlong Yu,et al.  Episode-Based Prototype Generating Network for Zero-Shot Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Martin Keller-Ressel,et al.  Hydra: A method for strain-minimizing hyperbolic embedding , 2019, J. Complex Networks.

[24]  Jure Leskovec,et al.  Hyperbolic Graph Convolutional Neural Networks , 2019, NeurIPS.

[25]  Shantanu Acharya,et al.  Every child should have parents: a taxonomy refinement algorithm based on hyperbolic term embeddings , 2019, ACL.

[26]  Herbert Busemann,et al.  The geometry of geodesics , 1955 .

[27]  Douwe Kiela,et al.  Hyperbolic Graph Neural Networks , 2019, NeurIPS.

[28]  David Lopez-Paz,et al.  Poincaré maps for analyzing complex hierarchies in single-cell data , 2019, Nature Communications.

[29]  Silvere Bonnabel,et al.  Stochastic Gradient Descent on Riemannian Manifolds , 2011, IEEE Transactions on Automatic Control.

[30]  Carl Vondrick,et al.  Learning the Predictability of the Future , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Renjie Liao,et al.  Latent Variable Modelling with Hyperbolic Normalizing Flows , 2020, ICML.

[32]  W. Floyd,et al.  HYPERBOLIC GEOMETRY , 1996 .

[33]  Christopher De Sa,et al.  Numerically Accurate Hyperbolic Embeddings Using Tiling-Based Models , 2019, NeurIPS.

[34]  Rik Sarkar,et al.  Low Distortion Delaunay Embedding of Trees in Hyperbolic Plane , 2011, GD.

[35]  Christopher De Sa,et al.  Differentiating through the Fréchet Mean , 2020, ICML.

[36]  Maximilian Nickel,et al.  Riemannian Continuous Normalizing Flows , 2020, NeurIPS.

[37]  Cees Snoek,et al.  Hyperspherical Prototype Networks , 2019, NeurIPS.

[38]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[39]  Open Cross-Domain Visual Search , 2019, Comput. Vis. Image Underst..

[40]  Joshua B. Tenenbaum,et al.  Meta-Learning for Semi-Supervised Few-Shot Classification , 2018, ICLR.

[41]  Yu-Gang Jiang,et al.  Hyperbolic Visual Embedding Learning for Zero-Shot Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Qun Liu,et al.  HyperText: Endowing FastText with Hyperbolic Geometry , 2020, FINDINGS.

[43]  J. Wade Davis,et al.  Statistical Pattern Recognition , 2003, Technometrics.

[44]  Thomas Hofmann,et al.  Hyperbolic Neural Networks , 2018, NeurIPS.

[45]  Chuan Zhou,et al.  Graph Geometry Interaction Learning , 2020, NeurIPS.

[46]  Timothy M. Hospedales,et al.  Multi-relational Poincaré Graph Embeddings , 2019, NeurIPS.

[47]  Gabriela Csurka,et al.  Distance-Based Image Classification: Generalizing to New Classes at Near-Zero Cost , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Gary Bécigneul,et al.  Riemannian Adaptive Optimization Methods , 2018, ICLR.

[49]  Gary Bécigneul,et al.  Poincaré GloVe: Hyperbolic Word Embeddings , 2018, ICLR.

[50]  Joachim Denzler,et al.  Deep Learning on Small Datasets without Pre-Training using Cosine Loss , 2019, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[51]  Chen Sun,et al.  Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification , 2017, ECCV.

[52]  Suvrit Sra,et al.  Fast stochastic optimization on Riemannian manifolds , 2016, ArXiv.

[53]  Barbara Caputo,et al.  Knowledge is Never Enough: Towards Web Aided Deep Open World Recognition , 2019, 2019 International Conference on Robotics and Automation (ICRA).