Contextual HyperNetworks for Novel Feature Adaptation

While deep learning has obtained state-of-the-art results in many applications, the adaptation of neural network architectures to incorporate new output features remains a challenge, as neural networks are commonly trained to produce a fixed output dimension. This issue is particularly severe in online learning settings, where new output features, such as items in a recommender system, are added continually with few or no associated observations. As such, methods for adapting neural networks to novel features which are both time- and data-efficient are desired. To address this, we propose the Contextual HyperNetwork (CHN), an auxiliary model which generates parameters for extending the base model to a new feature, by utilizing both existing data as well as any observations and/or metadata associated with the new feature. At prediction time, the CHN requires only a single forward pass through a neural network, yielding a significant speed-up when compared to re-training and fine-tuning approaches. To assess the performance of CHNs, we use a CHN to augment a partial variational autoencoder (P-VAE), a deep generative model which can impute the values of missing features in sparsely-observed data. We show that this system obtains improved few-shot learning performance for novel features over existing imputation and meta-learning baselines across recommender systems, e-learning, and healthcare tasks.

∗Equal contribution. †Work carried out while at Microsoft Research Cambridge.
4th Workshop on Meta-Learning at NeurIPS 2020, Vancouver, Canada.
arXiv:2104.05860v1 [cs.LG] 12 Apr 2021
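To make the mechanism concrete, the following is a minimal numpy sketch of the idea described above: an auxiliary network maps a new feature's metadata, together with a permutation-invariant summary of its few associated observations, to a parameter vector for the base model in a single forward pass. All names, dimensions, and design choices here (mean pooling as the aggregator, a one-hidden-layer MLP) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_forward(x, W1, b1, W2, b2):
    """One-hidden-layer MLP with ReLU activation."""
    h = np.maximum(0.0, x @ W1 + b1)
    return h @ W2 + b2

# Illustrative dimensions (hypothetical, not from the paper):
# metadata size, encoded-observation size, hidden size, generated-parameter size
d_meta, d_obs, d_hid, d_param = 8, 16, 32, 16

# Hypernetwork weights: map [metadata ; pooled context] -> new-feature parameters
W1 = rng.normal(0.0, 0.1, (d_meta + d_obs, d_hid))
b1 = np.zeros(d_hid)
W2 = rng.normal(0.0, 0.1, (d_hid, d_param))
b2 = np.zeros(d_param)

def contextual_hypernetwork(metadata, observations):
    # Aggregate the few observations in a permutation-invariant way (mean pooling)
    context = observations.mean(axis=0)
    inp = np.concatenate([metadata, context])
    # Single forward pass produces parameters for the new feature's output head
    return mlp_forward(inp, W1, b1, W2, b2)

metadata = rng.normal(size=d_meta)           # e.g. content features of a new item
observations = rng.normal(size=(5, d_obs))   # 5 encoded observations of the new feature
theta_new = contextual_hypernetwork(metadata, observations)
print(theta_new.shape)  # -> (16,)
```

In a few-shot setting the pooled context lets the generated parameters reflect whatever observations exist, while the metadata path still produces sensible parameters when no observations are available at all (the zero-shot, cold-start case).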
