Self-Attention and Ingredient-Attention Based Model for Recipe Retrieval from Image Queries
暂无分享,去创建一个
Stavroula G. Mougiakakou | Matthias Fontanellaz | Stergios Christodoulidis | S. Mougiakakou | S. Christodoulidis | Matthias Fontanellaz
[1] Antonio Torralba,et al. Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[2] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[3] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .
[4] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[5] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[7] Keiji Yanai,et al. Recognition of Multiple-Food Images by Detecting Candidate Regions , 2012, 2012 IEEE International Conference on Multimedia and Expo.
[8] Keiji Yanai,et al. Food image recognition using deep convolutional network with pre-training and fine-tuning , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).
[9] Sanja Fidler,et al. Skip-Thought Vectors , 2015, NIPS.
[10] Hao Chen,et al. DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning , 2017, 2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC).
[11] Marios Anthimopoulos,et al. Two-View 3D Reconstruction for Food Volume Estimation , 2017, IEEE Transactions on Multimedia.
[12] C. Spence,et al. Multisensory Integration: Space, Time and Superadditivity , 2005, Current Biology.
[13] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[14] Matthieu Guillaumin,et al. Food-101 - Mining Discriminative Components with Random Forests , 2014, ECCV.
[15] Matthieu Cord,et al. Recipe recognition with large multimodal food dataset , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).
[16] Shuang Wang,et al. Geolocalized Modeling for Dish Recognition , 2015, IEEE Transactions on Multimedia.
[17] Giovanni Maria Farinella,et al. A multi-task learning approach for meal assessment , 2018, MADiMa@IJCAI.
[18] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[19] Matthieu Cord,et al. Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings , 2018, SIGIR.
[20] N. Forouhi,et al. Global diet and health: old questions, fresh evidence, and new horizons , 2019, The Lancet.
[21] Chong-Wah Ngo,et al. Deep-based Ingredient Recognition for Cooking Recipe Retrieval , 2016, ACM Multimedia.
[22] Amaia Salvador,et al. Learning Cross-Modal Embeddings for Cooking Recipes and Food Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Edward J. Delp,et al. Food image analysis: Segmentation, identification and weight estimation , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).
[24] Keiji Yanai,et al. Automatic Expansion of a Food Image Dataset Leveraging Existing Categories with Domain Adaptation , 2014, ECCV Workshops.
[25] R. Schreyer,et al. THE DEPARTMENT OF AGRICULTURE. , 1901, Science.
[26] Beatriz Remeseiro,et al. Grab, Pay, and Eat: Semantic Food Detection for Smart Restaurants , 2018, IEEE Transactions on Multimedia.