Approaches to Food/Non-food Image Classification Using Deep Learning in Cookpad
暂无分享,去创建一个
In this paper we report our approach to image classification, in particular to the food/non-food image classification problem, as used by our Cooking Log product of Cookpad Inc. We augment our existing services with an architecture which includes a loosely-connected and asynchronous image analysis module. One challenge of the classification is that the non-food class is very vast and varied and can only be defined in context. We find that having the nonfood class consist of multiple subclasses effectively improves both precision and recall by capturing different types of features in the images of that class.
[1] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[2] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[3] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[4] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).