Towards a sparse low-rank regression model for memorability prediction of images

Abstract Nowadays, it is inevitable to experience plenty of images in everyday life. Some of them are remembered for a long time while others are forgotten after only a glance. It has been proved that memorability is an intrinsically stable property of images which measures the degree to which images are remembered. Although some work have been conducted to investigate the factors that make an image memorable, yet studies on designing robust models to predict image memorability have rarely been reported. Inspired by the good property of Low-Rank Representation (LRR) in dealing with noisy data, in this paper we propose a sparse low-rank regression framework for image memorability prediction, in which a projection matrix, applied to capture the global low-rank structure embedded in original feature space, and a sparse coefficient vector, applied to build connections between images and their memorability scores, are jointly learnt to guarantee the superior performance. In particular, to enable our proposed approach to discover discriminant attribute features automatically, we impose a structured sparsity constraint on the reconstruction error matrix against the existence of noisy attributes. We develop an alternating direction algorithm by applying augmented Lagrangian multipliers method to solve the objective function of our model. Experiments conducted on two publicly available memorability datasets demonstrates the effectiveness of the proposed method. Source code is freely available: https://www.github.com/HodorHoldthedoor/image-memorability.

[1]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Chong-Wah Ngo,et al.  Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study , 2010, IEEE Transactions on Multimedia.

[3]  Sitian Qin,et al.  A Two-Layer Recurrent Neural Network for Nonsmooth Convex Optimization Problems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Liqiang Nie,et al.  Predicting Image Memorability Through Adaptive Transfer Learning From External Sources , 2017, IEEE Transactions on Multimedia.

[5]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Yong Yu,et al.  Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[7]  Ke Lu,et al.  Low-Rank Discriminant Embedding for Multiview Learning , 2017, IEEE Transactions on Cybernetics.

[8]  Vladimir Pavlovic,et al.  Relative spatial features for image memorability , 2013, ACM Multimedia.

[9]  Zi Huang,et al.  Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis , 2013, IEEE Transactions on Multimedia.

[10]  Ali Jalali,et al.  Low-Rank Matrix Recovery From Errors and Erasures , 2013, IEEE Transactions on Information Theory.

[11]  Sitian Qin,et al.  A One-Layer Recurrent Neural Network for Pseudoconvex Optimization Problems With Equality and Inequality Constraints , 2017, IEEE Transactions on Cybernetics.

[12]  Jun Wang,et al.  LRSR: Low-Rank-Sparse representation for subspace clustering , 2016, Neurocomputing.

[13]  Daming Shi,et al.  Low-Rank-Sparse Subspace Representation for Robust Regression , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Yun Fu,et al.  Low-Rank Common Subspace for Multi-view Learning , 2014, 2014 IEEE International Conference on Data Mining.

[15]  Hanspeter Pfister,et al.  What Makes a Visualization Memorable? , 2013, IEEE Transactions on Visualization and Computer Graphics.

[16]  Yong Luo,et al.  Low-Rank Multi-View Learning in Matrix Completion for Multi-Label Image Classification , 2015, AAAI.

[17]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[18]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[19]  Changyin Sun,et al.  Kernel Low-Rank Representation for face recognition , 2015, Neurocomputing.

[20]  Jianxiong Xiao,et al.  Image memorability and visual inception , 2012, SIGGRAPH Asia Technical Briefs.

[21]  Antonio Torralba,et al.  Understanding and Predicting Image Memorability at a Large Scale , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[22]  Shuicheng Yan,et al.  Correlation Adaptive Subspace Segmentation by Trace Lasso , 2013, 2013 IEEE International Conference on Computer Vision.

[23]  Antonio Torralba,et al.  Modifying the Memorability of Face Photographs , 2013, 2013 IEEE International Conference on Computer Vision.

[24]  Jianxiong Xiao,et al.  What makes an image memorable? , 2011, CVPR 2011.

[25]  Feiping Nie,et al.  Efficient Image Classification via Multiple Rank Regression , 2013, IEEE Transactions on Image Processing.

[26]  Yi Yang,et al.  Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations , 2015, ACM Multimedia.

[27]  Wilbert O. Galitz,et al.  The Essential Guide to User Interface Design: An Introduction to GUI Design Principles and Techniques , 1996 .

[28]  Liang Wang,et al.  Hierarchical feature coding for image classification , 2014, Neurocomputing.

[29]  Jiebo Luo,et al.  Indoor vs outdoor classification of consumer photographs using low-level and semantic features , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[30]  Yandong Hou,et al.  Sparse representation-based robust face recognition by graph regularized low-rank sparse representation recovery , 2015, Neurocomputing.

[31]  Aykut Erdem,et al.  Predicting memorability of images using attention-driven spatial pooling and image semantics , 2015, Image Vis. Comput..

[32]  Patrick Le Callet,et al.  Deep Learning for Image Memorability Prediction: the Emotional Bias , 2016, ACM Multimedia.

[33]  Bernard Ghanem,et al.  What Makes an Object Memorable? , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Jianxiong Xiao,et al.  What Makes a Photograph Memorable? , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Philip H. S. Torr,et al.  An embarrassingly simple approach to zero-shot learning , 2015, ICML.

[36]  Chao Yang,et al.  Attentive Group Recommendation , 2018, SIGIR.

[37]  Sitian Qin,et al.  A neurodynamic approach to convex optimization problems with general constraint , 2016, Neural Networks.

[38]  Timothy F. Brady,et al.  Conceptual Distinctiveness Supports Detailed Visual Long-term Memory for Real-world Objects the Fidelity of Long-term Memory for Visual Information , 2022 .

[39]  Meng Wang,et al.  Low-Rank Multi-View Embedding Learning for Micro-Video Popularity Prediction , 2018, IEEE Transactions on Knowledge and Data Engineering.

[40]  Nenghai Yu,et al.  Non-negative low rank and sparse graph for semi-supervised learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Jian Yang,et al.  Low-rank representation based discriminative projection for robust feature extraction , 2013, Neurocomputing.

[42]  Meng Wang,et al.  Oracle in Image Search: A Content-Based Approach to Performance Prediction , 2012, TOIS.

[43]  P. Sedgwick Spearman’s rank correlation coefficient , 2018, British Medical Journal.

[44]  Shanmuganathan Raman,et al.  Robust PCA-based solution to image composition using augmented Lagrange multiplier (ALM) , 2016, The Visual Computer.

[45]  Shuicheng Yan,et al.  Latent Low-Rank Representation for subspace segmentation and feature extraction , 2011, 2011 International Conference on Computer Vision.

[46]  Zhixun Su,et al.  Linearized Alternating Direction Method with Adaptive Penalty for Low-Rank Representation , 2011, NIPS.

[47]  A. Torralba,et al.  Intrinsic and extrinsic effects on image memorability , 2015, Vision Research.

[48]  Bing Li,et al.  Predicting Image Memorability by Multi-view Adaptive Regression , 2015, ACM Multimedia.

[49]  Chao Zhang,et al.  Integrated Low-Rank-Based Discriminative Feature Learning for Recognition , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[50]  Antonio Torralba,et al.  Understanding the Intrinsic Memorability of Images , 2011, NIPS.

[51]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[52]  Aude Oliva,et al.  Visual long-term memory has a massive storage capacity for object details , 2008, Proceedings of the National Academy of Sciences.

[53]  Wai Keung Wong,et al.  Low-Rank Embedding for Robust Image Feature Extraction , 2017, IEEE Transactions on Image Processing.

[54]  Shuyuan Yang,et al.  Low-rank representation with local constraint for graph construction , 2013, Neurocomputing.

[55]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.