Advances in Multimedia Modeling

This talk covers perspectives on adaptive video streaming and how such techniques are essential for systems with heterogeneous devices. Adaptation is possible using flexible video coding techniques, such as the H.264 Scalable VIdeo Coding (SVC). In this context, it is important to consider various aspects of the video coding system (interdependencies, quality layers, QoE, etc) as well of the delivery architectures (client server, P2P, connectivity, etc). The first part relates to quality adaptation algorithms that match the video quality with available local and system resources without any a-priori knowledge about those resources. Subsequently in the second part, mechanisms that use Quality of Experience (QoE) metrics to enhance its performance for the users will be shown. The decision of which SVC quality to choose is usually driven by QoS metrics, such as throughput. Instead, it will be presented how objective QoE of the different SVC qualities can be used in the decision process. The talk concludes by presenting the major further research activities in this research area.

[1]  Zhou Wang,et al.  No-reference perceptual quality assessment of JPEG compressed images , 2002, Proceedings. International Conference on Image Processing.

[2]  Alan C. Bovik,et al.  A Statistical Evaluation of Recent Full Reference Image Quality Assessment Algorithms , 2006, IEEE Transactions on Image Processing.

[3]  Lior Rokach,et al.  Introduction to Recommender Systems Handbook , 2011, Recommender Systems Handbook.

[4]  Xuelong Li,et al.  Modality Mixture Projections for Semantic Video Event Detection , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[6]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[7]  Kian-Lee Tan,et al.  A novel framework for efficient automated singer identification in large music databases , 2009, TOIS.

[8]  Meng Wang,et al.  Visual query suggestion , 2010, ACM Trans. Multim. Comput. Commun. Appl..

[9]  Diane M. Strong,et al.  AIMQ: a methodology for information quality assessment , 2002, Inf. Manag..

[10]  Qi Tian,et al.  Less is More: Efficient 3-D Object Retrieval With Query View Selection , 2011, IEEE Transactions on Multimedia.

[11]  M. Gilly,et al.  We Are What We Post? Self‐Presentation in Personal Web Space , 2003 .

[12]  Qionghai Dai,et al.  Contourlet-based image quality assessment for synthesised virtual image , 2010 .

[13]  T. Daugherty,et al.  Exploring Consumer Motivations for Creating User-Generated Content , 2008 .

[14]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[15]  Xian-Sheng Hua,et al.  Bayesian Visual Reranking , 2011, IEEE Transactions on Multimedia.

[16]  Jiebo Luo,et al.  Aesthetics and Emotions in Images , 2011, IEEE Signal Processing Magazine.

[17]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[18]  Dacheng Tao,et al.  Visual Reranking: From Objectives to Strategies , 2011, IEEE MultiMedia.

[19]  Meng Wang,et al.  Active learning in multimedia annotation and retrieval: A survey , 2011, TIST.

[20]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[21]  Yue Gao,et al.  Camera Constraint-Free View-Based 3-D Object Retrieval , 2012, IEEE Transactions on Image Processing.

[22]  Yue Gao,et al.  Cross-View Down/Up-Sampling Method for Multiview Depth Video Coding , 2012, IEEE Signal Processing Letters.

[23]  Bing Liu,et al.  Opinion Mining and Sentiment Analysis , 2011 .

[24]  Xian-Sheng Hua,et al.  Towards a Relevant and Diverse Search of Social Images , 2010, IEEE Transactions on Multimedia.

[25]  Bingbing Ni,et al.  Assistive tagging: A survey of multimedia tagging with human-computer joint exploration , 2012, CSUR.

[26]  Qi Tian,et al.  Mining flickr landmarks by modeling reconstruction sparsity , 2011, TOMCCAP.

[27]  Meng Wang,et al.  Visual query suggestion , 2009, ACM Multimedia.

[28]  Yi Yang,et al.  Interactive Video Indexing With Statistical Active Learning , 2012, IEEE Transactions on Multimedia.

[29]  Gangyi Jiang,et al.  Research on subjective stereoscopic image quality assessment , 2009, Electronic Imaging.

[30]  Meng Wang,et al.  Modeling concept dynamics for large scale music search , 2012, SIGIR '12.