Optimal Representations for Adaptive Streaming in Interactive Multiview Video Systems

Interactive multiview video streaming (IMVS) services permit to remotely navigate within a 3D scene with an immersive experience. This is possible by transmitting a set of reference camera views (anchor views), which are used by the clients to freely navigate in the scene and possibly synthesize additional viewpoints of interest. From a networking perspective, the big challenge in IMVS systems is to deliver to each client the best set of anchor views that maximizes the navigation quality, minimizes the view-switching delay and yet satisfies the network constraints. Integrating adaptive streaming solutions in free-viewpoint systems offers a promising solution to deploy IMVS in large and heterogeneous scenarios, as long as the multiview video representations on the server are properly selected. Therefore, we propose to optimize the multiview data at the server by minimizing the overall resource requirements while offering a good navigation quality to the different users. We propose a representation set optimization problem for multiview adaptive streaming systems, and we show that it is NP-hard. Therefore, we introduce the concept of multiview navigation segment that permits to cast the video representation set selection as an integer linear programming problem with a bounded computational complexity. We then show that the proposed solution reduces the computational complexity, while preserving optimality in most of the 3D scenes. We finally provide simulation results for different classes of users and show the gain offered by an optimal multiview video representation selection compared to recommended representation sets (e.g., Netflix and Apple ones) or to a baseline representation selection algorithm, where the encoding parameters are decided a priori for all the camera views.

[1]  Sachin Lodha,et al.  Multi-view image inpainting with sparse representations , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[2]  Alberto Blanc,et al.  Transcoding live adaptive video streams at a massive scale in the cloud , 2015, MMSys.

[3]  Bernd Girod,et al.  Distributed Video Coding , 2005, Proceedings of the IEEE.

[4]  Thomas Maugey,et al.  Encoder-Driven Inpainting Strategy in Multiview Video Compression , 2016, IEEE Transactions on Image Processing.

[5]  Wojciech Matusik,et al.  Anahita: A System for 3D Video Streaming with Depth Customization , 2014, ACM Multimedia.

[6]  Oscar C. Au,et al.  Merge Frame Design for Video Stream Switching Using Piecewise Constant Functions , 2015, IEEE Transactions on Image Processing.

[7]  Federico Chiariotti,et al.  Online learning adaptation strategy for DASH clients , 2016, MMSys.

[8]  Pascal Frossard,et al.  Interactive free viewpoint video streaming using prioritized network coding , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[9]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Thomas Maugey,et al.  Optimized Packet Scheduling in Multiview Video Navigation Systems , 2015, IEEE Transactions on Multimedia.

[11]  Thomas Maugey,et al.  Graph-Based Representation for Multiview Image Geometry , 2015, IEEE Transactions on Image Processing.

[12]  Chia-Wen Lin,et al.  mDASH: A Markov Decision-Based Rate Adaptation Approach for Dynamic HTTP Streaming , 2016, IEEE Transactions on Multimedia.

[13]  Dirk Staehle,et al.  QoE-Based Traffic and Resource Management for Adaptive HTTP Video Delivery in LTE , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Klara Nahrstedt,et al.  OmniViewer: Enabling Multi-modal 3D DASH , 2015, ACM Multimedia.

[15]  Luce Morin,et al.  A study of depth/texture bit-rate allocation in multi-view video plus depth compression , 2013, Ann. des Télécommunications.

[16]  Pascal Frossard,et al.  In-Network View Synthesis for Interactive Multiview Video Systems , 2015, IEEE Transactions on Multimedia.

[17]  Pascal Frossard,et al.  Re-sampling and interpolation of DIBR-synthesized images using graph-signal smoothness prior , 2015, 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[18]  Thomas Maugey,et al.  Optimal layered representation for adaptive interactive multiview video streaming , 2015, J. Vis. Commun. Image Represent..

[19]  Alberto Blanc,et al.  Optimal Selection of Adaptive Streaming Representations , 2014, ACM Trans. Multim. Comput. Commun. Appl..

[20]  Pascal Frossard,et al.  Optimal Lagrange Multipliers for Dependent Rate Allocation in Video Coding , 2016, Signal Process. Image Commun..

[21]  Mohamed Hefeeda,et al.  A DASH-based Free Viewpoint Video Streaming System , 2014, NOSSDAV 2014.

[22]  Toshiaki Fujii,et al.  View Generation with 3D Warping Using Depth Information for FTV , 2008, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[23]  Antonio Ortega,et al.  On Dependent Bit Allocation for Multiview Image Coding With Depth-Image-Based Rendering , 2011, IEEE Transactions on Image Processing.

[24]  Pascal Frossard,et al.  Graph-based representation and coding of 3D images for interactive multiview navigation , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Jia Guo,et al.  A cloud-assisted DASH-based Scalable Interactive Multiview Video Streaming framework , 2015, 2015 Picture Coding Symposium (PCS).

[26]  覃政 Google Cardboard:伟大的搅局者 , 2014 .

[27]  David Tschumperlé,et al.  Depth-aware patch-based image disocclusion for virtual view synthesis , 2015, SIGGRAPH Asia Technical Briefs.

[28]  Brendan Iribe Oculus Rift를 이용한 체감형 게임 구현 , 2014 .

[29]  Yonggang Wen,et al.  QoE-Driven Cache Management for HTTP Adaptive Bit Rate Streaming Over Wireless Networks , 2012, IEEE Transactions on Multimedia.

[30]  Touradj Ebrahimi,et al.  Objective quality metrics for video scalability , 2013, 2013 IEEE International Conference on Image Processing.

[31]  Abdulsalam Yassine,et al.  A DASH-based HEVC multi-view video streaming system , 2015, Journal of Real-Time Image Processing.

[32]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Mohamed Hefeeda,et al.  Adaptive streaming of interactive free viewpoint videos to heterogeneous clients , 2016, MMSys.

[34]  Dong Tian,et al.  View synthesis techniques for 3D video , 2009, Optical Engineering + Applications.

[35]  Abdulsalam Yassine,et al.  A DASH-based 3D multi-view video rate control system , 2014, 2014 8th International Conference on Signal Processing and Communication Systems (ICSPCS).

[36]  Pascal Frossard,et al.  A comparative study of DASH representation sets using real user characteristics , 2016, NOSSDAV.

[37]  Yong Man Ro,et al.  An Evaluation of Bitrate Adaptation Methods for HTTP Live Streaming , 2014, IEEE Journal on Selected Areas in Communications.