Semantics-aware content delivery framework for 3D Tele-immersion

3D Tele-immersion (3DTI) technology allows full-body, multimodal interaction among geographically dispersed users, which opens a variety of possibilities in cyber collaborative applications such as art performance, exergaming, and physical rehabilitation. However, with its great potential, the resource and quality demands of 3DTI rise inevitably, especially when some advanced applications target resource-limited computing environments with stringent scalability demands. Under these circumstances, the tradeoffs between 1) resource requirements, 2) content complexity, and 3) user satisfaction in delivery of 3DTI services are magnified. In this dissertation, we argue that these tradeoffs of 3DTI systems are actually avoidable when the underlying delivery framework of 3DTI takes the semantic information into consideration. We introduce the concept of semantic information into 3DTI, which encompasses information about the three factors: environment, activity, and user role in 3DTI applications. With semantic information, 3DTI systems are able to 1) identify the characteristics of its computing environment to allocate computing power and bandwidth to delivery of prioritized contents, 2) pinpoint and discard the dispensable content in activity capturing according to properties of target application, and 3) differentiate contents by their contributions on fulfilling the objectives and expectation of user's role in the application so that the adaptation module can allocate resource budget accordingly. With these capabilities we can change the tradeoffs into synergy between resource requirements, content complexity, and user satisfaction. We implement semantics-aware 3DTI systems to verify the performance gain on the three phases in 3DTI systems' delivery chain: capturing phase, dissemination phase, and receiving phase. By introducing semantics information to distinct 3DTI systems, the efficiency improvements brought by our semantics-aware content delivery framework are validated under different application requirements, different scalability bottlenecks, and different user and application models. To sum up, in this dissertation we aim to change the tradeoff between requirements, complexity, and satisfaction in 3DTI services by exploiting the semantic information about the computing environment, the activity, and the user role upon the underlying delivery systems of 3DTI. The devised mechanisms will enhance the efficiency of 3DTI systems targeting on serving different purposes and 3DTI applications with different computation and scalability requirements. iii To my dear family.

[1]  Greg Welch,et al.  Smart instrumented training ranges: bringing automated system solutions to support critical domain needs , 2013 .

[2]  Indranil Gupta,et al.  3DTI Amphitheater , 2015, ACM Trans. Multim. Comput. Commun. Appl..

[3]  Yuval Shavitt,et al.  An Optimal Median Calculation Algorithm for Estimating Internet Link Delays from Active Measurements , 2007, 2007 Workshop on End-to-End Monitoring Techniques and Services.

[4]  Bo Li,et al.  CoolStreaming/DONet: a data-driven overlay network for peer-to-peer live media streaming , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[5]  Wei Niu,et al.  Human activity detection and recognition for video surveillance , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[6]  Klara Nahrstedt,et al.  Metadata-Based Activity Analysis in 3D Tele-Immersion , 2016, 2016 IEEE Second International Conference on Multimedia Big Data (BigMM).

[7]  George Coulouris,et al.  Distributed systems - concepts and design , 1988 .

[8]  Klara Nahrstedt,et al.  Next generation session management for 3D teleimmersive interactive environments , 2010, Multimedia Tools and Applications.

[9]  Ruzena Bajcsy,et al.  Tele-MFAsT: Kinect-Based Tele-Medicine Tool for Remote Motion and Function Assessment , 2014, MMVR.

[10]  Titus B. Zaharia,et al.  TFAN: A low complexity 3D mesh compression algorithm , 2009, Comput. Animat. Virtual Worlds.

[11]  Bart Selman,et al.  Human Activity Detection from RGBD Images , 2011, Plan, Activity, and Intent Recognition.

[12]  Klara Nahrstedt,et al.  3D teleimmersive activity classification based on application-system metadata , 2013, MM '13.

[13]  Du Ling End-to-end available bandwidth measurement methodology , 2011 .

[14]  Klara Nahrstedt,et al.  Activity-aware adaptive compression: a morphing-based frame synthesis application in 3DTI , 2013, ACM Multimedia.

[15]  Klara Nahrstedt,et al.  "I'm the Jedi!" - A Case Study of User Experience in 3D Tele-immersive Gaming , 2010, 2010 IEEE International Symposium on Multimedia.

[16]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[17]  Robert L. Mason,et al.  Statistical Principles in Experimental Design , 2003 .

[18]  Ruzena Bajcsy,et al.  A multi-stream adaptation framework for bandwidth management in 3D tele-immersion , 2006, NOSSDAV '06.

[19]  Peter Neri,et al.  How inherently noisy is human sensory processing? , 2010, Psychonomic bulletin & review.

[20]  Adam Wolisz,et al.  EvalVid - A Framework for Video Transmission and Quality Evaluation , 2003, Computer Performance Evaluation / TOOLS.

[21]  Mohamed Hefeeda,et al.  A DASH-based Free Viewpoint Video Streaming System , 2014, NOSSDAV 2014.

[22]  BajcsyRuzena,et al.  Enabling multi-party 3D tele-immersive environments with ViewCast , 2010 .

[23]  Klara Nahrstedt,et al.  Quality of experience in distributed interactive multimedia environments: toward a theoretical framework , 2009, ACM Multimedia.

[24]  Alon Wolf,et al.  An Adaptive Home-Use Robotic Rehabilitation System for the Upper Body , 2014, IEEE Journal of Translational Engineering in Health and Medicine.

[25]  Klara Nahrstedt,et al.  A real-time remote rendering system for interactive mobile graphics , 2012, TOMCCAP.

[26]  Klara Nahrstedt,et al.  F.Live: Towards interactive live broadcast FTV experience , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[27]  Hiroshi Sankoh,et al.  Interactive music video application for smartphones based on free-viewpoint video and audio rendering , 2012, ACM Multimedia.

[28]  Klara Nahrstedt,et al.  Activity-based synthesized frame generation in 3DTI video , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[29]  Jon Crowcroft,et al.  Quality-of-Service Routing for Supporting Multimedia Applications , 1996, IEEE J. Sel. Areas Commun..

[30]  Klara Nahrstedt,et al.  Perception-based playout scheduling for high-quality real-time interactive multimedia , 2012, 2012 Proceedings IEEE INFOCOM.

[31]  Klara Nahrstedt,et al.  Impact of Morphing-Based Frame Synthesis on Bandwidth Optimization for 3DTI Video , 2013, 2013 IEEE International Symposium on Multimedia.

[32]  Mark de Berg,et al.  Computational geometry: algorithms and applications , 1997 .

[33]  John A. Barnden,et al.  Semantic Networks , 1998, Encyclopedia of Social Network Analysis and Mining.

[34]  Klara Nahrstedt,et al.  Prioritized evolutionary optimization in open session management for 3D tele-immersion , 2013, MMSys.

[35]  Klara Nahrstedt 3D teleimmersion for remote injury assessment , 2012, SAM '12.

[36]  Philippe Fraisse,et al.  Adaptive Interface for Personalized Center of Mass Self-Identification in Home Rehabilitation , 2015, IEEE Sensors Journal.

[37]  Aniket Kittur,et al.  Crowdsourcing user studies with Mechanical Turk , 2008, CHI.

[38]  Manish Jain,et al.  End-to-end available bandwidth: measurement methodology, dynamics, and relation with TCP throughput , 2003, TNET.

[39]  Klara Nahrstedt,et al.  Bundle of Streams: Concept and Evaluation in Distributed Interactive Multimedia Environments , 2010, 2010 IEEE International Symposium on Multimedia.

[40]  Andrew Priorov,et al.  Methods of complete surface reconstruction through merging of point clouds according to stereo vision data , 2014, Proceedings of 16th Conference of Open Innovations Association FRUCT.

[41]  Zhenyu Yang A multi-stream adaptation framework for tele-immersion , 2006, MM '06.

[42]  Klara Nahrstedt,et al.  SyncCast: synchronized dissemination in multi-site interactive 3D tele-immersion , 2011, MMSys.

[43]  Klara Nahrstedt,et al.  4D TeleCast: Towards Large Scale Multi-site and Multi-view Dissemination of 3DTI Contents , 2012, 2012 IEEE 32nd International Conference on Distributed Computing Systems.

[44]  Greg Welch,et al.  Illuminating collaboration in emergency health care situations: paramedic-physician collaboration and 3D telepresence technology , 2014, Inf. Res..

[45]  Klara Nahrstedt,et al.  Stevens' Power Law in 3D Tele-immersion: Towards Subjective Modeling of Multimodal Cyber Interaction , 2014, ACM Multimedia.

[46]  Ruzena Bajcsy,et al.  3D teleimmersion for collaboration and interaction of geographically distributed users , 2012, Virtual Reality.

[47]  ITU-T Rec. P.910 (04/2008) Subjective video quality assessment methods for multimedia applications , 2009 .

[48]  Stewart Robinson,et al.  Simulation: The Practice of Model Development and Use , 2004 .

[49]  Ruzena Bajcsy,et al.  Towards Multi-Site Collaboration in 3D Tele-Immersive Environments , 2008, 2008 The 28th International Conference on Distributed Computing Systems.

[50]  Indranil Gupta,et al.  3DTI amphitheater: a manageable 3DTI environment with hierarchical stream prioritization , 2014, MMSys '14.

[51]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[52]  Klara Nahrstedt,et al.  Advancing interactive collaborative mediums through tele-immersive dance (TED): a symbiotic creativity and design environment for art and computer science , 2008, ACM Multimedia.

[53]  Henry Fuchs,et al.  Immersive 3D Telepresence , 2014, Computer.

[54]  Thaddeus Beier,et al.  Feature-based image metamorphosis , 1992, SIGGRAPH.

[55]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[56]  Ruzena Bajcsy,et al.  Color-plus-depth level-of-detail in 3D tele-immersive video: a psychophysical approach , 2011, MM '11.

[57]  RTP Payload Format for JPEG-compressed Video , 1996, RFC.

[58]  Ruzena Bajcsy,et al.  High-Quality Visualization for Geographically Distributed 3-D Teleimmersive Applications , 2011, IEEE Transactions on Multimedia.

[59]  Klara Nahrstedt,et al.  TEEVE endpoint: towards the ease of 3D tele-immersive application development , 2013, MM '13.

[60]  Frans Mäyrä,et al.  Fundamental Components of the Gameplay Experience: Analysing Immersion , 2005, DiGRA Conference.

[61]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[62]  Abdulsalam Yassine,et al.  A DASH-based 3D multi-view video rate control system , 2014, 2014 8th International Conference on Signal Processing and Communication Systems (ICSPCS).

[63]  Rahul Malik,et al.  Immersive environments for rehabilitation activities , 2009, ACM Multimedia.

[64]  Michele Sanna,et al.  A 3D tele-immersion system based on live captured mesh geometry , 2013, MMSys.

[65]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.

[66]  K. K. Ramakrishnan,et al.  Understanding couch potatoes: measurement and modeling of interactive usage of IPTV at large scale , 2011, IMC '11.