论文信息 - Increasing pose comprehension through augmented reality reenactment

Increasing pose comprehension through augmented reality reenactment

Standard video does not capture the 3D aspect of human motion, which is important for comprehension of motion that may be ambiguous. In this paper, we apply augmented reality (AR) techniques to give viewers insight into 3D motion by allowing them to manipulate the viewpoint of a motion sequence of a human actor using a handheld mobile device. The motion sequence is captured using a single RGB-D sensor, which is easier for a general user, but presents the unique challenge of synthesizing novel views using images captured from a single viewpoint. To address this challenge, our proposed system reconstructs a 3D model of the actor, then uses a combination of the actor’s pose and viewpoint similarity to find appropriate images to texture it. The system then renders the 3D model on the mobile device using visual SLAM to create a map in order to use it to estimate the mobile device’s camera pose relative to the original capturing environment. We call this novel view of a moving human actor a reenactment, and evaluate its usefulness and quality with an experiment and a survey.

Naokazu Yokoya | Tomokazu Sato | Yuta Nakashima | Fabian Lorenzo Dayrit

[1] Harry Shum,et al. Review of image-based rendering techniques , 2000, Visual Communications and Image Processing.

[2] Tovi Grossman,et al. YouMove: enhancing movement training with an augmented reality mirror , 2013, UIST.

[3] Ramesh Raskar,et al. Image-based visual hulls , 2000, SIGGRAPH.

[4] Andrew W. Fitzgibbon,et al. Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[5] David W. Murray,et al. Video-rate localization in multiple maps for wearable augmented reality , 2008, 2008 12th IEEE International Symposium on Wearable Computers.

[6] Markus H. Gross,et al. Scalable 3D video of dynamic scenes , 2005, The Visual Computer.

[7] Andrew W. Fitzgibbon,et al. KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[8] Richard Szeliski,et al. High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[9] Ronald Azuma,et al. A Survey of Augmented Reality , 1997, Presence: Teleoperators & Virtual Environments.

[10] Hans-Peter Seidel,et al. Free-viewpoint video of human actors , 2003, ACM Trans. Graph..

[11] Stephen Lin,et al. Image-based clothes animation for virtual fitting , 2012, SIGGRAPH Asia Technical Briefs.

[12] Hans-Werner Gellersen,et al. MotionMA: motion modelling and analysis by demonstration , 2013, CHI.

[13] Bernd Fröhlich,et al. Immersive Group-to-Group Telepresence , 2013, IEEE Transactions on Visualization and Computer Graphics.

[14] Martin Klaudiny,et al. Single-View RGBD-Based Reconstruction of Dynamic Human Geometry , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[15] Qionghai Dai,et al. Free-Viewpoint Video of Human Actors Using Multiple Handheld Kinects , 2013, IEEE Transactions on Cybernetics.

[16] Hans-Peter Seidel,et al. Performance capture from sparse multi-view video , 2008, ACM Trans. Graph..

[17] Jitendra Malik,et al. Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[18] Gerhard Reitmayr,et al. Image-based clothes transfer , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[19] Petros Daras,et al. Real-Time, Full 3-D Reconstruction of Moving Foreground Objects From Multiple Consumer Depth Cameras , 2013, IEEE Transactions on Multimedia.

[20] Steven K. Feiner,et al. Augmented reality in the psychomotor phase of a procedural task , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[21] Jan Kautz,et al. Video-based characters: creating new human performances from a multi-view video database , 2011, SIGGRAPH 2011.

[22] Cristina V. Lopes,et al. A Spatial Augmented Reality Rehab System for Post-Stroke Hand Rehabilitation , 2013, MMVR.

[23] Zhengyou Zhang,et al. A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24] Naokazu Yokoya,et al. Free-viewpoint AR human-motion reenactment based on a single RGB-D video stream , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[25] G. Klein,et al. Parallel Tracking and Mapping for Small AR Workspaces , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[26] Markus H. Gross,et al. 3D video recorder , 2002, 10th Pacific Conference on Computer Graphics and Applications, 2002. Proceedings..

[27] Soh-Khim Ong,et al. Augmented reality aided interactive manual assembly design , 2013, The International Journal of Advanced Manufacturing Technology.

[28] Anna Hilsmann,et al. Pose Space Image Based Rendering , 2013, Comput. Graph. Forum.

[29] William E. Lorensen,et al. Marching cubes: A high resolution 3D surface construction algorithm , 1987, SIGGRAPH.

[30] Xubo Yang,et al. A low-latency 3D teleconferencing system with image based approach , 2013, VRCAI '13.

[31] Daniel Berjón,et al. Automatic system for virtual human reconstruction with 3D mesh multi-texturing and facial enhancement , 2013, Signal Process. Image Commun..

[32] Tatsuo Nakajima,et al. Playful training with augmented reality games: case studies towards reality-oriented system design , 2011, Multimedia Tools and Applications.