A FPGA Accelerator for Real-Time 3D Non-rigid Registration Using Tree Reweighted Message Passing and Dynamic Markov Random Field Generation

Non-rigid 3D registration is a technique for matching 3D scans of a scene involving deformable objects. Augmented reality, gesture recognition, medical imaging, and many other computer vision and graphics applications require real-time registration to model deformable or articulated objects. Unfortunately, non-rigid registration is a computationally intensive problem that requires careful optimization to maximize throughput and latency. We present a FPGA+CPU accelerator for real-time non-rigid 3D registration based on Tree Reweighted Message Passing (TRW-S). We overcome memory bound issues and scheduling limitations of conventional TRW-S by dynamically generating the Markov Random Fields. This, along with a bevy of other architectural optimizations, allows us to almost saturate 1024 multipliers in a Arria 10 at 100MHz. We achieve a 600x speed up over baseline TRW-S and our registration architecture has up to 81x energy reduction over a software implementation of our algorithm. We demonstrate the performance of our system by performing real-time (20 scan per second) registration on a complicated surgical scene.

[1]  M. Pauly,et al.  Embedded deformation for shape manipulation , 2007, SIGGRAPH 2007.

[2]  Michael J. Black,et al.  FAUST: Dataset and Evaluation for 3D Mesh Registration , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Liang-Gee Chen,et al.  Hardware-Efficient Belief Propagation , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Vladimir Kolmogorov,et al.  Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  J. Marescaux,et al.  Towards cybernetic surgery: robotic and augmented reality-assisted liver segmentectomy , 2015, Langenbeck's Archives of Surgery.

[6]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[7]  Huamin Wang,et al.  Real-time Human Pose and Shape Estimation for Virtual Try-On Using a Single Commodity Depth Camera , 2014, IEEE Trans. Vis. Comput. Graph..

[8]  Makoto Hashizume,et al.  Augmented reality navigation system for laparoscopic splenectomy in children based on preoperative CT image using optical tracking device , 2012, Pediatric Surgery International.

[9]  Vladlen Koltun,et al.  Robust Nonrigid Registration by Convex Optimization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[10]  Rob A. Rutenbar,et al.  Video-Rate Stereo Matching Using Markov Random Field TRW-S Inference on a Hybrid CPU+FPGA Computing Platform , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Lale Akarun,et al.  Real time hand pose estimation using depth sensors , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[12]  Gérard G. Medioni,et al.  Human pose estimation from a single view point, real-time range sensor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[13]  Richard Szeliski,et al.  A Comparative Study of Energy Minimization Methods for Markov Random Fields with Smoothness-Based Priors , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Andrew W. Fitzgibbon,et al.  Real-time non-rigid reconstruction using an RGB-D camera , 2014, ACM Trans. Graph..

[15]  Pushmeet Kohli,et al.  Fusion4D , 2016, ACM Trans. Graph..

[16]  Raj Madhavan,et al.  Temporal Range Registration for Unmanned Ground and Aerial Vehicles , 2004, ICRA.

[17]  Tobias Ortmaier,et al.  Tissue surface information for intraoperative incision planning and focus adjustment in laser surgery , 2015, International Journal of Computer Assisted Radiology and Surgery.

[18]  Guangwen Yang,et al.  Patra: Parallel tree-reweighted message passing architecture , 2014, 2014 24th International Conference on Field Programmable Logic and Applications (FPL).

[19]  Dieter Fox,et al.  DynamicFusion: Reconstruction and tracking of non-rigid scenes in real-time , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Ara Darzi,et al.  The current and future use of imaging in urological robotic surgery: a survey of the European Association of Robotic Urological Surgeons , 2015, The international journal of medical robotics + computer assisted surgery : MRCAS.

[21]  Rob A. Rutenbar,et al.  Fast hierarchical implementation of sequential tree-reweighted belief propagation for probabilistic inference , 2015, 2015 25th International Conference on Field Programmable Logic and Applications (FPL).

[22]  Jörg Stückler,et al.  Multi-resolution surfel maps for efficient dense 3D modeling and tracking , 2014, J. Vis. Commun. Image Represent..