High-performance computing model for 3D camera system

Computer vision and sensing systems have been widely applied to industry, and this includes robotics, an area requiring real-time applications. However, robust and high-accuracy systems are usually time-consuming, but high-performance devises can speed up the systems. Many applications use GPU (Graphic Processing Unit) for faster computation. The greatest advantages of the GPU system include high-memory bandwidth and flexibility and ease of programming. However, this method requires transferring data between host (computer) and device (graphic card). Moreover, it only supports data parallelism or overlap copy data and kernel execution. Multi-core CPU offers another approach for improving system performance. OpenMP, an API that supports multi-core CPU programming strongly, offers flexibility and simple programming using compiler directives items; in particular, the computation can be done without transferring data to the device and full task parallel execution. However, multi-core CPU is limited to several cores. Combining the advantages of GPU and multi-core CPU optimizes system performance. This paper proposes a novel high-performance computing model using combination architecture of GPU and multi-core CPU to obtain a two parallel layer system including data parallel and task parallel models. The result implements a highly accurate 3D reconstruction application using the improved HOC structured light coding algorithm. The experimental results of the proposed model reveal a speed 20 times faster than the original implementation in the single CPU.

[1]  Sukhan Lee,et al.  Ray-tracing codec for structured light 3D camera , 2012, 2012 IEEE International Conference on Robotics and Automation.

[2]  Oscar C. Au,et al.  Highly Parallel Rate-Distortion Optimized Intra-Mode Decision on Multicore Graphics Processors , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  K. Srinathan,et al.  A performance prediction model for the CUDA GPGPU platform , 2009, 2009 International Conference on High Performance Computing (HiPC).

[4]  Jens Guehring,et al.  Dense 3D surface acquisition by structured light using off-the-shelf components , 2000, IS&T/SPIE Electronic Imaging.

[5]  Alejandro Duran,et al.  Techniques supporting threadprivate in OpenMP , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[6]  Keechul Jung,et al.  Neural Network Implementation Using CUDA and OpenMP , 2008, 2008 Digital Image Computing: Techniques and Applications.

[7]  Tsutomu Maruyama,et al.  Performance comparison of FPGA, GPU and CPU in image processing , 2009, 2009 International Conference on Field Programmable Logic and Applications.

[8]  Hakil Kim,et al.  A fast feature extraction in object recognition using parallel processing on CPU and GPU , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[9]  Jeffrey L. Posdamer,et al.  Surface measurement by space-encoded projected beam systems , 1982, Comput. Graph. Image Process..

[10]  Sukhan Lee,et al.  Signal Separation Coding for Robust Depth Imaging Based on Structured Light , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[11]  Thomas Ertl,et al.  Optimized data transfer for time-dependent, GPU-based glyphs , 2009, 2009 IEEE Pacific Visualization Symposium.