CrossbowCam: a handheld adjustable multi-camera system

This paper presents a novel multi-functional, low-cost handheld multi-camera system (one dimensional camera array) - “CrossbowCam”. The CrossbowCam is suitable for multi-viewpoint image acquisition, smooth switching, alignment and seamless stitching applications. The proposed system differs from the traditional fixed image acquisition systems which are large-sized, high-priced, single functional, and can only captured images at specific locations. With the proposed system, the users can push one single button to change the configuration of the camera array rapidly to divergence (convex arc), parallel (linear), or convergence (concave arc). The three camera configurations can each be suitable for applications such as panorama image stitching, autostereoscopic 3D display, bullet-time (time-freeze) visual effect, 3D scene reconstruction, etc. To rapidly acquire the relationship among cameras after configuration change, we propose a two-stage calibration method to compensate the mechanical misalignment. The first stage adopts the traditional checkerboard calibration method to get the intrinsic parameters (focal length, principal point) and the lens distortion for each camera. The second stage requires no auxiliary tool but utilizes a large number of common feature points from multiple viewpoint images to acquire the extrinsic parameters (translation and rotation matrix) and to compensate the vertical misalignment and the horizontal uneven angle distribution due to the mechanical structure. The proposed system can then insert virtual viewpoint images between actual viewpoint images to allow the viewpoint switching more smoothly. The proposed system has eight cameras with maximum viewing angle of 90° in divergence mode, 38 mm spacing in parallel mode, and imaging radius of 10 m ∼ 0.5 m in convergence mode. We believe that the proposed system can potentially change the consumer habits and becomes the new type of home-use handheld camcorder system in the future.

[1]  Matthew A. Brown,et al.  Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[2]  Yo-Sung Ho,et al.  Geometrical Compensation Algorithm of Multiview Image for Arc Multi-camera Arrays , 2008, PCM.

[3]  Hideo Saito,et al.  On-line free-viewpoint video: From single to multiple view rendering , 2008, Int. J. Autom. Comput..

[4]  Shih-Fu Chang,et al.  CamSwarm: Instantaneous Smartphone Camera Arrays for Collaborative Photography , 2015, ArXiv.

[5]  Vincent Nozick,et al.  Camera array image rectification and calibration for stereoscopic and autostereoscopic displays , 2013, Ann. des Télécommunications.

[6]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Yo-Sung Ho,et al.  An efficient image rectification method for parallel multi-camera arrangement , 2011, IEEE Transactions on Consumer Electronics.

[8]  Fei Guo,et al.  Multiview image rectification algorithm for parallel camera arrays , 2014, J. Electronic Imaging.

[9]  Anita Sellent,et al.  Virtual Video Camera: Image‐Based Viewpoint Navigation Through Space and Time , 2010, Comput. Graph. Forum.

[10]  Julius Ziegler,et al.  StereoScan: Dense 3d reconstruction in real-time , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[11]  Shree K. Nayar,et al.  PiCam , 2013, ACM Trans. Graph..

[12]  Ding Chen,et al.  Optimizing infinite homography for bullet-time effect , 2014, SIGGRAPH '14.

[13]  Adrien Bartoli,et al.  Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces , 2013, BMVC.

[14]  Charles T. Loop,et al.  Computing rectifying homographies for stereo vision , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[15]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  Yuichi Ohta,et al.  Multi-resolution bullet-time effect , 2014, SIGGRAPH ASIA Posters.

[17]  S. Ullman The interpretation of structure from motion , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[18]  Paul Debevec,et al.  The Light Stages and Their Applications to Photoreal Digital Actors , 2012, SIGGRAPH 2012.

[19]  Tomoyuki Mishina,et al.  Bullet time using multi-viewpoint robotic camera system , 2014, CVMP.

[20]  Shih-Fu Chang,et al.  PanoSwarm: Collaborative and Synchronized Multi-Device Panoramic Photography , 2016, IUI.

[21]  Shree K. Nayar,et al.  Scene Collages and Flexible Camera Arrays , 2007, Rendering Techniques.

[22]  Marc Levoy,et al.  High performance imaging using large camera arrays , 2005, SIGGRAPH 2005.

[23]  Roel Vertegaal,et al.  FlexCam: using thin-film flexible OLED color prints as a camera array , 2012, CHI EA '12.

[24]  Vincent Lepetit,et al.  BRIEF: Computing a Local Binary Descriptor Very Fast , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Yo-Sung Ho,et al.  An Efficient Rectification Algorithm for Multi-View Images in Parallel Camera Array , 2008, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.