Reading between lines: high-rate, non-intrusive visual codes within regular videos via ImplicitCode

Given the widespread adoption of camera-equipped mobile devices, there has been increasing interest in enabling user interaction via visual codes. Simple examples such as QR Codes abound. Because many codes, QR Codes included, are visually intrusive, various mechanisms have been explored to design visual codes that can be hidden inside regular images or videos, though the capacity of these codes remains low in order to ensure invisibility. We argue, however, that achieving high capacity while maintaining invisibility would enable a vast range of applications that embed rich contextual information in video screens. To this end, we propose ImplicitCode, a high-rate visual code that can be hidden inside regular videos. Our scheme combines existing techniques to achieve invisibility; we show that these techniques, when employed individually, are too constraining to deliver high capacity. Experimental results show that ImplicitCode can deliver a significant capacity boost over two recent schemes, up to 12x that of HiLight [19] and 6x or 7x that of InFrame [32], while maintaining a similar or better level of invisibility.

[1] Tianxing Li, et al. Real-Time Screen-Camera Communication Behind Any Scene, 2015, MobiSys.

[2] Frank Hartung, et al. Multimedia watermarking techniques, 1999, Proc. IEEE.

[3] Jun Xiao, et al. AiD: augmented information display, 2014, UbiComp.

[4] Guobin Shen, et al. InFrame: Multiflexing Full-Frame Visible Communication Channel for Humans and Devices, 2014, HotNets.

[5] Ramesh Raskar, et al. Bokode: imperceptible visual tags for camera based interaction from a distance, 2009, ACM Trans. Graph.

[6] Tapio Seppänen, et al. Reading Watermarks from Printed Binary Images with a Camera Phone, 2009, IWDW.

[7] Mark Weiser, et al. Creating the invisible interface: (invited talk), 1994, UIST '94.

[8] Ahmed H. Tewfik, et al. Geometric Invariance in image watermarking, 2004, IEEE Transactions on Image Processing.

[9] Michael Rohs, et al. Real-World Interaction with Camera Phones, 2004, UCS.

[10] Bahram Javidi, et al. Information hiding technique with double phase encoding, 2002, Applied Optics.

[11] Ramesh Raskar, et al. VRCodes: Unobtrusive and active visual codes for interaction by exploiting rolling shutter, 2012, 2012 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[12] Suresh Chande, et al. Mobile visual interaction: enhancing local communication and collaboration with visual interactions, 2006, MUM '06.

[13] Christopher M. Bishop, et al. Pattern Recognition and Machine Learning (Information Science and Statistics), 2006.

[14] Tianxing Li, et al. HiLight: Hiding Bits in Pixel Translucency Changes, 2015, MOCO.

[15] Guobin Shen, et al. InFrame++: Achieve Simultaneous Screen-Human Viewing and Hidden Screen-Camera Communication, 2015, MobiSys.

[16] O. Meur, et al. Predicting visual fixations on video based on low-level visual features, 2007, Vision Research.

[17] Kevin Curran, et al. Digital image steganography: Survey and analysis of current methods, 2010, Signal Process.

[18] Qingzhong Liu, et al. Video Steganalysis Based on the Expanded Markov and Joint Distribution on the Transform Domains Detecting MSU StegoVideo, 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[19] Liqing Zhang, et al. Saliency Detection: A Spectral Residual Approach, 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[20] Kaigui Bian, et al. Strata: layered coding for scalable visual communication, 2014, MobiCom.

[21] Dina Katabi, et al. PixNet: interference-free wireless links using LCD-camera pairs, 2010, MobiCom.

[22] Tapio Seppänen, et al. Watermark robustness in the print-cam process, 2008.

[23] Chunming Hu, et al. Enhancing reliability to boost the throughput over screen-camera links, 2014, MobiCom.

[24] Ingemar J. Cox, et al. Secure spread spectrum watermarking for multimedia, 1997, IEEE Trans. Image Process.

[25] Guoliang Xing, et al. COBRA: color barcode streaming for smartphone systems, 2012, MobiSys '12.

[26] Xiaolin Wu, et al. Temporal Psychovisual Modulation: A New Paradigm of Information Display [Exploratory DSP], 2012, IEEE Signal Processing Magazine.

[27] Wenjun Hu, et al. LightSync: unsynchronized visual communication over screen-camera links, 2013, MobiCom.

[28] Christof Koch, et al. A Model of Saliency-Based Visual Attention for Rapid Scene Analysis, 2009.

[29] Tapio Seppänen, et al. Camera Based Watermark Extraction – Problems and Examples, 2007.