Towards Real-Time Service from Remote Sensing: Compression of Earth Observatory Video Data via Long-Term Background Referencing

City surveillance enables many innovative applications of smart cities. However, the real-time utilization of remotely sensed surveillance data via unmanned aerial vehicles (UAVs) or video satellites is hindered by the considerable gap between the high data collection rate and the limited transmission bandwidth. High efficiency compression of the data is in high demand. Long-term background redundancy (LBR) (in contrast to local spatial/temporal redundancies in a single video clip) is a new form of redundancy common in Earth observatory video data (EOVD). LBR is induced by the repetition of static landscapes across multiple video clips and becomes significant as the number of video clips shot of the same area increases. Eliminating LBR improves EOVD coding efficiency considerably. First, this study proposes eliminating LBR by creating a long-term background referencing library (LBRL) containing high-definition geographically registered images of an entire area. Then, it analyzes the factors affecting the variations in the image representations of the background. Next, it proposes a method of generating references for encoding current video and develops the encoding and decoding framework for EOVD compression. Experimental results show that encoding UAV video clips with the proposed method saved an average of more than 54% bits using references generated under the same conditions. Bitrate savings reached 25–35% when applied to satellite video data with arbitrarily collected reference images. Applying the proposed coding method to EOVD will facilitate remote surveillance, which can foster the development of online smart city applications.

[1]  Erik Reinhard,et al.  Color Transfer between Images , 2001, IEEE Computer Graphics and Applications.

[2]  Xiaoyan Sun,et al.  Feature-based image set compression , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[3]  Luisa Verdoliva,et al.  Region-Based Transform Coding of Multispectral Images , 2007, IEEE Transactions on Image Processing.

[4]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Amin Sedaghat,et al.  Distinctive Order Based Self-Similarity descriptor for multi-sensor remote sensing image matching , 2015 .

[6]  Patrick Pérez,et al.  Reconstructing an image from its local descriptors , 2011, CVPR 2011.

[7]  Enrico Magli,et al.  Optimized onboard lossless and near-lossless compression of hyperspectral data using CALIC , 2004, IEEE Geoscience and Remote Sensing Letters.

[8]  Xuelong Li,et al.  A Hybrid Sparsity and Distance-Based Discrimination Detector for Hyperspectral Images , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Wen Gao,et al.  Quality-progressive coding for high bit-rate background frames on surveillance videos , 2015, 2015 IEEE International Symposium on Circuits and Systems (ISCAS).

[10]  Soonhoi Ha,et al.  An efficient parallelization technique for x264 encoder on heterogeneous platforms consisting of CPUs and GPUs , 2012, Journal of Real-Time Image Processing.

[11]  Xiangtao Zheng,et al.  Exploring Models and Data for Remote Sensing Image Caption Generation , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[13]  Baowen Xu,et al.  Super-resolution Person re-identification with semi-coupled low-rank discriminant dictionary learning , 2015, CVPR.

[14]  Yu Chen,et al.  Exploiting global redundancy in big surveillance video data for efficient coding , 2015, Cluster Computing.

[15]  Ahmed Bouridane,et al.  Hyperspectral image compression with modified 3D SPECK , 2010, 2010 7th International Symposium on Communication Systems, Networks & Digital Signal Processing (CSNDSP 2010).

[16]  Manoranjan Paul,et al.  Adaptive weighted non-parametric background model for efficient video coding , 2017, Neurocomputing.

[17]  Guizhong Liu,et al.  Efficient compression algorithm for hyperspectral images based on correlation coefficients adaptive 3D zerotree coding , 2008 .

[18]  Xiangtao Zheng,et al.  Remote Sensing Scene Classification by Unsupervised Representation Learning , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[20]  Xiaoyan Sun,et al.  Lossless Compression of JPEG Coded Photo Collections , 2016, IEEE Transactions on Image Processing.

[21]  Dong Liu,et al.  Surveillance video coding with vehicle library , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[22]  Jun Wu,et al.  Joint Compression of Near-Duplicate Videos , 2017, IEEE Transactions on Multimedia.

[23]  Wei Dai,et al.  Digital photo album compression based on Global Motion Compensation and Intra/Inter prediction , 2012, 2012 International Conference on Audio, Language and Image Processing.

[24]  Ruimin Hu,et al.  A Block-Based Background Model for Surveillance Video Coding , 2015, 2015 Data Compression Conference.

[25]  Ke Lu,et al.  G-IK-SVD: parallel IK-SVD on GPUs for sparse representation of spatial big data , 2017, The Journal of Supercomputing.

[26]  Jarno Mielikäinen,et al.  Correlation-based band-ordering heuristic for lossless compression of hyperspectral sounder data , 2005, IEEE Geoscience and Remote Sensing Letters.

[27]  Jarno Mielikäinen,et al.  Clustered DPCM for the lossless compression of hyperspectral images , 2003, IEEE Trans. Geosci. Remote. Sens..

[28]  Dong Yue,et al.  Multi-view low-rank dictionary learning for image classification , 2016, Pattern Recognit..

[29]  Guangming Shi,et al.  Cloud-Based Distributed Image Coding , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Enrico Magli,et al.  Transform Coding Techniques for Lossy Hyperspectral Data Compression , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Chongyu Chen,et al.  Surveillance video coding via low-rank and sparse decomposition , 2012, ACM Multimedia.

[32]  Zixiang Xiong,et al.  Knowledge-Based Coding of Objects for Multisource Surveillance Video Data , 2016, IEEE Transactions on Multimedia.

[33]  Dong Liu,et al.  Block-Composed Background Reference for High Efficiency Video Coding , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[34]  Rajiv Ranjan,et al.  IK-SVD: Dictionary Learning for Spatial Big Data via Incremental Atom Update , 2014, Computing in Science & Engineering.

[35]  Chongyu Chen,et al.  Incremental low-rank and sparse decomposition for compressing videos captured by fixed cameras , 2015, J. Vis. Commun. Image Represent..

[36]  Xiaoyan Sun,et al.  Cloud-Based Image Coding for Mobile Devices—Toward Thousands to One Compression , 2013, IEEE Transactions on Multimedia.

[37]  Nadia Magnenat-Thalmann,et al.  Sparse Low-Rank Matrix Approximation for Data Compression , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[38]  Xuelong Li,et al.  Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features , 2018, IEEE Transactions on Image Processing.

[39]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[40]  Xianguo Zhang,et al.  Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding , 2014, IEEE Transactions on Image Processing.

[41]  Oscar C. Au,et al.  Personal photo album compression and management , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).