Paper information - Protecting the privacy of humans in video sequences using a computer vision-based de-identification pipeline

Abstract: We propose a computer vision-based de-identification pipeline that automatically protects the privacy of humans in video sequences by obfuscating their appearance, while preserving the naturalness and utility of the de-identified data. The pipeline specifically targets soft biometric and non-biometric features, such as clothing, hair, and skin color, which often remain recognizable when simpler techniques such as blurring are applied. Assuming a surveillance scenario, we combine background subtraction based on Gaussian mixtures with an improved version of the GrabCut algorithm to find and segment pedestrians. De-identification is performed by altering the appearance of the segmented pedestrians with the neural art algorithm, which uses the responses of a deep neural network to render the pedestrian images in a different style. Experimental evaluation is performed both by automated classification and through a user study. Results suggest that the proposed pipeline successfully de-identifies a range of hard and soft biometric and non-biometric identifiers, including face, clothing, and hair.
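To make the first stage of the pipeline more concrete, the following is a minimal sketch of statistical background subtraction on synthetic grayscale frames. It is not the authors' implementation: it keeps a single running Gaussian per pixel rather than a full Gaussian mixture, and the function names (`update_background`, `bounding_box`) and parameters (`alpha`, `k`, `min_var`) are hypothetical choices made for illustration only.

```python
# Simplified stand-in for mixture-based background subtraction:
# ONE running Gaussian per pixel (assumption), not a Gaussian mixture.


def update_background(mean, var, frame, alpha=0.05, k=2.5, min_var=4.0):
    """Update the per-pixel model in place and return a 0/1 foreground mask.

    A pixel is marked foreground when it lies more than k standard
    deviations from the background mean. Only pixels classified as
    background are used to update the mean and variance.
    """
    mask = []
    for y, row in enumerate(frame):
        mask_row = []
        for x, value in enumerate(row):
            diff = value - mean[y][x]
            is_foreground = diff * diff > (k * k) * var[y][x]
            if not is_foreground:
                mean[y][x] += alpha * diff
                var[y][x] = max(min_var, (1 - alpha) * var[y][x] + alpha * diff * diff)
            mask_row.append(1 if is_foreground else 0)
        mask.append(mask_row)
    return mask


def bounding_box(mask):
    """Return (x0, y0, x1, y1) enclosing all foreground pixels, or None."""
    points = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    if not points:
        return None
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), min(ys), max(xs), max(ys)


def make_frame(height, width, base=100.0):
    """Create a uniform synthetic grayscale frame."""
    return [[base for _ in range(width)] for _ in range(height)]


height, width = 6, 8
mean = make_frame(height, width)                  # model starts from an empty scene
var = [[25.0] * width for _ in range(height)]

# Let the model observe a static scene for a few frames.
for _ in range(10):
    update_background(mean, var, make_frame(height, width))

# Insert a bright 3x2 blob that plays the role of a pedestrian.
frame = make_frame(height, width)
for y in range(2, 5):
    for x in range(3, 5):
        frame[y][x] = 200.0

mask = update_background(mean, var, frame)
box = bounding_box(mask)
print(box)
```

In a pipeline of the kind the abstract describes, a foreground mask or box like this could plausibly serve as the initialization for a graph-cut segmenter such as GrabCut, which would then refine the pedestrian outline before any style-based rendering is applied. How the authors actually couple the two stages is not specified in the abstract.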
