Protecting the privacy of humans in video sequences using a computer vision-based de-identification pipeline

Abstract We propose a computer vision-based de-identification pipeline that enables automated protection of privacy of humans in video sequences through obfuscating their appearance, while preserving the naturalness and utility of the de-identified data. Our pipeline specifically addresses de-identifying soft and non-biometric features, such as clothing, hair, skin color etc., which often remain recognizable when simpler techniques such as blurring are applied. Assuming a surveillance scenario, we combine background subtraction based on Gaussian mixtures with an improved version of the GrabCut algorithm to find and segment pedestrians. De-identification is performed by altering the appearance of the segmented pedestrians through the neural art algorithm that uses the responses of a deep neural network to render the pedestrian images in a different style. Experimental evaluation is performed both by automated classification and through a user study. Results suggest that the proposed pipeline successfully de-identifies a range of hard and soft biometric and non-biometric identifiers, including face, clothing and hair.

[1]  Leon A. Gatys,et al.  A Neural Algorithm of Artistic Style , 2015, ArXiv.

[2]  Arun Ross,et al.  Soft biometrics for surveillance: an overview , 2013 .

[3]  Edward J. Delp,et al.  Robust local and global shape context for tattoo image matching , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[4]  Shin'ichi Satoh,et al.  VabCut: A video extension of GrabCut for unsupervised video foreground object segmentation , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[5]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[7]  Fatih Murat Porikli,et al.  CDnet 2014: An Expanded Change Detection Benchmark Dataset , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[8]  Bernt Schiele,et al.  Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[10]  Andrew Senior Protecting Privacy in Video Surveillance , 2009 .

[11]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Álvaro García-Martín,et al.  People detection in surveillance: classification and evaluation , 2015, IET Comput. Vis..

[13]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[14]  Xiaogang Wang,et al.  Modeling Mutual Visibility Relationship in Pedestrian Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Yuan Lin,et al.  Face Swapping under Large Pose Variations: A 3D Model Based Approach , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[16]  Chandrika Kamath,et al.  Robust techniques for background subtraction in urban traffic video , 2004, IS&T/SPIE Electronic Imaging.

[17]  Tomislav Hrkac,et al.  Iterative Automated Foreground Segmentation in Video Sequences Using Graph Cuts , 2015, GCPR.

[18]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[19]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[20]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Vladimir Kolmogorov,et al.  "GrabCut": interactive foreground extraction using iterated graph cuts , 2004, ACM Trans. Graph..

[22]  Bernt Schiele,et al.  Pictorial structures revisited: People detection and articulated pose estimation , 2009, CVPR.

[23]  Zoran Kalafatic,et al.  Towards neural art-based face de-identification in video data , 2016, 2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE).

[24]  Terrance E. Boult,et al.  Detecting and classifying scars, marks, and tattoos found in the wild , 2012, 2012 IEEE Fifth International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[25]  Ralph Gross,et al.  Model-Based Face De-Identification , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[26]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[27]  Leon A. Gatys,et al.  Texture synthesis and the controlled generation of natural stimuli using convolutional neural networks , 2015, ArXiv.

[28]  B. Schiele,et al.  How Far are We from Solving Pedestrian Detection? , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Álvaro García-Martín,et al.  People-background segmentation with unequal error cost , 2012, 2012 19th IEEE International Conference on Image Processing.

[30]  Xiaogang Wang,et al.  A discriminative deep model for pedestrian detection with occlusion handling , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Rita Cucchiara,et al.  Mapping Appearance Descriptors on 3D Body Models for People Re-identification , 2015, International Journal of Computer Vision.

[32]  Touradj Ebrahimi,et al.  Scrambling for Privacy Protection in Video Surveillance Systems , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[34]  P. J. Narayanan,et al.  Person De-Identification in Videos , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Bradley Malin,et al.  Preserving privacy by de-identifying face images , 2005, IEEE Transactions on Knowledge and Data Engineering.

[36]  Alexandros André Chaaraoui,et al.  Visual privacy protection methods: A survey , 2015, Expert Syst. Appl..

[37]  Álvaro García-Martín,et al.  On collaborative people detection and tracking in complex scenarios , 2012, Image Vis. Comput..

[38]  Benjamin Höferlin,et al.  Evaluation of background subtraction techniques for video surveillance , 2011, CVPR 2011.

[39]  Yann LeCun,et al.  Pedestrian Detection with Unsupervised Multi-stage Feature Learning , 2012, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Nikola Pavesic,et al.  De-identification for privacy protection in multimedia content: A survey , 2016, Signal Process. Image Commun..

[41]  Edoardo M. Airoldi,et al.  Integrating Utility into Face De-identification , 2005, Privacy Enhancing Technologies.

[42]  Alfredo Gardel Vicente,et al.  Modeling feature distances by orientation driven classifiers for person re-identification , 2016, J. Vis. Commun. Image Represent..

[43]  Karla Brkić,et al.  Towards Reversible De-Identification in Video Sequences Using 3D Avatars and Steganography , 2015, ArXiv.

[44]  Anil K. Jain,et al.  Tattoo based identification: Sketch to image matching , 2013, 2013 International Conference on Biometrics (ICB).

[45]  Harry Shum,et al.  Background Cut , 2006, ECCV.

[46]  Mohan M. Trivedi,et al.  A track-based human movement analysis and privacy protection system adaptive to environmental contexts , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[47]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[48]  Sergio Escalera,et al.  GrabCut-Based Human Segmentation in Video Sequences , 2012, Sensors.

[49]  Shree K. Nayar,et al.  Face swapping: automatically replacing faces in photographs , 2008, SIGGRAPH 2008.

[50]  Jesús Bescós,et al.  Background Subtraction Techniques: Systematic Evaluation and Comparative Analysis , 2009, ACIVS.

[51]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[52]  Cristian Sminchisescu,et al.  Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[54]  Bernt Schiele,et al.  Ten Years of Pedestrian Detection, What Have We Learned? , 2014, ECCV Workshops.