Motion Magnified 3-D Residual-in-Dense Network for DeepFake Detection

Driven by the advances in deep learning, highly photo-realistic techniques capable of switching the identity and expression of faces have emerged. Cheap access to computing has brought such technology within the reach of anyone with a computer and Internet including people with sinister motives. To detect these forgeries, we present a novel compression resilient approach for deepfake detection in videos. The proposed approach employs motion magnification as a pre-processing step to amplify temporal inconsistencies common in forged videos. Utilizing these processed videos, we propose the 3D Residual-in-Dense ConvNet, which captures low level spatiotemporal features, which help classify a video as pristine or forged. The proposed method yields more than 93% average detection accuracy on the high compression variant of the FaceForensics++ dataset and achieves state-of-the-art performance on multiple benchmarks across the FaceForensics++ and CelebDF datasets. Further, we study the behavior of deepfake detection algorithms across ethnicities and demonstrate how the proposed method reduces the inherent bias against minority ethnicities prevalent in existing algorithms.

[1]  Mayank Vatsa,et al.  Facial Retouching and Alteration Detection , 2022, Handbook of Digital Face Manipulation and Detection.

[2]  Mayank Vatsa,et al.  MagNet: Detecting Digital Presentation Attacks on Face Recognition , 2021, Frontiers in Artificial Intelligence.

[3]  Mayank Vatsa,et al.  MD-CSDNetwork: Multi-Domain Cross Stitched Network for Deepfake Detection , 2021, 2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021).

[4]  Mayank Vatsa,et al.  Detection of Digital Manipulation in Facial Images (Student Abstract) , 2021, AAAI.

[5]  Yongdong Zhang,et al.  Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Suya You,et al.  DefakeHop: A Light-Weight High-Performance Deepfake Detector , 2021, 2021 IEEE International Conference on Multimedia and Expo (ICME).

[7]  Nenghai Yu,et al.  Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Hao Zhu,et al.  AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection , 2020, NeurIPS.

[9]  Antitza Dantcheva,et al.  A video is worth more than 1000 lies. Comparing 3DCNN approaches for detecting deepfakes , 2020, 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020).

[10]  Yu-Gang Jiang,et al.  WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection , 2020, ACM Multimedia.

[11]  Yuan He,et al.  Sharp Multiple Instance Learning for DeepFake Video Detection , 2020, ACM Multimedia.

[12]  Jiahuan Zhou,et al.  Exposing Deep-faked Videos by Anomalous Co-motion Pattern Detection , 2020, ArXiv.

[13]  Daiheng Gao,et al.  DeepFaceLab: A simple, flexible and extensible face swapping framework , 2020, ArXiv.

[14]  Xi Wu,et al.  SSTNet: Detecting Manipulated Faces Through Spatial, Steganalysis and Temporal Features , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Mayank Vatsa,et al.  On the Robustness of Face Recognition Algorithms Against Attacks and Bias , 2020, AAAI.

[16]  Mayank Vatsa,et al.  Detecting Face2Face Facial Reenactment in Videos , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[17]  Fang Wen,et al.  Face X-Ray for More General Face Forgery Detection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Siwei Lyu,et al.  Celeb-DF: A Large-Scale Challenging Dataset for DeepFake Forensics , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Fang Wen,et al.  FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping , 2019, ArXiv.

[21]  Junichi Yamagishi,et al.  Use of a Capsule Network to Detect Fake Images and Videos , 2019, ArXiv.

[22]  Alberto Del Bimbo,et al.  Deepfake Video Detection through Optical Flow Based CNN , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[23]  Tal Hassner,et al.  FSGAN: Subject Agnostic Face Swapping and Reenactment , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[24]  Erhard Taverna Deepfakes , 2019, Bulletin des Médecins Suisses.

[25]  Junichi Yamagishi,et al.  Multi-task Learning for Detecting and Segmenting Manipulated Facial Images and Videos , 2019, 2019 IEEE 10th International Conference on Biometrics Theory, Applications and Systems (BTAS).

[26]  Premkumar Natarajan,et al.  Recurrent Convolutional Strategies for Face Manipulation Detection in Videos , 2019, CVPR Workshops.

[27]  Justus Thies,et al.  Deferred Neural Rendering: Image Synthesis using Neural Textures , 2019 .

[28]  Richa Singh,et al.  Deep Learning for Face Recognition: Pride or Prejudiced? , 2019, ArXiv.

[29]  Andreas Rössler,et al.  FaceForensics++: Learning to Detect Manipulated Facial Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[30]  Timo Aila,et al.  A Style-Based Generator Architecture for Generative Adversarial Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Stefanos Zafeiriou,et al.  ArcFace: Additive Angular Margin Loss for Deep Face Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Hao Li,et al.  Protecting World Leaders Against Deep Fakes , 2019, CVPR Workshops.

[33]  Sébastien Marcel,et al.  DeepFakes: a New Threat to Face Recognition? Assessment and Detection , 2018, ArXiv.

[34]  Siwei Lyu,et al.  In Ictu Oculi: Exposing AI Created Fake Videos by Detecting Eye Blinking , 2018, 2018 IEEE International Workshop on Information Forensics and Security (WIFS).

[35]  Edward J. Delp,et al.  Deepfake Video Detection Using Recurrent Neural Networks , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[36]  Junichi Yamagishi,et al.  MesoNet: a Compact Facial Video Forgery Detection Network , 2018, 2018 IEEE International Workshop on Information Forensics and Security (WIFS).

[37]  Frédo Durand,et al.  Learning-based Video Motion Magnification , 2018, ECCV.

[38]  Jung-Woo Ha,et al.  StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39]  Jaakko Lehtinen,et al.  Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[40]  Daniel Cohen-Or,et al.  Bringing portraits to life , 2017, ACM Trans. Graph..

[41]  Richa Singh,et al.  SWAPPED! Digital face presentation attack detection via weighted local magnitude pattern , 2017, 2017 IEEE International Joint Conference on Biometrics (IJCB).

[42]  Yutaka Satoh,et al.  Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[43]  Larry S. Davis,et al.  Two-Stream Neural Networks for Tampered Face Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[44]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[45]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Justus Thies,et al.  Face2Face: Real-Time Face Capture and Reenactment of RGB Videos , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Yu Qiao,et al.  Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[48]  Patrick Pérez,et al.  VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track , 2015, Comput. Graph. Forum.

[49]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[51]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[52]  Patrick Pérez,et al.  Automatic Face Reenactment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Frédo Durand,et al.  Eulerian video magnification for revealing subtle changes in the world , 2012, ACM Trans. Graph..

[54]  Jessica J. Fridrich,et al.  Rich Models for Steganalysis of Digital Images , 2012, IEEE Transactions on Information Forensics and Security.

[55]  Wojciech Matusik,et al.  Video face replacement , 2011, ACM Trans. Graph..

[56]  Brian C. Lovell,et al.  Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference , 2009, ICB.

[57]  Maneesh Agrawala,et al.  The cartoon animation filter , 2006, ACM Trans. Graph..

[58]  Frédo Durand,et al.  Motion magnification , 2005, ACM Trans. Graph..

[59]  Christoph Bregler,et al.  Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.

[60]  Thaddeus Beier,et al.  Feature-based image metamorphosis , 1992, SIGGRAPH.

[61]  P. Ekman,et al.  Measuring facial movement , 1976 .