A New Deep Learning-Based Methodology for Video Deepfake Detection Using XGBoost

Currently, face-swapping deepfake techniques are widely spread, generating a significant number of highly realistic fake videos that threaten the privacy of people and countries. Due to their devastating impacts on the world, distinguishing between real and deepfake videos has become a fundamental issue. This paper presents a new deepfake detection method: you only look once–convolutional neural network–extreme gradient boosting (YOLO-CNN-XGBoost). The YOLO face detector is employed to extract the face area from video frames, while the InceptionResNetV2 CNN is utilized to extract features from these faces. These features are fed into the XGBoost that works as a recognizer on the top level of the CNN network. The proposed method achieves 90.62% of an area under the receiver operating characteristic curve (AUC), 90.73% accuracy, 93.53% specificity, 85.39% sensitivity, 85.39% recall, 87.36% precision, and 86.36% F1-measure on the CelebDF-FaceForencics++ (c23) merged dataset. The experimental study confirms the superiority of the presented method as compared to the state-of-the-art methods.

[1]  R. Kumar,et al.  Accurate Prediction of COVID-19 using Chest X-Ray Images through Deep Feature Learning model with SMOTE and Machine Learning Classifiers , 2020, medRxiv.

[2]  Solomon Atnafu,et al.  Deepfake Video Detection Using Convolutional Vision Transformer , 2021, ArXiv.

[3]  Paolo Bestagini,et al.  Video Face Manipulation Detection Through Ensemble of CNNs , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[4]  Parth Goel,et al.  An amalgamation of YOLOv4 and XGBoost for next-gen smart traffic management system , 2021, PeerJ Comput. Sci..

[5]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[6]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Siwei Lyu,et al.  Exposing DeepFake Videos By Detecting Face Warping Artifacts , 2018, CVPR Workshops.

[8]  Hanxiang Hao,et al.  Deepfakes Detection with Automatic Face Weighting , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9]  Yan Wang,et al.  Fighting Against Deepfake: Patch&Pair Convolutional Neural Networks (PPCNN) , 2020, WWW.

[10]  Arnav Bhavsar,et al.  Detecting Deepfakes with Metric Learning , 2020, 2020 8th International Workshop on Biometrics and Forensics (IWBF).

[11]  Amit Ganatra,et al.  A Deep Learning Approach for Face Detection using YOLO , 2018, 2018 IEEE Punecon.

[12]  B. S. Jayasri,et al.  Real-time object detection and face recognition system to assist the visually impaired , 2020 .

[13]  Stefano Tornincasa,et al.  Application of geometry to RGB images for facial landmark localisation - a preliminary approach , 2016, Int. J. Biom..

[14]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Dipankar Dasgupta,et al.  Face Authenticity: An Overview of Face Manipulation Generation, Detection and Recognition , 2019, SSRN Electronic Journal.

[16]  Yihui He Object Detection with YOLO on Artwork Dataset , 2016 .

[17]  Junichi Yamagishi,et al.  Capsule-forensics: Using Capsule Networks to Detect Forged Images and Videos , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Yu-Gang Jiang,et al.  WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection , 2020, ACM Multimedia.

[19]  Sébastien Marcel,et al.  DeepFakes: a New Threat to Face Recognition? Assessment and Detection , 2018, ArXiv.

[20]  Saeid Nahavandi,et al.  Deep Learning for Deepfakes Creation and Detection , 2019, ArXiv.

[21]  Yiannis Kompatsiaris,et al.  A Face Preprocessing Approach for Improved DeepFake Detection , 2020, ArXiv.

[22]  Junichi Yamagishi,et al.  Multi-task Learning for Detecting and Segmenting Manipulated Facial Images and Videos , 2019, 2019 IEEE 10th International Conference on Biometrics Theory, Applications and Systems (BTAS).

[23]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[24]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[25]  Charles P. Staelin Parameter selection for support vector machines , 2002 .

[26]  Wei Zhang,et al.  Multiview Facial Landmark Localization in RGB-D Images via Hierarchical Regression With Binary Patterns , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Licheng Zhang,et al.  Machine Learning in Rock Facies Classification: An Application of XGBoost , 2017 .

[28]  I. Muchnik,et al.  Support Vector Machines for Classification , 2015 .

[29]  Zhi-Qiang Liu,et al.  A real-time face detector , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[30]  Chen Change Loy,et al.  DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Sean Franklin,et al.  Deepfake Detection using Spatiotemporal Convolutional Networks , 2020, ArXiv.

[32]  Sherin M. Youssef,et al.  iCaps-Dfake: An Integrated Capsule-Based Model for Deepfake Image and Video Detection , 2021, Future Internet.

[33]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[34]  Stefanos Zafeiriou,et al.  RetinaFace: Single-stage Dense Face Localisation in the Wild , 2019, ArXiv.

[35]  Junichi Yamagishi,et al.  MesoNet: a Compact Facial Video Forgery Detection Network , 2018, 2018 IEEE International Workshop on Information Forensics and Security (WIFS).

[36]  Worku Muluye Wubet The Deepfake Challenges and Deepfake Video Detection , 2020 .

[37]  Tomas E. Ward,et al.  Generative Adversarial Networks in Computer Vision , 2019, ACM Comput. Surv..

[38]  Cuiping Zhang,et al.  YOLO-face: a real-time face detector , 2020, The Visual Computer.

[39]  Siwei Lyu,et al.  In Ictu Oculi: Exposing AI Created Fake Videos by Detecting Eye Blinking , 2018, 2018 IEEE International Workshop on Information Forensics and Security (WIFS).

[40]  L. Spreeuwers,et al.  Deepfake Detection using Capsule Networks and Long Short-Term Memory Networks , 2021, VISIGRAPP.

[41]  Davis E. King,et al.  Dlib-ml: A Machine Learning Toolkit , 2009, J. Mach. Learn. Res..

[42]  Cristian Canton Ferrer,et al.  The DeepFake Detection Challenge (DFDC) Dataset. , 2020 .

[43]  Feng Liu,et al.  On the Detection of Digital Face Manipulation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[45]  Yu Qiao,et al.  Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[46]  Thai Son Tran,et al.  Learning Spatio-temporal features to detect manipulated facial videos created by the Deepfake techniques , 2021, Digit. Investig..

[47]  Valentin Bazarevsky,et al.  BlazeFace: Sub-millisecond Neural Face Detection on Mobile GPUs , 2019, ArXiv.

[48]  Premkumar Natarajan,et al.  Recurrent Convolutional Strategies for Face Manipulation Detection in Videos , 2019, CVPR Workshops.

[49]  Timothy Dozat,et al.  Incorporating Nesterov Momentum into Adam , 2016 .

[50]  Andreas Rössler,et al.  FaceForensics++: Learning to Detect Manipulated Facial Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[51]  Edward J. Delp,et al.  Deepfake Video Detection Using Recurrent Neural Networks , 2018, 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[52]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[53]  Andrew W. H. Ip,et al.  A Multilevel Single Stage Network for Face Detection , 2021, Wirel. Commun. Mob. Comput..

[54]  Qi Yao,et al.  YOLO5Face: Why Reinventing a Face Detector , 2021, ECCV Workshops.

[55]  Amritpal Singh,et al.  DeepFake Video Detection: A Time-Distributed Approach , 2020, SN Computer Science.

[56]  Lisa Kaati,et al.  Detecting Multipliers of Jihadism on Twitter , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[57]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[58]  Quoc V. Le,et al.  EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[59]  Xin Yang,et al.  Exposing Deep Fakes Using Inconsistent Head Poses , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).