On the Horizon: Interactive and Compositional Deepfakes

Over a five-year period, computing methods for generating high-fidelity, fictional depictions of people and events moved from exotic demonstrations by computer science research teams into ongoing use as a tool of disinformation. The methods, referred to with the portmanteau of “deepfakes," have been used to create compelling audiovisual content. Here, I share challenges ahead with malevolent uses of two classes of deepfakes that we can expect to come into practice with costly implications for society: interactive and compositional deepfakes. Interactive deepfakes have the capability to impersonate people with realistic interactive behaviors, taking advantage of advances in multimodal interaction. Compositional deepfakes leverage synthetic content in larger disinformation plans that integrate sets of deepfakes over time with observed, expected, and engineered world events to create persuasive synthetic histories. Synthetic histories can be constructed manually but may one day be guided by adversarial generative explanation (AGE) techniques. In the absence of mitigations, interactive and compositional deepfakes threaten to move us closer to a post-epistemic world, where fact cannot be distinguished from fiction. I shall describe interactive and compositional deepfakes and reflect about cautions and potential mitigations to defend against them.

[1]  S. Lewandowsky,et al.  Psychological inoculation improves resilience against misinformation on social media , 2022, Science advances.

[2]  Noah A. Smith,et al.  Imagined versus Remembered Stories: Quantifying Differences in Narrative Flow , 2022, 2201.02662.

[3]  Carlo M. Horz Propaganda and Skepticism , 2021 .

[4]  John P. Dickerson,et al.  Counterfactual Explanations for Machine Learning: A Review , 2020, ArXiv.

[5]  Tobias Gerstenberg,et al.  Inference from explanation. , 2020, Journal of experimental psychology. General.

[6]  Henrique S. Malvar,et al.  AMP: authentication of media via provenance , 2020, MMSys.

[7]  Justus Thies,et al.  Neural Voice Puppetry: Audio-driven Facial Reenactment , 2019, ECCV.

[8]  David G. Rand,et al.  Reliance on emotion promotes belief in fake news , 2019, Cognitive research: principles and implications.

[9]  M. Zollhöfer,et al.  Face2Face , 2018, Communications of the ACM.

[10]  Mark A. Neerincx,et al.  Contrastive Explanations with Local Foil Trees , 2018, ICML 2018.

[11]  Yejin Choi,et al.  Event2Mind: Commonsense Inference on Events, Intents, and Reactions , 2018, ACL.

[12]  Sercan Ömer Arik,et al.  Neural Voice Cloning with a Few Samples , 2018, NeurIPS.

[13]  Chris L. Baker,et al.  Rational quantitative attribution of beliefs, desires and percepts in human mentalizing , 2017, Nature Human Behaviour.

[14]  Adam Coates,et al.  Deep Voice: Real-time Neural Text-to-Speech , 2017, ICML.

[15]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[16]  Joseph Y. Halpern,et al.  Actual Causality , 2016, A Logical Theory of Causality.

[17]  Justus Thies,et al.  Face2Face: Real-Time Face Capture and Reenactment of RGB Videos , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[19]  Joseph M. Parent,et al.  American Conspiracy Theories , 2014 .

[20]  Tomoki Toda,et al.  Acquiring a Dictionary of Emotion-Provoking Events , 2014, EACL.

[21]  Ro'i Zultan,et al.  Causal Responsibility and Counterfactuals , 2013, Cogn. Sci..

[22]  Eric Horvitz,et al.  Decisions about turns in multiparty conversation: from perception to action , 2011, ICMI '11.

[23]  Zornitsa Kozareva,et al.  SemEval-2012 Task 7: Choice of Plausible Alternatives: An Evaluation of Commonsense Causal Reasoning , 2011, *SEMEVAL.

[24]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Event Chains , 2008, ACL.

[25]  Keith A. Markus,et al.  Making Things Happen: A Theory of Causal Explanation , 2007 .

[26]  S. Carey,et al.  Functional explanation and the function of explanation , 2006, Cognition.

[27]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[28]  Craig Joseph,et al.  Conspiracy Thinking in the Middle East , 1994 .

[29]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[30]  D. Noble The Paranoid Style in American Politics and Other Essays by Richard Hofstadter (review) , 1966, Canadian Historical Review.

[31]  Illtyd Trethowan Causality , 1938 .

[32]  Silja Renooij,et al.  Persuasive Contrastive Explanations for Bayesian Networks , 2021, ECSQARU.

[33]  Henrique S. Malvar,et al.  MULTI-STAKEHOLDER MEDIA PROVENANCE MANAGEMENT TO COUNTER SYNTHETIC MEDIA RISKS IN NEWS PUBLISHING , 2020 .

[34]  Yejin Choi,et al.  ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning , 2019, AAAI.

[35]  Joshua B. Tenenbaum,et al.  What happened? Reconstructing the past through vision and sound , 2018, CogSci.

[36]  F. Fukuyama,et al.  Conspiracy: How the Paranoid Style Flourishes and Where It Comes From , 1998, Foreign Affairs.

[37]  Wendy Wood,et al.  Stages in the Analysis of Persuasive Messages: The Role of Causal Attributions and Message Comprehension. , 1981 .

[38]  Lloyd M. Abernethy Book Review: The Paranoid Style in American Politics and other Essays, by Richard Hofstadter , 1966 .