Self-Supervised Video Forensics by Audio-Visual Anomaly Detection