Supplementary Material: X3D: Expanding Architectures for Efficient Video Recognition