Real-time Attention-Augmented Spatio-Temporal Networks for Video-based Driver Activity Recognition