Local and Global Contextual Features Fusion for Pedestrian Intention Prediction