Understanding Temporal Relations from Video: A Pathway to Learning Sequential Tasks from Visual Demonstrations