Self-Supervised Learning of Structure and Motion from Video