Interactive Grounding and Inference in Instruction Following