ST(OR)²: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room