Hierarchical Representations for Learning and Inferring Office Activity from Multiple Sensory Channels