Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs