Hierarchical Reinforcement Learning in Partially Observable Markovian Environments