Approximate value-directed belief state monitoring for partially observable Markov decision processes