Guided Adaptive Credit Assignment for Sample Efficient Policy Optimization