Truly Deterministic Policy Optimization