Off-line Deep Reinforcement Learning for Maintenance Optimization