A novel deep reinforcement learning for POMDP-based autonomous ship collision decision-making