Learning Optimal Treatment Strategies for Sepsis Using Offline Reinforcement Learning in Continuous Space