Reinforcement learning-based dynamic production-logistics-integrated tasks allocation in smart factories