Deep reinforcement learning assisted reticle floorplanning with rectilinear polygon modules for multiple-project wafer