Deep reinforcement learning in NOMA-assisted UAV networks for path selection and resource offloading