Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review