Optimizing accuracy and efficacy in data-driven materials discovery for the solar production of hydrogen