Between the Devil and the Deep Blue Sea: Tensions Between Scientific Judgement and Statistical Model Selection