Sequentially valid tests for forecast calibration