ER-Test: Evaluating Explanation Regularization Methods for Language Models