How Robust Are Cross-Country Comparisons of PISA Scores to the Scaling Model Used?