Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks