The Klarna Product Page Dataset: A Realistic Benchmark for Web Representation Learning