BLESS: Benchmarking Large Language Models on Sentence Simplification