论文信息 - Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models - 字舞流文

Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models

Lisa Anne Hendricks | William S. Isaac | John F. J. Mellor | Geoffrey Irving | Po-Sen Huang | Sumanth Dathathri | Laura Weidinger | Iason Gabriel | Johannes Welbl | J. Uesato | M. Rauh | A. Glaese | G. Irving