Natural language supervision with a large and diverse dataset builds better models of human high-level visual cortex