Benchmarks for Physical Reasoning AI