Mystique: Accurate and Scalable Production AI Benchmarks Generation