Benchmarking Commercial Intent Detection Services with Practice-Driven Evaluations