A Monte Carlo Method for Evaluating Empirical Gyrochronology Models and Its Application to Wide Binary Benchmarks