A critical problem in benchmarking and analysis of evolutionary computation methods