Twelve Ways to Fool the Masses When Giving Performance Results on Parallel Computers