A Picture Might Be Worth a Thousand Words, But It's Not Always Enough to Evaluate Robots