Perception robustness testing at different levels of generality