The meaninglessness of `Sit-and-stare' -- How Vision, Action, and Understanding are inseparable