What you see is what you do: on the relationship between gaze and gesture in multimodal alignment