Gramformer: Learning Crowd Counting via Graph-Modulated Transformer