Joint Contextual Transformer and Multi-scale Information Shared Network for Crowd Counting