Joint CNN and Transformer Network via weakly supervised Learning for efficient crowd counting