Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning