Lightweight feature encoder for wake-up word detection based on self-supervised speech representation