Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding