Cross-scale Attention Model for Acoustic Event Classification