DOTA: detect and omit weak attentions for scalable transformer acceleration