A multi-scale semantic attention representation for multi-label image recognition with graph networks