Joint source–target encoding with pervasive attention