Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features