Cross-modal guiding and reweighting network for multi-modal RSVP-based target detection