A Unified Model for Tracking and Image-Video Detection Has More Power