Hybrid CNN-transformer network for interactive learning of challenging musculoskeletal images