Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets