From Individual to Whole: Reducing Intra-class Variance by Feature Aggregation