Learning Representations of Categorical Feature Combinations via Self-Attention