Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer