The PSG challenge: towards comprehensive scene understanding