Benchmarking methods for detecting differential states between conditions from multi-subject single-cell RNA-seq data