Benchmarking the accuracy of structure-based binding affinity predictors on Spike-ACE2 Deep Mutational Interaction Set