Data set and fitting dependencies when estimating protein mutant stability: Toward simple, balanced, and interpretable models