Analyzing Influential Factors in Human Preference Judgments via GPT-4