G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment