Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks