Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 Treatments