CL-SciSumm Shared Task - Team Magma

Finding the cited text spans of a scientific article based on the citation text is a challenging task. In this paper, we present our novel system to identify cited sentence(s) and their residential sections in a reference paper, given a citing text. We define this task as a binary classification problem. We use domain-specific features obtained from ACL terminology. The predictions of the system are generated by a logistic regression classifier, with additional predictions from an Adaboost-decision tree added if the logistic regression predictions do not show sufficient diversity according to a threshold.