Scholarly big data quality assessment: a case study of document linking and conflation with S2ORC