Assessing the Impact of OCR Quality on Downstream NLP Tasks