Data Quality Assessment in the Wild: Findings from GitHub