A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits