Data munging with Hadoop