Enabling web services to consume and produce large distributed datasets