This chapter focuses on how we can support what we would call large scale, data-driven humanities, that is, research that is based on the close examination of research materials from resources that are scattered across diverse repositories. It introduces the reader to the tradition of digital humanities, as we do not expect a detailed background knowledge of the field. Then, it continues by describing the specifics of data involved in the arts and humanities before it describes a case study to analyze large amounts of data in the humanities. It covers the Digital Research Infrastructure for the Arts and Humanities (DARIAH) experiment to prepare an infrastructure to deploy virtual research environments on the European e-Infrastructure.