Developing heterogeneous corpora using the Digital Replay System (DRS).

This paper reports on the latest developments made as part of the ESRC funded Understanding Digital Records for eSocial Science Project (DReSS) at the University of Nottingham. Specifically, it reports on some of the issues and challenges that are currently being faced in compilation and use of heterogeneous multi-modal corpora comprised of heterogeneous datasets; discussing some of the optimum ways in which these datasets are recorded, processed, stored and accessed/interrogated by the linguist. The paper profiles the Digital Replay System (DRS) software which is being developed to support these processes.