Gathering Datasets for Activity Identification

The area of activity identification is maturing well in the HCI and ubiquitous computing fields. However, although algorithm development is proceedings well, without publicly available datasets on which to compare results it is difficult to consolidate the disparate work being done. This problem exists because realistic datasets describing human activity are difficult and expensive to gather and because there are significant barriers to releasing the data once gathered. We review positive recent development with the release of two high-quality datasets. From our experiences using these datasets we list some recommendations for the gathering and release of future datasets. Finally, we propose a strategy of our own for gathering a new dataset from these recommendations.