PubFlow: provenance-aware workflows for research data publication

In this paper we present a workflow oriented data publication framework called PubFlow. PubFlow is an ongoing research project with the goal to create a framework, which alleviates the process of data publication. A main feature of PubFlow is its provenance capturing mechanism. We also present an approach for collecting provenance information in a scientific workflow environment like PubFlow and give an outlook on a data archive for storing provenance information. This archive will be based on a NoSQL graph database and the W3C provenance ontology PROVO.