Building quality into a digital library
暂无分享,去创建一个
The Web Characterization Repository contains a collection of internet log files used by researchers to analyze and improve on the architecture of the Web. This repository improves on prior collections by thoroughly testing the log files for format to assure a degree of data quality. Instituting quality control into the digital library addressed many complex issues including technical support for quality assessment, the definition of a workflow to achieve quality control, the assignment of tasks to different people and the definition and automation of quality assessment for log files. By reaching realistic compromises on these issues it was possible to build quality control as an integral part of the digital library.