Developing an automatic metadata harvesting and generation system for a continuing education repository: a pilot study
暂无分享,去创建一个
The goal of this pilot study is to assess the effectiveness and reliability of an automated metadata generation and harvesting system developed for a project repository which hosts continuing education resources for cataloging and metadata professionals. Using a web crawler developed for the repository, 500 web resources are selected as seed pages for metadata extraction and generation. This paper summarizes the processes as well as the results of the study. The metadata harvesting system combined with powerful article analysis and data generation tools such as Adlegant’s Article Anaylsis API produces significant improvement in metadata generation.
[1] Yuji Tosaka,et al. Developing an automatic crawling system for populating a digital repository of professional development resources: A pilot study , 2016 .
[2] Jane Greenberg,et al. Functionalities for automatic metadata generation applications: a survey of metadata experts' opinions , 2006, Int. J. Metadata Semant. Ontologies.
[3] Jung-ran Park,et al. Evaluation of Semi-Automatic Metadata Generation Tools: A Survey of the Current State of the Art , 2015 .