Construction of an Infrastructure for Providing Users with Suitable Language Resources
Our research organization has been constructing a large-scale database named SHACHI by collecting detailed meta-information on language resources (LRs) from Asian and Western countries. The database holds metadata for more than 2,000 compiled LRs, such as corpora, dictionaries, thesauri and lexicons, forming a large-scale archive of LR metadata. Its metadata set, an extended version of the OLAC metadata set conforming to Dublin Core, has been populated semi-automatically. This paper explains the design and structure of the metadata database, as well as the implementation of the catalogue search tool.
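To illustrate the kind of catalogue search described above, the following is a minimal sketch only. It assumes a simplified record with a few Dublin Core style elements (title, type, language, description) plus a free-form extension field. The class `ResourceRecord`, the function `search`, the `extra` field and the sample entries are all hypothetical and do not reflect the actual SHACHI schema or tool.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ResourceRecord:
    """Hypothetical metadata record; not the actual SHACHI schema."""
    title: str                      # Dublin Core style "title"
    resource_type: str              # e.g. "corpus", "dictionary"
    languages: List[str]            # language codes covered by the resource
    description: str = ""           # Dublin Core style "description"
    # Assumed container for extended elements beyond the base set.
    extra: Dict[str, str] = field(default_factory=dict)


def search(records: List[ResourceRecord],
           resource_type: Optional[str] = None,
           language: Optional[str] = None,
           keyword: Optional[str] = None) -> List[ResourceRecord]:
    """Return records matching every criterion that is not None."""
    results = []
    for record in records:
        if resource_type and record.resource_type != resource_type:
            continue
        if language and language not in record.languages:
            continue
        text = (record.title + " " + record.description).lower()
        if keyword and keyword.lower() not in text:
            continue
        results.append(record)
    return results


# Invented sample entries, for illustration only.
catalogue = [
    ResourceRecord("Example Spoken Corpus", "corpus", ["ja"],
                   "Transcribed dialogues", {"usage": "speech recognition"}),
    ResourceRecord("Example Bilingual Dictionary", "dictionary", ["ja", "en"],
                   "General vocabulary"),
    ResourceRecord("Example News Corpus", "corpus", ["en"], "Newswire text"),
]

hits = search(catalogue, resource_type="corpus", language="ja")
print([record.title for record in hits])
```

Filtering on a small set of shared elements is what makes a common metadata set useful for this purpose: resources described by different providers become comparable once they are mapped onto the same fields.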