论文信息 - Parallel Writing in East Asian Languages and Its Representation in Metadata in Light of the DCMI Abstract Model

Parallel Writing in East Asian Languages and Its Representation in Metadata in Light of the DCMI Abstract Model

This paper discusses the parallel writing tradition in East Asian languages and its representation in metadata. Parallel writing systems in these languages do not use the same scripts, but they all share a common scheme and have a well-established tradition in bibliographic data. Their data representation in the MARC bibliographic format is handled in a variety of ways. Even in the metadata world, representation of parallel writing shows some inconsistencies. It is therefore desirable to establish a new common way of representation. For this purpose, this paper discusses the class of the represented values in terms of the DCMI Abstract Model (DCAM). In the case of properties such as "Title", it is possible to see the associated value as a "literal", but for parallel writing, it is more appropriate to see such a value as "a sequence of words". Accordingly, parallel writing can be represented as multiple value strings associated with a value of the class "sequence of words". Even so, one remaining problem is that the language tags used in the value string language cannot also specify writing systems. Enumeration of the types of writing systems in various languages and registration with RFC 4646 would be required in order to express this information in DCAM value string languages.

Akira Miyazawa

[1] Mark Davis,et al. Tags for Identifying Languages , 2009, RFC.

[2] Andy Powell,et al. DCMI Abstract Model , 2005 .