Encoding standards for large text resources: The Text Encoding Initiative
暂无分享,去创建一个
The Text Encoding Initiative (TEI) is an international project established in 1988 to develop guidelines for the preparation and interchange of electronic texts for research, and to satisfy a broad range of uses by the language industries more generally. The need for standardized encoding practices has become inxreasingly critical as the need to use and, most importantly, reuse vast amounts of electronic text has dramatically increased for both research and industry, in particular for natural language processing. In January 1994, the TEI issued its Guidelines for the Encoding and Interchange of Machine-Readable Texts, which provide standardized encoding conventions for a large range of text types and features relevant for a broad range of applications.
[1] Steven J. DeRose,et al. Markup systems and the future of scholarly text processing , 1987, CACM.
[2] Charles F. Goldfarb,et al. SGML handbook , 1990 .
[3] Eric van Herwijnen,et al. Practical SGML , 1994, Springer US.
[4] Susan Armstrong-Warwick. Acquisition and Exploitation of Textual Resources for NLP , 1994 .
[5] C. M. Sperberg-McQueen,et al. Guidelines for electronic text encoding and interchange , 1994 .