A Web redirection service for variant Chinese domain name resolution

Over the past years, many efforts have been made to lower the linguistic barriers that non-native English speakers face when accessing the Internet. IDNA (Faltstrom et al., 2003) enables access to internationalized domain names (IDN) in a range of scripts broader than the original ASCII. However, character variants that have similar appearances and/or interpretations can create confusion. A variant IDL (internationalized domain label), derived from an IDL by replacing some characters with their variants, should match the original IDL. The JET Guidelines (Konishi et al., 2004) suggest that zone administrators model this concept of equivalence as an atomic IDL package that contains the variant IDLs generated according to the language variant tables (LVT). In addition to the registered IDL, some of the variant IDLs are activated and stored in the zone files, and thus become resolvable. However, a scalability issue arises when the number of activated variant IDLs is large. In this paper, we present a mechanism that resolves the variant IDLs in an IDL package to the registered IDL. Specifically, we target Han character variants. Two Han characters are said to be variants of each other if they have the same meaning and the same pronunciation. Furthermore, Han character variants usually have similar appearances. We introduce a new resource record (RR) type, denoted VarIdx, to associate the variant IDLs with the registered IDL. Experimental results show that a small number of VarIdx RRs is sufficient to enumerate all variant IDLs in an IDL package. We then present a redirection service that uses VarIdx RRs to redirect user requests containing variant IDLs to the URLs with the corresponding registered IDLs.
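To make the notion of an IDL package and its scalability problem concrete, the following is a minimal Python sketch. It is not the VarIdx mechanism of the paper, whose record format is not described here. It assumes a hypothetical two-entry variant table (`VARIANT_GROUPS`) and hypothetical helper names (`enumerate_variants`, `package_key`, `build_index`, `resolve`). It shows that the number of variant IDLs grows multiplicatively with the number of characters that have variants, and that mapping each label to a single per-package key lets any variant be resolved to the registered IDL without storing every variant.

```python
from itertools import product

# Hypothetical toy variant table (an assumption for illustration only):
# each tuple is one group of mutually variant Han characters.
VARIANT_GROUPS = [("国", "國"), ("华", "華")]

# Map every character that has variants to its whole group.
CHAR_TO_GROUP = {ch: group for group in VARIANT_GROUPS for ch in group}


def enumerate_variants(label):
    """Return every variant label in the package of `label`, including itself."""
    # A character without variants contributes a single choice: itself.
    choices = [CHAR_TO_GROUP.get(ch, (ch,)) for ch in label]
    return {"".join(combo) for combo in product(*choices)}


def package_key(label):
    """Replace each character by the first member of its variant group.

    All labels in one package share the same key.
    """
    return "".join(CHAR_TO_GROUP.get(ch, (ch,))[0] for ch in label)


def build_index(registered_labels):
    """Index registered labels by package key (one entry per package)."""
    return {package_key(label): label for label in registered_labels}


def resolve(label, index):
    """Return the registered label for `label`, or None if no package matches."""
    return index.get(package_key(label))


if __name__ == "__main__":
    index = build_index(["中國"])
    print(sorted(enumerate_variants("中华民国")))
    print(resolve("中国", index))
```

In this sketch a label with k characters that each have two variants yields 2^k variant labels, while the index holds only one entry per registered label. A real deployment would draw its variant groups from the language variant tables rather than from a hard-coded list.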

[1] Matt Crawford, et al. Non-Terminal DNS Name Redirection, 1999, RFC.

[2] IEEE, et al. Information Technology-Portable Operating System Interface, 1990.

[3] John Klensin. A Search-based access model for the DNS, 2004.

[4] Adam M. Costello. Punycode: A Bootstring encoding of Unicode for Internationalized Domain Names in Applications (IDNA), 2003, RFC.

[5] Yang Woo Ko, et al. Joint Engineering Team (JET) Guidelines for Internationalized Domain Names (IDN) Registration and Administration for Chinese, Japanese, and Korean, 2004, RFC.

[6] Paul V. Mockapetris, et al. Domain names: Concepts and facilities, 1983, RFC.

[7] Ron Daniel, et al. The Naming Authority Pointer (NAPTR) DNS Resource Record, 2000, RFC.

[8] Paul E. Hoffman, et al. Preparation of Internationalized Strings ("stringprep"), 2002, RFC.

[9] Paul E. Hoffman, et al. Internationalizing Domain Names in Applications (IDNA), 2003, RFC.

[10] Joan M. Aliprand. The Unicode Standard, 1996.

[11] Paul V. Mockapetris, et al. Domain names - implementation and specification, 1987, RFC.

[12] Butler W. Lampson, et al. Designing a global name service, 1986, PODC '86.

[13] Paul E. Hoffman, et al. Nameprep: A Stringprep Profile for Internationalized Domain Names (IDN), 2003, RFC.

[14] XiaoDong Lee. Traditional and Simplified Chinese Conversion, 2001.

[15] Peter B. Danzig, et al. An analysis of wide-area name server traffic: a study of the Internet Domain Name System, 1992, SIGCOMM 1992.

[16] Li-Ming Tseng. Internationalized Domain Names and Unique Identifiers/Names, 2001.