Compression of a Set of Strings
Many compression methods use a dictionary, and some of them store the dictionary in the compressed message. In such cases, better dictionary compression improves the performance of the whole compression method. We focus on compressing dictionaries of words or syllables. Dictionary compression is often based on a tree representation of the dictionary, and we expect that a proper encoding of the tree can save a considerable amount of space. We therefore concentrate on minimizing the information stored for each tree node. For each node we store only the number of its children, encoded by the Elias gamma code; the difference between the node's character and that of its left sibling, encoded by the Elias delta code; and a single bit indicating whether the node represents a dictionary item. The difference from the left sibling is omitted for the root, and the dictionary-item bit is omitted for leaves.
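The node layout described above can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it relies on several assumptions the abstract does not state: nodes are written in pre-order, the child count is stored as count + 1 (because the Elias gamma code cannot represent 0), the first child's difference is taken from a baseline of 0, a leaf is always treated as a dictionary item, and the dictionary is non-empty. The helper names (`build_trie`, `encode`, `decode`) are hypothetical, and bits are kept as a text string of "0" and "1" for readability.

```python
def elias_gamma(n: int) -> str:
    """Elias gamma code of n >= 1: (bit length - 1) zeros, then n in binary."""
    if n < 1:
        raise ValueError("Elias gamma is defined only for n >= 1")
    binary = bin(n)[2:]
    return "0" * (len(binary) - 1) + binary


def elias_delta(n: int) -> str:
    """Elias delta code of n >= 1: gamma(bit length), then n without its leading 1."""
    if n < 1:
        raise ValueError("Elias delta is defined only for n >= 1")
    binary = bin(n)[2:]
    return elias_gamma(len(binary)) + binary[1:]


def build_trie(words):
    """Build a plain dict-based trie (hypothetical helper for this sketch)."""
    root = {"children": {}, "is_word": False}
    for word in words:
        node = root
        for ch in word:
            node = node["children"].setdefault(ch, {"children": {}, "is_word": False})
        node["is_word"] = True
    return root


def encode(node, diff=None) -> str:
    """Encode a node and its subtree in pre-order; diff is None for the root."""
    kids = sorted(node["children"].items())
    out = [elias_gamma(len(kids) + 1)]      # assumption: store count + 1
    if diff is not None:                    # the root has no left sibling
        out.append(elias_delta(diff))
    if kids:                                # leaves omit the dictionary-item bit
        out.append("1" if node["is_word"] else "0")
    prev = 0                                # assumption: first child differs from 0
    for ch, child in kids:
        out.append(encode(child, ord(ch) - prev))
        prev = ord(ch)
    return "".join(out)


def decode(bits: str):
    """Rebuild the word list from a bit string produced by encode()."""
    pos = 0
    words = []

    def read_gamma() -> int:
        nonlocal pos
        zeros = 0
        while bits[pos] == "0":
            zeros += 1
            pos += 1
        value = int(bits[pos:pos + zeros + 1], 2)
        pos += zeros + 1
        return value

    def read_delta() -> int:
        nonlocal pos
        length = read_gamma()
        value = int("1" + bits[pos:pos + length - 1], 2)
        pos += length - 1
        return value

    def walk(prefix: str, prev):
        nonlocal pos
        n_children = read_gamma() - 1
        code = None
        if prev is not None:                # non-root: recover the character
            code = prev + read_delta()
            prefix += chr(code)
        if n_children > 0:
            is_word = bits[pos] == "1"
            pos += 1
        else:
            is_word = True                  # assumption: a leaf is always an item
        if is_word:
            words.append(prefix)
        last = 0
        for _ in range(n_children):
            last = walk(prefix, last)
        return code

    walk("", None)
    return words
```

A round trip such as `decode(encode(build_trie(["a", "ab", "ad", "b"])))` should return the same words in sorted order. Because siblings are sorted by code point, every difference after the first child is at least 1, which is what makes the Elias delta code applicable here.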