Probability distribution of dependency distance

This paper investigates probability distributions of dependency distances in six texts ex- tracted from a Chinese dependency treebank. The fitting results reveal that the investigated distribu- tion can be well captured by the right truncated Zeta distribution. In order to restrict the model only to natural language, two samples with randomly generated governors are investigated. One of them can be described e.g. by the Hyperpoisson distribution, the other satisfies the Zeta distribution. The paper also presents a study on sequential plot and mean dependency distance of six texts with three analyses (syntactic, and two random). Of these three analyses, syntactic analysis has a minimum (mean) dependency distance.