Join Sizes, urn Models and Normal Limiting Distributions

Abstract We study some parameters of relational databases (sizes of relations obtained by a join) that can be described by generating functions on three variables, of the kind ϕ( x, y, z ) d . We modelize these parameters by suitable urn models and give conditions under which they asymptotically follow a gaussian distribution.