A Privacy-Enhanced Microaggregation Method

Microaggregation is a statistical disclosure control technique for protecting microdata (i.e., individual records), which are important products of statistical offices. The basic idea of microaggregation is to cluster individual records in microdata into a number of mutually exclusive groups prior to publication, and then publish the average over each group instead of individual records. Previous methods require fixed or variable group size in clustering in order to reduce information loss. However, the security aspect of microaggregation has not been extensively studied. We argue that the group size requirement is not enough for protecting the privacy of microdata. We propose a new microaggregation method, which we call secure-k-Ward, to enhance the individual's privacy. Our method, which is optimization based, minimizes information loss and overall mean deviation while at the same time guarantees that the security requirement for protecting the microdata is satisfied.