An optimisation algorithm for matching large-scale databases on customers for improved characterisation of electricity consumption