De novo clustering methods outperform reference-based methods for assigning 16S rRNA gene sequences to operational taxonomic units