Overlap and diversity in antimicrobial peptide databases: compiling a non-redundant set of sequences