Generating optimal structural databases for developing atomistic potentials