Similarity analysis of DNA sequences based on k-word