Systematic Characterizations of Text Similarity in Full Text Biomedical Publications