Semantic similarity analysis of protein data: assessment with biological features and issues