Improving Web site understanding with keyword-based clustering