UDaaS: A Cloud-Based URL-Deduplication-as-a-Service for Big Datasets

Since the number of potential malicious URLs from diverse sources is large, URL deduplication is needed for the efficient identification of malicious websites. URL Deduplication-as-a-Service (UDaaS) was developed to help a URL analyst to deploy and manage a cloud-based distributed and parallel URL deduplication infrastructure easily, this can improve the performance of malicious websites detection while reducing duplication and quantity of local storage requirements.