BIOINFORMATICS A robust clustering algorithm for identifying problematic samples in genome-wide association studies