Additional file 2: of Detecting false positive sequence homology: a machine learning approach