ART-2b: Adapted ART-2a for large scale data clustering on PM2.5 mass spectra

ART-2a has been shown to be effective against stream data clustering with unknown number of cluster in nature. As data grows, ART-2a running time become a major problem. We proposed a new algorithm, ART-2b, whose runtime performance is linear to the number of input instances, while still maintaining similar clustering result to ART-2a.