Exploring Predictive States via Cantor Embeddings and Wasserstein Distance