Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric