Learning Mixture of Neural Temporal Point Processes for Multi-dimensional Event Sequence Clustering