What Makes Convolutional Models Great on Long Sequence Modeling?