A comparison of LSTM and GRU networks for learning symbolic sequences