Retrieval Augmented Convolutional Encoder-Decoder Networks for Video Captioning