Lessons on Parameter Sharing across Layers in Transformers