Warm-started encoder-decoder models (Bert2Gpt2 and Bert2Bert)

Thanks. That works.