Fusion-in-Decoder models

Are there any implementations for Fusion-in-Decoder models?

Same question here. It seems I will have to hack it together myself in PyTorch.

There is a version from ParlAI which works very well:

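The original implementation is actually based on Hugging Face Transformers. It is just a simple (yet very effective) wrapper around the T5 encoder: each retrieved passage is encoded independently, and the decoder then attends over the concatenation of all the encoded passages.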
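
To make the wrapper idea concrete, here is a minimal sketch in plain PyTorch. It is not the original code: `FusionEncoder` and `toy_encoder` are hypothetical names, and an `nn.Embedding` stands in for the T5 encoder so the example runs without downloading a model. A real version would also need to pass attention masks through and hand the fused states to the T5 decoder.

```python
import torch
from torch import nn


class FusionEncoder(nn.Module):
    """Hypothetical wrapper: encode each passage on its own, then
    concatenate the per-passage outputs along the sequence axis."""

    def __init__(self, encoder: nn.Module):
        super().__init__()
        self.encoder = encoder

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:
        # input_ids: (batch, n_passages, seq_len)
        batch, n_passages, seq_len = input_ids.shape
        # Fold passages into the batch axis so the encoder sees each
        # passage independently (no attention across passages).
        flat_ids = input_ids.reshape(batch * n_passages, seq_len)
        hidden = self.encoder(flat_ids)  # (batch * n_passages, seq_len, d_model)
        # Unfold and concatenate passages along the sequence axis, giving
        # the decoder one long sequence to cross-attend over.
        return hidden.reshape(batch, n_passages * seq_len, hidden.size(-1))


# Toy stand-in for the T5 encoder (assumption, for illustration only).
toy_encoder = nn.Embedding(num_embeddings=100, embedding_dim=8)
fusion = FusionEncoder(toy_encoder)

input_ids = torch.randint(0, 100, (2, 3, 5))  # 2 questions, 3 passages, 5 tokens
fused = fusion(input_ids)
print(tuple(fused.shape))  # (2, 15, 8)
```

Since the encoder cost grows linearly with the number of passages and only the decoder's cross-attention sees all of them at once, this layout stays cheap even with many retrieved passages.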