I haven't tried Marian yet, but it seems interesting. It's for translation, right? Since I used translation mode for my problem, it could definitely work.
I also found excellent pre-trained models on TF Hub, but they are not fine-tunable (according to the page): Transformer-XL models pre-trained on Wiki-40B (a new dataset covering 40 languages), with a separate model for each language. At least for me this would be the ultimate model: seq2seq, unlimited sequence length, and 41 languages. See https://tfhub.dev/google/collections/wiki40b-lm/1