Hi,
I’d like to run some fine-tuning experiments with this German BART model, but am finding it difficult to even get started due to the lack of documentation.
From what I can tell, the model is configured as FSMTForConditionalGeneration, which requires language tags to be specified when loading the tokenizer. My naïve guess would be to specify something like ['de', 'de'] (for German) or ['src', 'tgt']; however, doing either of these simply maps any input text to a sequence of identical token IDs (presumably unknown tokens). Below is a minimal example.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# my attempt at passing language tags when loading the tokenizer
tokenizer = AutoTokenizer.from_pretrained("timo/timo-BART-german", ['de', 'de'])

text = "Meine Freunde sind nett aber sie essen zu viel Kuchen."
input_ids = tokenizer([text], add_special_tokens=False, return_tensors='pt')['input_ids']
print(input_ids)
# tensor([[3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3,
#          3, 3, 3, 3, 3, 3, 3, 3, 3, 3]])
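If the tags are instead meant to be passed to the tokenizer as a keyword argument, I'd expect something like the sketch below, but that's purely a guess on my part: I'm assuming the parameter is called langs and that FSMTTokenizer forwards it from from_pretrained, so please correct me if that's off.

from transformers import FSMTTokenizer

# assumption on my part: language tags supplied via a `langs` keyword argument
tokenizer = FSMTTokenizer.from_pretrained("timo/timo-BART-german", langs=["de", "de"])

text = "Meine Freunde sind nett aber sie essen zu viel Kuchen."
print(tokenizer([text], return_tensors="pt")["input_ids"])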
Is anyone able to point me in the direction of a good tutorial/guide on how to get started with community models? Or better yet, @timo, any chance of providing a model card for this model to give an idea of its status/usability?
Thanks in advance!