🤗Transformers

Topic	Replies	Views	Activity
HF Trainer: HF Trainer causes a problem while fine-tuning T5 (T5 doesn't generate the EOS token at the proper point) 🤗Transformers	0	824	March 6, 2022
How to continue BERT training 🤗Transformers	1	1355	March 4, 2022
Model.generate() OOM on 1 of 2 GPUs? 🤗Transformers	4	1698	March 4, 2022
What are the goals in Positional Embedding methods? 🤗Transformers	2	507	March 3, 2022
Any examples on VisualBERTforMultipleChoice 🤗Transformers	1	418	March 3, 2022
After vocabulary extension the tokenizer keeps on running 🤗Transformers	0	321	March 2, 2022
How to use only one BERT for a generation task with the 'past_key_values' mechanism? 🤗Transformers	2	797	March 1, 2022
Use Trainer API with two validation sets 🤗Transformers	1	1873	February 28, 2022
How to remove input from generated text in GPTNeo? 🤗Transformers	0	987	March 1, 2022
Word embedding with BERT 🤗Transformers	0	628	February 28, 2022
Self-attention masking for T5 encoder? 🤗Transformers	0	1708	February 27, 2022
BERT for NextSentencePrediction train and inference problem, thanks 🤗Transformers	0	636	February 25, 2022
Add_tokens + finetune 🤗Transformers	0	533	February 25, 2022
BertPreTrainedModel and RobertaPreTrainedModel work, however PreTrainedModel does not work 🤗Transformers	0	1068	February 25, 2022
Errors when training on multi-node single-GPU 🤗Transformers	1	1771	February 25, 2022
DistilHubert: PyTorch to ONNX conversion issue 🤗Transformers	3	739	February 24, 2022
Pipeline text classification with two sequences for each example 🤗Transformers	2	748	February 24, 2022
Self-pretrained model predicts token with -1 index gap 🤗Transformers	0	669	February 22, 2022
Huge disparity between CPU and GPU memory usage? 🤗Transformers	0	406	February 22, 2022
MarianMT training produces "▁" in results 🤗Transformers	1	325	February 21, 2022
Random seed for weight initialization and data order 🤗Transformers	0	1238	February 21, 2022
Use asr-wav2vec2-commonvoice-fr model offline 🤗Transformers	1	937	February 21, 2022
Get embedding from finetuned BertForSequenceClassification model 🤗Transformers	1	3731	February 19, 2022
Transformer for TF 1.15.0? 🤗Transformers	2	1514	February 18, 2022
Errors when fine-tuning using Keras 🤗Transformers	0	670	February 18, 2022
Errors while fine-tuning using Keras 🤗Transformers	2	1203	February 18, 2022
Which Transformers model to use for multi-class classification of a pair of sentences containing a questionnaire 🤗Transformers	0	252	February 18, 2022
Two transformers in one model 🤗Transformers	0	244	February 17, 2022
Sentence length influence on similarity 🤗Transformers	1	396	February 17, 2022
Callbacks for logging results to GPT2 🤗Transformers	1	465	February 16, 2022