🤗Transformers

Topic	Replies	Views	Activity
Error with aggregation_strategy="max", TypeError: Can't convert [' In'] to PyString 🤗Transformers	0	449	April 1, 2022
How to run on CPU? 🤗Transformers	1	10318	April 1, 2022
Understanding how Ray "objective" works with the trainer 🤗Transformers	1	1781	April 1, 2022
Huggingface longformer memory issues 🤗Transformers	0	541	March 31, 2022
Is it correct to replace a class after self.post_init() function? 🤗Transformers	0	1067	March 31, 2022
Gradients verification between JAX/Flax models and PyTorch 🤗Transformers	2	1190	March 31, 2022
How to fine-tune on the CoLA dataset using Transformers and PyTorch? 🤗Transformers	0	310	March 30, 2022
Get sentence ‘B’ predicted, given sentence ‘A’ using Next Sentence Prediction model 🤗Transformers	1	931	March 30, 2022
Can't load tokenizer with added special tokens 🤗Transformers	0	832	March 29, 2022
How to make Trainer train the model one epoch at a time? 🤗Transformers	1	1836	March 29, 2022
Are Word Embeddings by BERT generated for long sequences better than ones generated for short sequences? 🤗Transformers	0	238	March 29, 2022
Training loss increases suddenly at the beginning of each epoch 🤗Transformers	1	1682	March 29, 2022
Learning rate and checkpoints 🤗Transformers	0	441	March 29, 2022
Is the trainer's seed reset at every model_init? 🤗Transformers	4	1258	March 28, 2022
RuntimeError: params[0] in this process with sizes [253991, 1024] appears not to match sizes of the same param in process 0 🤗Transformers	0	653	March 28, 2022
Huggingface transformer sequence classification 🤗Transformers	3	495	March 26, 2022
Controlled Text Generation 🤗Transformers	2	2588	March 26, 2022
Transformers replacing loss function 🤗Transformers	0	3385	March 26, 2022
Using weights with transformers huggingface - running on GPUs 🤗Transformers	0	1521	March 26, 2022
🚀 Optimum Transformers: accelerated NLP pipelines with Infinity speed 🤗Transformers	4	665	March 25, 2022
How to train model without shuffling data with multi-GPU 🤗Transformers	0	241	March 25, 2022
How to add a customized logger to Trainer? 🤗Transformers	0	656	March 25, 2022
Huggingface sequence classification unfreezing layers 🤗Transformers	2	1319	March 24, 2022
Bug: Fine-tuning XLM-RoBERTa-large on XNLI gets 0.33 accuracy while XLM-RoBERTa-base works fine 🤗Transformers	0	343	March 23, 2022
GPT2LMHeadModel not loading from pretrained 🤗Transformers	2	4808	March 23, 2022
Combine BertForSequenceClassification with Additional Features 🤗Transformers	3	9495	March 23, 2022
What is rust_model.ot and how can I use it? 🤗Transformers	4	6005	March 23, 2022
How to find the beam search score for any target output? (BartForConditionalGeneration) 🤗Transformers	0	1449	March 22, 2022
Decoder attention mask in text2text/seq2seq generation encoder-decoder models 🤗Transformers	1	1642	March 22, 2022
Flexible "Keep" threshold for DETR model 🤗Transformers	0	228	March 22, 2022