🤗Transformers

Topic	Replies	Views	Activity
How to continue training on another dataset? 🤗Transformers	1	864	April 14, 2022
Inconsistency in hyperparameter search results 🤗Transformers	2	641	April 13, 2022
ValueError: too many dimensions 'str' 🤗Transformers	0	3009	April 13, 2022
Move Trainer out of GPU 🤗Transformers	0	379	April 13, 2022
Can we force first token by model.config.forced_bos_token_id? 🤗Transformers	0	667	April 12, 2022
How to customize output layer (activate function) of existing models with training arguments? 🤗Transformers	1	1561	April 12, 2022
Documentation on MMBTModel, MMBTConfig and using the MMBT model in general 🤗Transformers	0	654	April 12, 2022
How to save hugging face fine tuned model using pytorch and distributed training 🤗Transformers	0	1297	April 12, 2022
How to prune a model trained with LayerDrop 🤗Transformers	0	344	April 11, 2022
Not able to reload all weights after training 🤗Transformers	0	575	April 11, 2022
Sentiment Analysis Portuguese 🤗Transformers	1	1761	April 11, 2022
Is there a way that I can export XLNet to onnx 🤗Transformers	0	431	April 10, 2022
Add Custom Token-Level Features 🤗Transformers	0	300	April 8, 2022
Easiest way to get a senetence embedder from a transformers model? 🤗Transformers	1	1398	April 7, 2022
Fine-tuned pre-trained Roberta model on different labels 🤗Transformers	0	639	April 7, 2022
Using Trainer for BertForPretraining does not work 🤗Transformers	1	1351	April 6, 2022
How does FillMaskPipeline work with Subword-Tokenization? 🤗Transformers	1	428	April 6, 2022
T5ForConditionalGeneration, How to get prediction probabilities or logits at the inference time? (to calculate perplexity) 🤗Transformers	0	694	April 5, 2022
Huggingface classification struggling with prediction 🤗Transformers	0	835	April 5, 2022
What are the product quantization vectors 🤗Transformers	0	269	April 5, 2022
Is zeroshot classification tokenizing the input sequence more than once? 🤗Transformers	0	212	April 5, 2022
Access Quantization module in wave2vec2 🤗Transformers	0	256	April 5, 2022
Can't make inference from Longformer model build on top of MBART 🤗Transformers	0	469	April 4, 2022
`run_translation.py` example is erroring out with the recommended settings DeepSpeed	1	6262	April 4, 2022
Accelerated Inference for gpt-j using javascript 🤗Transformers	1	532	April 3, 2022
Learning rate and Data size 🤗Transformers	1	617	April 2, 2022
New layer in bert embeddings 🤗Transformers	1	684	April 1, 2022
After loading minilm, if I print the model it still shows as BertModel 🤗Transformers	0	267	April 1, 2022
How loss is calculated in MLM training 🤗Transformers	0	856	April 1, 2022
Error with aggregation_strategy="max", TypeError: Can't convert [' In'] to PyString 🤗Transformers	0	449	April 1, 2022