🤗Transformers

Topic	Replies	Views	Activity
PyTorchBenchmark pickle local object error 🤗Transformers	0	327	June 3, 2023
Bert-base-uncased performs badly in next sentence prediction (bookcorpus) 🤗Transformers	0	342	June 2, 2023
Forward() got an unexpected keyword argument 'attention_mask' in Whisper Tutorial 🤗Transformers	1	4602	June 2, 2023
How to use Adaptive Learning rate during training? 🤗Transformers	4	1618	June 2, 2023
Trainer gives error after 1st epoch and evaluation 🤗Transformers	4	4751	June 2, 2023
Unexpected results wth XLMR trasformer models 🤗Transformers	0	193	June 2, 2023
Logging_steps=1 => ValueError 🤗Transformers	0	327	June 2, 2023
AssertionError: Torch not compiled with CUDA enabled 🤗Transformers	0	2954	June 1, 2023
Stopping generation before max_new_tokens 🤗Transformers	0	803	June 1, 2023
FP-16 training producing nans on t5-large/flan-t5-xl 🤗Transformers	0	728	June 1, 2023
MLM Using AlBert - No loss error 🤗Transformers	0	361	June 1, 2023
Continuing model training takes seconds in next round 🤗Transformers	3	1429	June 1, 2023
Fail predict using Falcon-7B-Instruct 🤗Transformers	0	660	June 1, 2023
How to make the Trainer log custom quantities? 🤗Transformers	0	562	May 31, 2023
I am getting 0.0 loss value at the very first epoch of training bigscience/mt0-small seq2seq model 🤗Transformers	0	524	May 31, 2023
Seq2SeqTrainer with num_beams and generation_config 🤗Transformers	0	276	May 31, 2023
How to use a custom embedding layer as input in get_encoder function 🤗Transformers	0	204	May 30, 2023
Query execution with hugging face pipeline is happening on CPU, even if model is loaded on GPU 🤗Transformers	0	976	May 30, 2023
Inference with hugging face pipeline happening on CPU, even if model is loaded on GPU 🤗Transformers	0	1719	May 30, 2023
Error in Seq2SeqTrainingArguments 🤗Transformers	3	948	May 30, 2023
Pre-Train BERT from scratch 🤗Transformers	5	15764	May 30, 2023
The quantization code in the "Gentle Introduction to 8-bit Matrix Multiplication for transformers" blog post yields error 🤗Transformers	1	727	May 29, 2023
Trainer.__init__() got an unexpected keyword argument 'model' 🤗Transformers	1	6216	May 29, 2023
How can I save vocab for specific language in Model Whisper? 🤗Transformers	0	290	May 29, 2023
Finetuning GPT2 using Multiple GPU and Trainer 🤗Transformers	14	6793	May 22, 2023
Tokenizer cannot produce correct output once using DistributedDataParallel 🤗Transformers	0	265	May 26, 2023
Finetuning using transformers 🤗Transformers	0	239	May 26, 2023
Causal language modeling documentation is wrong? 🤗Transformers	0	171	May 26, 2023
Training HF transformer models on custom (not text) data 🤗Transformers	0	213	May 26, 2023
Unlabelled zero-shot-classification 🤗Transformers	1	471	May 26, 2023