🤗Transformers

Topic	Replies	Views	Activity
Pre-trained Model for Text Translation 🤗Transformers	0	467	June 30, 2022
Finetuning ByT5 with a batch size of 1 on T4 GPU 🤗Transformers	0	593	June 30, 2022
Wav2Vec2.0 FineTuning distributed training 🤗Transformers	0	350	June 30, 2022
Testing best model after hyperparameter_search 🤗Transformers	0	597	June 29, 2022
How to resume language modeling training with flax? 🤗Transformers	0	243	June 29, 2022
How to covert huggingface .h5 model to original tf-checkpoint? 🤗Transformers	3	432	June 29, 2022
Different classification label produced by 'predictions' and 'label_ids' from Trainer.predict() 🤗Transformers	1	2751	June 28, 2022
Obtain step by step outputs of model.generate 🤗Transformers	0	746	June 28, 2022
Google Colab Crash When Loading Pre-Trained Transformer 🤗Transformers	0	962	June 27, 2022
Difference between "Auto Model" and "Auto Model For Token Classification" in BERT fine tuning 🤗Transformers	1	1782	June 25, 2022
How to re-construct training dataset at epoch begin in Trainer using Callback? 🤗Transformers	0	547	June 25, 2022
ViT problem with GPU usage require image to be numpy 🤗Transformers	3	662	June 24, 2022
GPU memory not being freed between batches 🤗Transformers	0	1741	June 24, 2022
Unable to train Bert by splitting across GPUs 🤗Transformers	0	460	June 24, 2022
TF bert-base-uncased reserves large memory space 🤗Transformers	1	867	June 24, 2022
Using BERT embeddings as input for transformer architecture 🤗Transformers	0	723	June 23, 2022
Does it make sense that continue training BERT by wikipedia corpus drop the GLUE score? 🤗Transformers	0	316	June 22, 2022
I'm facing this problem 🤗Transformers	0	401	June 22, 2022
Getting CrossEntropy loss from beam search scores 🤗Transformers	0	402	June 21, 2022
How to get a model's initial input representation? 🤗Transformers	2	849	June 21, 2022
Difference of performance when finetuning bert use the huggingface or the google official code 🤗Transformers	0	448	June 20, 2022
VisionEncoderDecoder X-Attn Question 🤗Transformers	4	510	June 20, 2022
Snackable Brain Bites for the community 🤗Transformers	0	230	June 18, 2022
Is it possible to set epoch less than 1 when using Trainer 🤗Transformers	1	1293	June 18, 2022
Self-attention query vs key size in gpt2 🤗Transformers	1	1055	June 17, 2022
Error trying to load MarkupLMForPretraining 🤗Transformers	2	553	June 17, 2022
How to decode GPT2 🤗Transformers	3	7818	June 17, 2022
Regarding the seed in HF trainer 🤗Transformers	0	321	June 14, 2022
How to get the scores of a certain beam 🤗Transformers	0	379	June 13, 2022
Encoding video frames using CLIP 🤗Transformers	0	1354	June 12, 2022