🤗Transformers

Topic	Replies	Views	Activity
T5 training with Trainer, w/ AdaFactor 🤗Transformers	0	959	February 12, 2023
I would like to finetune the blip model on ROCO data set for image captioning of chest x-rays 🤗Transformers	0	592	February 12, 2023
MarkupLM model applied to html longer than 512 🤗Transformers	0	232	February 11, 2023
KeyError: 0 in data_collator.py 🤗Transformers	0	542	February 11, 2023
Training Transformer doesn't reach full GPU usage 🤗Transformers	0	540	February 10, 2023
German Sentiment (tokenizer)? 🤗Transformers	2	673	February 10, 2023
Unbale to deploy layoutlmv2 document image classification( RVL-CDIP) DeepSpeed	0	236	February 9, 2023
Support for exporting generate function to ONNX? 🤗Transformers	7	2330	February 8, 2023
Why OPT's token embeddings are not scaled by sqrt(dim) as in the original OPT implementation? 🤗Transformers	3	319	February 8, 2023
How to add more labels for prediction in our pre existing model 🤗Transformers	0	265	February 8, 2023
Reduced WavLMForXVector performance on LibriSpeech 🤗Transformers	1	170	February 7, 2023
How to speed up Blenderbot inference with Sagemaker? 🤗Transformers	0	417	February 7, 2023
Finetuning T5 large for paraphrasing multiple time with the same parameters and data gives different results 🤗Transformers	2	845	February 7, 2023
Optimizations and cloud instance characteristics for Flan-T5 real-time inference 🤗Transformers	0	547	February 7, 2023
How to make bart-cnn-large output end of sentence boundary in summarization 🤗Transformers	0	290	February 7, 2023
A possible bug in the transformers logging file 🤗Transformers	0	203	February 7, 2023
How to obtain correct text embeddings from CLIP? 🤗Transformers	1	9066	February 6, 2023
How to store hugging face model with flask postgresql 🤗Transformers	0	209	February 5, 2023
Electra-base returns always same output 🤗Transformers	0	226	February 5, 2023
Funetuning Longt5 Parameters which did not receive grad during training 🤗Transformers	0	784	February 3, 2023
Imbalanced data in ner task 🤗Transformers	0	628	February 3, 2023
Multi gpu not working 🤗Transformers	2	2234	February 3, 2023
How to deal with DataCollator and DataLoaders in Huggingface? DeepSpeed	0	1155	February 2, 2023
Using onnx for text-generation with GPT-2 🤗Transformers	4	4100	February 3, 2023
Input format for T5 model in Question Answering task 🤗Transformers	0	749	February 3, 2023
How to pre-train the language model in huggingface? 🤗Transformers	0	401	February 2, 2023
Both `max_new_tokens` and `max_length` have been set but they serve the same purpose 🤗Transformers	0	1636	February 2, 2023
Subword regularization in Sentencepiece and DeBERTaV2 tokenizers (not working) 🤗Transformers	0	698	February 1, 2023
Using EXTREMELY small dataset to finetune BERT 🤗Transformers	6	13339	February 1, 2023
Question about the causality of Roberta TOKENS 🤗Transformers	0	165	January 31, 2023