Hugging Face Forums

Topic and category	Replies	Views	Activity
How to change BERT attention value during testing Intermediate	0	407	October 6, 2021
MobileBERT too slow? Models	2	757	October 6, 2021
Can't load weights for 'facebook/bart-base' Models	2	1756	October 6, 2021
Questions when doing Transformer-XL Finetune with Trainer Beginners	3	1056	October 6, 2021
How to freeze the attention map in BERT Intermediate	0	536	October 6, 2021
How to configure # models to use for training 🤗AutoTrain	2	1122	October 5, 2021
Specify Loss for Trainer / TrainingArguments 🤗Transformers	5	21055	October 5, 2021
Fine-Tuning results suggest some underlying implementation error? 🤗Transformers	1	680	October 5, 2021
Why aren't all weights of BertForPreTraining initialized from the model checkpoint? Beginners	3	1584	October 5, 2021
Multi-task learning for masked language modeling and token classification 🤗Transformers	0	595	October 5, 2021
My own f Forward Beginners	0	227	October 5, 2021
Containerizing Huggingface Transformers for GPU inference with Docker and FastAPI 🤗Transformers	0	2953	October 5, 2021
How to train transformer (seq-to-seq) for very large seq? 🤗Transformers	0	249	October 4, 2021
Encoding error with fine-tuned model Models	1	822	October 4, 2021
Why does fine-tuning require creating two Trainers? 🤗Transformers	0	258	October 4, 2021
PreTrain RoBERTa from scratch in Portuguese Flax/JAX Projects	16	2398	October 4, 2021
What is the cause of, and solution to, Trainer error: cuda RuntimeError 711? 🤗Transformers	4	534	October 4, 2021
HF equivalent for TF's KerasLayer()? Beginners	0	253	October 4, 2021
Inference Toolkit - Init and default template for custom inference Amazon SageMaker	12	2116	October 4, 2021
Saving eval loss for every evaluation/saved checkpoint with Trainer 🤗Transformers	2	2700	October 4, 2021
How do I specify a max character length per sentence, and vol. sentences, for summarization? Beginners	0	736	October 4, 2021
Loading community JSON based datasets without a script 🤗Datasets	3	517	October 4, 2021
Easily save DatasetDict as community dataset? Beginners	1	210	October 4, 2021
How to fine-tune BERT model for next word prediction? Beginners	0	1109	October 3, 2021
Pipelines for multiple inputs don't produce reliable results Intermediate	2	424	October 3, 2021
Using BERT and RoBERTa for (causal?) language modeling 🤗Transformers	6	5282	October 2, 2021
How to get word embedding from a TF bert model? 🤗Transformers	0	336	October 1, 2021
How can I put multiple questions in the same context at once using Question-Answering technique (I'm using BERT)? Beginners	2	1458	October 1, 2021
UML diagram Transformers repo? Beginners	0	456	September 30, 2021
Can we add special tokens such as </s> in mBart text inputs? Beginners	0	247	September 30, 2021