Hugging Face Forums

Topic	Replies	Views	Activity
Any examples on VisualBERTforMultipleChoice 🤗Transformers	1	412	March 3, 2022
T5 generate gibberish after finetune 10epochs Models	4	1563	March 2, 2022
Not enough values to unpack (expected 2, got 1) in training IMDB dataset Models	1	894	March 2, 2022
BERT model with duplicated data and f1 score Models	2	1116	March 2, 2022
Saving custom and/or finetuned models without the HUB Beginners	3	1044	March 2, 2022
Can we get per word loss from the output of a GPT model Beginners	0	364	March 2, 2022
Adding Blenderbot 2.0 to Huggingface Beginners	3	1039	March 2, 2022
After vocabulary extension the tokenizer keeps on running 🤗Transformers	0	319	March 2, 2022
Faster way to apply a model to dataframe Beginners	0	1727	March 2, 2022
Fine tuning model for stack exchange Models	0	372	March 2, 2022
Load dataset who has been automatically processed by AutoNLP 🤗Datasets	1	896	March 2, 2022
Creating masked sentences 🤗Datasets	1	410	March 2, 2022
How to train a translation model from scratch Beginners	9	12456	March 1, 2022
How to use only one bert to do generation task with 'past_key_values' mechanism？ 🤗Transformers	2	791	March 1, 2022
Different size of Roberta-base tokenizer and model embedding Beginners	1	1082	March 1, 2022
Use Trainer API with two valiation sets 🤗Transformers	1	1821	February 28, 2022
NER - Lab Reports, Vitals Intermediate	0	516	March 1, 2022
Using custom models (not necessarily transformer based) with generate() and sampling Beginners	2	1210	March 1, 2022
How to remove input from from generated text in GPTNeo? 🤗Transformers	0	985	March 1, 2022
How to get the score for a generated sentence from BartForConditionalGeneration Models	0	548	March 1, 2022
Improving zero-shot classification for roughly tokenized labels Models	0	764	December 30, 2021
Evaluating your model on more than one dataset Beginners	3	2046	February 28, 2022
How to deploy a T5 model to AWS SageMaker for fast inference? Amazon SageMaker	13	5762	February 28, 2022
Summarization on smaller set of sentences (avg. 100 words) Beginners	0	187	February 28, 2022
Why training accuracy and test accuracy on train set is significantly different? Beginners	0	1390	February 28, 2022
T5 extractive behavior Intermediate	0	402	February 28, 2022
Output embedding from each self-attention head from each encoder layer Intermediate	0	410	February 28, 2022
Strange sequence generation with xsum-distillbart (clumped tokens) Models	0	296	February 28, 2022
Word embedding with BERT 🤗Transformers	0	626	February 28, 2022
Onnx Errors pipeline_name ='question-answering' Intermediate	5	2208	February 28, 2022