🤗Transformers

Topic	Replies	Views	Activity
HuggingFace ViT 10x Slower than Native Tensorflow (Not Fully Using GPU?) 🤗Transformers	0	348	July 16, 2022
Automatic Speech Recognition - Pipeline Error when processing single-channel or multi-channel audio 🤗Transformers	1	1705	July 15, 2022
Cannot get DataCollator to prepare tf dataset 🤗Transformers	0	478	July 15, 2022
Extract final hidden unit scores after custom fine-tuning language model 🤗Transformers	0	210	July 15, 2022
Creating summaries of fixed length with PEGASUS model 🤗Transformers	1	477	July 13, 2022
Use external embeddings 🤗Transformers	0	374	July 13, 2022
How to load model saved in local (Visual Transformer ViT)? 🤗Transformers	1	1622	July 13, 2022
Using Batch Encodings 🤗Transformers	0	717	July 12, 2022
Reason for discrepancy between loss calculation in XLNetLMHeadModel and GPT2LMHeadModel 🤗Transformers	0	430	July 12, 2022
Very low GPU usage when translating text, datasets not helping 🤗Transformers	3	5892	July 12, 2022
Different lm_head size and vocab_size 🤗Transformers	0	869	July 12, 2022
Rewriting generate function for manual decoder input 🤗Transformers	7	3569	July 11, 2022
RuntimeError - invalid multinomial distribution (with replacement=False, not enough non-negative category to sample) 🤗Transformers	0	395	July 11, 2022
Apply multiple rows of pandas dataframe to text2text-generation pipeline 🤗Transformers	0	573	July 11, 2022
T5.generate() cannot get hidden states although output_hidden_states=True 🤗Transformers	0	552	July 9, 2022
What is the dimensionality of output_attentions? 🤗Transformers	0	476	July 9, 2022
Longt5 summarization using huggingface sample code 🤗Transformers	1	850	July 8, 2022
Accessing model after training with hyper-parameter search 🤗Transformers	2	1080	July 7, 2022
BertForSequenceClassification classification head question 🤗Transformers	0	298	July 7, 2022
Mixed Precision training (fp16), how to use in production? 🤗Transformers	1	928	July 7, 2022
How does the Trainer API carry out fine-tuning? 🤗Transformers	0	370	July 7, 2022
Is model.eval() equivalent to setting dropout as 0? 🤗Transformers	0	1332	July 7, 2022
Vision Transformer embeddings interpolation 🤗Transformers	0	373	July 6, 2022
The result of bart-large is more stranger compare to the bart-base 🤗Transformers	0	612	July 5, 2022
Convert Bart to seq to seq form 🤗Transformers	0	309	July 5, 2022
MLFlow error while running pytorch summarization script 🤗Transformers	0	299	July 5, 2022
Onnx tf bert sentiment-analysis input and outputs 🤗Transformers	3	1095	July 5, 2022
How to train a LM model with whole word masking using Pytorch Trainer API 🤗Transformers	0	295	July 4, 2022
Runtime Error Due to Mac M1 Architecture? 🤗Transformers	2	4016	July 4, 2022
mT5 maximum sequence length 🤗Transformers	0	424	July 2, 2022