Hugging Face Forums

Topic	Replies	Views	Activity
Difference between GAT and Transformer? Intermediate	0	885	April 7, 2022
Is there a way to use mean_pooling with Roberta? Intermediate	0	467	April 6, 2022
What is the best way to tackle OOV Intermediate	0	472	April 6, 2022
Incorporating structural information in a Transformer? Research	0	718	April 6, 2022
Detokenising output of Roberta tokeniser Beginners	0	441	April 6, 2022
Weight decay rate in create optimizer tensorflow Intermediate	0	598	April 6, 2022
Using Trainer for BertForPretraining does not work 🤗Transformers	1	1344	April 6, 2022
How does FillMaskPipeline work with Subword-Tokenization? 🤗Transformers	1	425	April 6, 2022
“No matching distribution found for wordninja==2.0.0” when using HuggingFace + SageMaker Amazon SageMaker	4	1208	April 6, 2022
Cannot import MXNet in Spaces Spaces	0	1001	April 6, 2022
Fine-tune CLIP on satellite images+captions Flax/JAX Projects	14	5042	April 6, 2022
Bert pretrained tokenizer: how to preserve hyphened words? Beginners	0	311	April 6, 2022
Can you use both copy mechanism and BPE for a NMT task? Research	0	712	April 6, 2022
Creating distillated version of gelectra-base model Intermediate	0	419	April 5, 2022
T5ForConditionalGeneration, How to get prediction probabilities or logits at the inference time? (to calculate perplexity) 🤗Transformers	0	689	April 5, 2022
Huggingface classification struggling with prediction 🤗Transformers	0	831	April 5, 2022
3-dimensional attention_mask in LongformerSelfAttention Models	0	812	April 5, 2022
Creating Batch Sizes for Video Transcription Dataset Models	0	682	April 5, 2022
What are the product quantization vectors 🤗Transformers	0	261	April 5, 2022
Is zeroshot classification tokenizing the input sequence more than once? 🤗Transformers	0	210	April 5, 2022
Dataset map method - how to pass argument to the function Beginners	4	10315	April 5, 2022
Is there an easy way to apply layer-wise decaying learning rate in huggingface trainer for RobertaMaskedForLM? Research	3	2927	April 5, 2022
Model Card for deepset/roberta-large-squad2-hp and deepset/roberta-large-squad2 Model cards	0	1115	April 5, 2022
Access Quantization module in wave2vec2 🤗Transformers	0	254	April 5, 2022
Unable to load mozilla-foundation/common_voice_6_0 dataset 🤗Datasets	2	1209	April 4, 2022
Best way to mask a multi-token word when using `.*ForMaskedLM` models 🤗Tokenizers	2	2294	April 4, 2022
3d object as gradio input/output 🔒 Gradio	2	1672	April 4, 2022
TFT5ForConditionalGeneration with custom loss Beginners	0	447	April 4, 2022
TypeError: forward() got an unexpected keyword argument 'return_dict' Beginners	0	1160	April 4, 2022
Does a tokenizer keep the mapping between my labels to their encoding? 🤗Tokenizers	3	2160	April 4, 2022