Intermediate

Topic	Replies	Views	Activity
Custom trainer evaluation function	0	2810	June 20, 2022
What is the difference between triplet loss and contrastive loss?	1	2037	June 18, 2022
Fine-tune a translation model on monolingual data	1	438	June 16, 2022
Why is uploaded model twice the size of actual model?	6	2734	June 12, 2022
How to set dropout range on classifier layer using hyperparameter search?	0	582	June 10, 2022
Slow inference while performing translation	0	605	June 10, 2022
Joining SpeechEncoderDecoder embedding chunks for processing longer audio	1	558	June 10, 2022
Getting completely different performance when trying to write a custom model	1	844	June 9, 2022
Save a Bert model with custom forward function and heads on Hugginface	1	1975	June 7, 2022
Trainer code for token-wise prediction model	0	437	June 6, 2022
Different results for the same mrm8488/t5-base-finetuned-emotion	1	648	May 20, 2022
Possible error in Dataset elasticsearch	0	510	May 20, 2022
How to Implement Numerical Inference in a Text Generation Problem	0	521	May 17, 2022
Fine tune model='facebook/bart-large-mnli'	0	1273	May 16, 2022
GPU utlization up and down	0	579	May 13, 2022
Streaming Dataset of Sequence Length 2048	7	2822	May 12, 2022
Regression with multiple targets	0	824	May 12, 2022
Equivalent for ignore token for Vision Transformers?	0	618	May 12, 2022
Perplexity from fine-tuned GPT2LMHeadModel with and without lm_head as a parameter	4	2054	May 10, 2022
Teaming Up for Kaggle NLP Competitions	7	1105	May 9, 2022
Sentence Pair Classification	1	2001	May 4, 2022
Suggestions for hugging face transformer models for Code and Formal Languages	2	1765	May 3, 2022
Running huggingface-cli from script	2	3947	May 2, 2022
How to use huge target data without source data	0	499	May 2, 2022
Getting entity offset from ONNX outputs	1	586	April 28, 2022
Batched pipeline for Question-Answering	0	559	April 28, 2022
Params stored in the GPU during training	1	674	April 27, 2022
Zero shot classification and Onnx	2	1384	April 27, 2022
Zero shot classification pipeline customization	2	1798	April 27, 2022
How big are differences between transformer implementations	0	534	April 26, 2022