About finetuning whisper
|
|
0
|
211
|
May 5, 2023
|
Why is Reformer's Vocab Size So Small?
|
|
0
|
268
|
May 5, 2023
|
Using Trainer at inference time
|
|
9
|
15980
|
May 4, 2023
|
Mirroring Huggingface S3 to download models/tokenizers
|
|
2
|
3485
|
May 4, 2023
|
Trainer gives error after 1st epoch when using F1 score
|
|
0
|
523
|
May 4, 2023
|
How to use PEFT approach to do Prompt Tuning on DollyV2 model
|
|
0
|
768
|
May 4, 2023
|
Fine-tune CLIPSeg with (image, mask) dataset
|
|
3
|
1889
|
May 4, 2023
|
How to merge LoRa weights with base model?
|
|
0
|
1312
|
May 3, 2023
|
Why I am getting no accuracy for the trainer.train() result of roberta text classification model
|
|
0
|
153
|
May 2, 2023
|
Facebook BART Fine-tuning - Transformers CUDA error: CUBLAS_STATUS_NOT_INITIALIZE
|
|
4
|
1768
|
May 2, 2023
|
How to train a T5 model to learn a programming language?
|
|
0
|
499
|
May 2, 2023
|
Masked language model for BART (Not BERT)
|
|
5
|
1513
|
July 5, 2022
|
Hyperparameter tuning using Trainer not getting same performance
|
|
0
|
280
|
May 1, 2023
|
BioGPT causal language model with unexpected error
|
|
0
|
332
|
May 1, 2023
|
Overall accuracy in Finetuning dslim/bert-base-NER with custom dataset and labels gets only up to ~0.15 using seqeval
|
|
2
|
514
|
May 1, 2023
|
Trainer.train() is stuck
|
|
5
|
7449
|
May 1, 2023
|
Tokenizer.from_pretrained calls stuck forever
|
|
0
|
647
|
April 30, 2023
|
FlashAttention or equivalent?
|
|
0
|
914
|
April 30, 2023
|
Trainable weights in automodel and comparison with lora
|
|
0
|
221
|
April 28, 2023
|
Unable to train token classification model
|
|
0
|
298
|
April 27, 2023
|
Script to Fine-Tune FLAN UL2
|
|
1
|
298
|
April 27, 2023
|
Model did not return a loss --- but why?
|
|
0
|
748
|
April 27, 2023
|
Do automatically generated attention masks ignore padding?
|
|
4
|
16728
|
March 8, 2022
|
Can Similarity Sentence Returns the Similarity Content?
|
|
0
|
325
|
April 27, 2023
|
Finetuning T5-large on Multiple GPUs
|
|
0
|
1098
|
April 26, 2023
|
Whisper identified the wrong language
|
|
0
|
357
|
April 26, 2023
|
Fine Tuning a model for Prompt Engineering
|
|
0
|
930
|
April 26, 2023
|
transformers.Tokenizer produce unexpected results
|
|
0
|
208
|
April 26, 2023
|
How to get all prefixes for T5?
|
|
0
|
192
|
April 26, 2023
|
Exclude words from GPT-2 generate( )
|
|
3
|
1768
|
April 26, 2023
|