Whisper export to onnx with prompt_id
|
|
0
|
55
|
August 29, 2024
|
Pad Tokens & Attention Masks with Data Collators
|
|
0
|
52
|
August 29, 2024
|
DDP + Compile + Torch Dynamo + Huggingface Trainer
|
|
0
|
85
|
August 28, 2024
|
Eval_loss error on evaluation at the first epoch
|
|
0
|
32
|
August 28, 2024
|
Mistral trouble when fine-tuning : Don't set pad_token_id = eos_token_id
|
|
8
|
5487
|
August 28, 2024
|
Clarification on Classification Token
|
|
0
|
16
|
August 27, 2024
|
Convert slow XLMRobertaTokenizer to fast one
|
|
3
|
1179
|
August 26, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)
|
|
5
|
3430
|
August 26, 2024
|
[SOLVED] What's the right way to do GPU paralellism for inference (not training) on AutoModelForCausalLM?
|
|
1
|
211
|
August 26, 2024
|
Inference workflow in compile mode using transformers.pipeline()
|
|
0
|
31
|
August 26, 2024
|
Incorrect logits shape for GIT model
|
|
2
|
17
|
August 26, 2024
|
How to avoid `trust_remote_code=True` for my models
|
|
6
|
10340
|
August 26, 2024
|
Forward Pass Output Logits
|
|
0
|
73
|
August 26, 2024
|
Back propogation throught a KandinskyV22Pipeline image generator
|
|
0
|
11
|
August 25, 2024
|
Trainer stuck mid epoch
|
|
0
|
27
|
August 24, 2024
|
How does padding side affect training?
|
|
0
|
223
|
August 23, 2024
|
Trainer.evaluate() freezing
|
|
3
|
479
|
August 23, 2024
|
Different padding behaviour of data collator
|
|
0
|
75
|
August 23, 2024
|
Finetune BERT for multiple tasks
|
|
1
|
80
|
August 23, 2024
|
Missing config.json and modules.safetensors
|
|
0
|
161
|
August 22, 2024
|
Feasibility of Fine-Tuning GPT2-XL Model on 3060 RTX GPU for Academic Misinformation Identification
|
|
0
|
64
|
August 22, 2024
|
AutoTrain Error DeepSpeed Zero-3
|
|
1
|
250
|
August 21, 2024
|
Removing tokens from the GPT tokenizer
|
|
2
|
1931
|
August 20, 2024
|
Getting error while loading model from local path : Exception: expected value at line 1 column 1
|
|
2
|
958
|
August 20, 2024
|
`truncate_dim` on `BertModel`
|
|
0
|
78
|
August 20, 2024
|
Benchmarking LLMs
|
|
1
|
1347
|
August 20, 2024
|
Save_pretrained() on tokenizer does not generate a tokenizer.json file
|
|
3
|
717
|
August 19, 2024
|
Kosmos-2 Fine tuning
|
|
41
|
1890
|
August 19, 2024
|
TimeSeriesTransformerModel dimensionality issue
|
|
2
|
28
|
August 19, 2024
|
How does GPT decide to stop generating sentences without EOS token?
|
|
13
|
24000
|
August 19, 2024
|