Reproducibility of LongFormer Model
|
|
0
|
15
|
August 30, 2024
|
Whisper export to onnx with prompt_id
|
|
0
|
62
|
August 29, 2024
|
Pad Tokens & Attention Masks with Data Collators
|
|
0
|
61
|
August 29, 2024
|
DDP + Compile + Torch Dynamo + Huggingface Trainer
|
|
0
|
103
|
August 28, 2024
|
Eval_loss error on evaluation at the first epoch
|
|
0
|
35
|
August 28, 2024
|
Mistral trouble when fine-tuning: Don't set pad_token_id = eos_token_id
|
|
8
|
5926
|
August 28, 2024
|
Clarification on Classification Token
|
|
0
|
18
|
August 27, 2024
|
Convert slow XLMRobertaTokenizer to fast one
|
|
3
|
1215
|
August 26, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)
|
|
5
|
3474
|
August 26, 2024
|
[SOLVED] What's the right way to do GPU parallelism for inference (not training) on AutoModelForCausalLM?
|
|
1
|
239
|
August 26, 2024
|
Inference workflow in compile mode using transformers.pipeline()
|
|
0
|
33
|
August 26, 2024
|
Incorrect logits shape for GIT model
|
|
2
|
17
|
August 26, 2024
|
How to avoid `trust_remote_code=True` for my models
|
|
6
|
12188
|
August 26, 2024
|
Forward Pass Output Logits
|
|
0
|
101
|
August 26, 2024
|
Backpropagation through a KandinskyV22Pipeline image generator
|
|
0
|
12
|
August 25, 2024
|
Trainer stuck mid epoch
|
|
0
|
33
|
August 24, 2024
|
How does padding side affect training?
|
|
0
|
263
|
August 23, 2024
|
Trainer.evaluate() freezing
|
|
3
|
519
|
August 23, 2024
|
Different padding behaviour of data collator
|
|
0
|
116
|
August 23, 2024
|
Finetune BERT for multiple tasks
|
|
1
|
90
|
August 23, 2024
|
Missing config.json and modules.safetensors
|
|
0
|
213
|
August 22, 2024
|
Feasibility of Fine-Tuning GPT2-XL Model on 3060 RTX GPU for Academic Misinformation Identification
|
|
0
|
73
|
August 22, 2024
|
AutoTrain Error DeepSpeed Zero-3
|
|
1
|
290
|
August 21, 2024
|
Removing tokens from the GPT tokenizer
|
|
2
|
2040
|
August 20, 2024
|
Getting error while loading model from local path: Exception: expected value at line 1 column 1
|
|
2
|
1182
|
August 20, 2024
|
`truncate_dim` on `BertModel`
|
|
0
|
102
|
August 20, 2024
|
Benchmarking LLMs
|
|
1
|
1437
|
August 20, 2024
|
save_pretrained() on tokenizer does not generate a tokenizer.json file
|
|
3
|
889
|
August 19, 2024
|
Kosmos-2 fine-tuning
|
|
41
|
1966
|
August 19, 2024
|
TimeSeriesTransformerModel dimensionality issue
|
|
2
|
35
|
August 19, 2024
|