When using SGD: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
|
|
0
|
1894
|
October 9, 2023
|
Aggregate AB test dataset
|
|
0
|
258
|
October 9, 2023
|
Problem with custom metric for custom T5 model
|
|
1
|
762
|
October 9, 2023
|
About intermediate variable text
|
|
1
|
179
|
October 9, 2023
|
CUDA is out of memory
|
|
3
|
3300
|
October 9, 2023
|
How to set trust_remote_code=True for prompt-tuning fine-tuning for local deployment models
|
|
1
|
1507
|
October 9, 2023
|
I want to create a small model that could optimize code. What are your suggestions?
|
|
3
|
526
|
October 9, 2023
|
Does accelerate API support FSDP on TPU Pods? (accelerate config doesn't seem to allow this)
|
|
0
|
400
|
October 8, 2023
|
[Errno 2] No such file or directory: 'ffmpeg'
|
|
4
|
3629
|
October 9, 2023
|
Gradio message limit is there a way?
|
|
4
|
734
|
October 8, 2023
|
Get model downloads count oneach month
|
|
0
|
241
|
October 8, 2023
|
Add additional trainable layers to StableDiffusion for fine-tuning
|
|
0
|
1007
|
October 8, 2023
|
Intermediate features from a Huggingface pretrained model
|
|
0
|
323
|
October 8, 2023
|
Error with BertTokenizerFast: AttributeError - 'function' object has no attribute 'get'
|
|
0
|
635
|
October 8, 2023
|
Tried to download Mistral 7B but got an error message
|
|
3
|
13271
|
October 8, 2023
|
Where to find documentation on dataset format for finetuning
|
|
0
|
277
|
October 7, 2023
|
Converting pytorch_model.bin (Whisper )to .pt
|
|
0
|
643
|
October 8, 2023
|
Saving a Bert model
|
|
2
|
645
|
October 8, 2023
|
torch.cuda.OutOfMemoryError when evaluate while traning
|
|
0
|
510
|
October 8, 2023
|
Vocab-Transformers vs Sentence-Transformers
|
|
0
|
631
|
October 8, 2023
|
Can I use sentence-transformers with tensorflow?
|
|
1
|
341
|
October 8, 2023
|
Single batch training on multi-gpu
|
|
1
|
986
|
October 8, 2023
|
Trained a tokenizer from scratch but problem when loading
|
|
0
|
478
|
October 8, 2023
|
Qunatized model with LORA takes much more GPU memory than the un-quantized model with LORA for the (E-5-Large Embedding Transformer)
|
|
4
|
1723
|
October 8, 2023
|
I was using huugginfface meta-llama/Llama-2-7b-chat-hf and im facing an error
|
|
2
|
2565
|
October 8, 2023
|
LayoutLMV3 on dataset other than english
|
|
0
|
201
|
October 8, 2023
|
TrainingArgument
|
|
3
|
8215
|
October 8, 2023
|
How to join separate strings to translate them together for better speed?
|
|
0
|
228
|
October 7, 2023
|
Fine-tuning: Merge or chain?
|
|
2
|
954
|
October 7, 2023
|
Using fine-tuned model that wasn't explicitly saved
|
|
2
|
1054
|
October 7, 2023
|