Problem with custom metric for custom T5 model
|
|
1
|
762
|
October 9, 2023
|
About intermediate variable text
|
|
1
|
179
|
October 9, 2023
|
CUDA is out of memory
|
|
3
|
3299
|
October 9, 2023
|
How to set trust_remote_code=True for prompt-tuning fine-tuning for local deployment models
|
|
1
|
1506
|
October 9, 2023
|
I want to create a small model that could optimize code. What are your suggestions?
|
|
3
|
524
|
October 9, 2023
|
Does accelerate API support FSDP on TPU Pods? (accelerate config doesn't seem to allow this)
|
|
0
|
400
|
October 8, 2023
|
[Errno 2] No such file or directory: 'ffmpeg'
|
|
4
|
3627
|
October 9, 2023
|
Gradio message limit: is there a way around it?
|
|
4
|
734
|
October 8, 2023
|
Get model download counts for each month
|
|
0
|
240
|
October 8, 2023
|
Add additional trainable layers to StableDiffusion for fine-tuning
|
|
0
|
1005
|
October 8, 2023
|
Intermediate features from a Hugging Face pretrained model
|
|
0
|
322
|
October 8, 2023
|
Error with BertTokenizerFast: AttributeError - 'function' object has no attribute 'get'
|
|
0
|
634
|
October 8, 2023
|
Tried to download Mistral 7B but got an error message
|
|
3
|
13265
|
October 8, 2023
|
Where to find documentation on dataset format for finetuning
|
|
0
|
277
|
October 7, 2023
|
Converting pytorch_model.bin (Whisper) to .pt
|
|
0
|
642
|
October 8, 2023
|
Saving a BERT model
|
|
2
|
645
|
October 8, 2023
|
torch.cuda.OutOfMemoryError when evaluating during training
|
|
0
|
508
|
October 8, 2023
|
Vocab-Transformers vs Sentence-Transformers
|
|
0
|
631
|
October 8, 2023
|
Can I use sentence-transformers with tensorflow?
|
|
1
|
341
|
October 8, 2023
|
Single batch training on multi-gpu
|
|
1
|
984
|
October 8, 2023
|
Trained a tokenizer from scratch but problem when loading
|
|
0
|
478
|
October 8, 2023
|
Quantized model with LoRA takes much more GPU memory than the unquantized model with LoRA (E-5-Large Embedding Transformer)
|
|
4
|
1718
|
October 8, 2023
|
I was using Hugging Face meta-llama/Llama-2-7b-chat-hf and I'm facing an error
|
|
2
|
2564
|
October 8, 2023
|
LayoutLMv3 on datasets other than English
|
|
0
|
201
|
October 8, 2023
|
TrainingArgument
|
|
3
|
8215
|
October 8, 2023
|
How to join separate strings to translate them together for better speed?
|
|
0
|
228
|
October 7, 2023
|
Fine-tuning: Merge or chain?
|
|
2
|
954
|
October 7, 2023
|
Using fine-tuned model that wasn't explicitly saved
|
|
2
|
1050
|
October 7, 2023
|
How to forbid file access from the browser
|
|
1
|
266
|
October 7, 2023
|
Llama-2-7b download
|
|
1
|
1020
|
October 7, 2023
|