Topic | Replies | Views | Activity
Saving and loading modified Unet | 1 | 1865 | December 4, 2023
Git lfs error on training diffusers | 7 | 2838 | December 4, 2023
Summarizer + TTS Gradio App Error | 0 | 418 | December 4, 2023
Predictions format sent to compute_metrics depends on model used | 0 | 214 | December 4, 2023
Getting the same embedding from llama 2 class token for any input | 1 | 1286 | December 4, 2023
Similarity search based on multiple text attributes | 0 | 398 | December 4, 2023
Response cutoff | 1 | 543 | December 4, 2023
Questions about ordering training inputs when fine-tuning models | 5 | 2471 | December 4, 2023
Get attention masks from HF pipelines | 0 | 372 | December 4, 2023
Running low on GPU memory on a cluster with ESM2 lowest config | 2 | 389 | December 5, 2023
Fine-tuning MT5 - base and make it more ChatGPT like | 2 | 362 | December 5, 2023
Getting no config error while creating inference endpoint | 0 | 204 | December 5, 2023
Seeking Recommendations for Lightweight AI Models with Strong Human Language Understanding | 0 | 838 | December 5, 2023
How is the number of steps calculated in trl's SFTTrainer under multiple-GPU? | 2 | 2824 | December 5, 2023
Launch timed out, space was not healthy after 30 min in AutotrAIN | 1 | 231 | December 5, 2023
Inference, checkpoint | 0 | 861 | December 5, 2023
Need recommendations for a 7b-AWQ text summarizer & organizer | 0 | 152 | December 5, 2023
CUDA Out-of-Memory Error with llama2-13b-chat Model on Multi-GPU Server | 0 | 1139 | December 5, 2023
I created a learning system 📚 for all disciplines use GPT | 1 | 1032 | December 5, 2023
Issue in deployment {pip's Timeout error while Building} | 1 | 420 | December 5, 2023
What to Monitor during training Val_Loss or Val_Accuracy? | 0 | 345 | December 5, 2023
Asymmetry in validation step vs. autoregressive inference | 0 | 179 | December 5, 2023
Accelerate - video encoding across GPUs fails | 0 | 193 | December 5, 2023
"message": "You need to specify either `text` or `text_target`." | 0 | 785 | December 5, 2023
TrainOutput message after training | 1 | 710 | December 5, 2023
AutoModelForCausalLM.from_pretrained refuses to load safetensors weights | 0 | 948 | December 5, 2023
Trying to choose a model for converting natural language to structured queries/output | 0 | 446 | December 5, 2023
Clear/Refresh gr.Gallery output on event on huggingface | 1 | 211 | December 5, 2023
What can you fine tune with 2x A6000s? | 1 | 350 | December 5, 2023
Announcement: We will be closing this Gradio section of the Forums | 1 | 479 | December 5, 2023