| Topic | Replies | Views | Date |
|---|---|---|---|
| Stateful PEFT adapter | 0 | 15 | June 5, 2025 |
| Fine tuning LLM for text classification -- error with SFTTrainer | 2 | 1413 | June 3, 2025 |
| Crisp AI to AI language the road to AGI | 1 | 37 | May 29, 2025 |
| Why do custom development? | 4 | 62 | May 28, 2025 |
| Trouble fine-tuning Flan-T5 (with LoRA) for structured map generation – model repeats prompt or instructions | 1 | 101 | May 26, 2025 |
| Why is the memory quickly filled up in the first few iterations when using Trainer of transformers to train the network, and then drops to a very low level as the training progresses? | 0 | 22 | May 25, 2025 |
| Dario Schiraldi : How can I set up a commercially viable workflow in ComfyUI to perform accurate face-swapping? | 0 | 93 | May 22, 2025 |
| How to forbade Gemma 2 from using a certain phrase and use another one in its place? | 7 | 33 | May 21, 2025 |
| Dedicated endpoint getting 429 errors | 4 | 629 | May 21, 2025 |
| 429 for Kokoro-82M model | 1 | 83 | May 19, 2025 |
| GradioUI + Smolagents + MCP "Event loop is closed" | 1 | 147 | May 16, 2025 |
| 🚀 New tool for AI manga creators: **MangaBuilder** (buildmanga.com) | 2 | 85 | May 16, 2025 |
| Handling Extreme Class Imbalance for Multi-Class Classification | 1 | 151 | May 14, 2025 |
| Matching Single Shoes with Computer Vision – Alternatives to Cosine Similarity and Siamese Networks need advice | 3 | 28 | May 12, 2025 |
| Resize embeddings on Peft model | 4 | 955 | May 12, 2025 |
| Blip2 peft training | 2 | 327 | May 9, 2025 |
| How to setup JSON based workflow/flowchart generation based on user prompt? | 1 | 99 | May 9, 2025 |
| Cuda OOM on 4 A6000s (142 GB of VRAM) even after using Zero3, Qlora, Accelerate, Max_token_length | 1 | 258 | May 8, 2025 |
| How do i batch in streaming of data set | 1 | 59 | May 3, 2025 |
| Help with Quantizing phi-4 MM Fine-Tuned Vision Model and Converting to ONNX | 3 | 110 | May 2, 2025 |
| Checking if two column have the language i want | 1 | 34 | May 1, 2025 |
| Strange pyarrow error when extracting rows from a public dataset | 2 | 180 | April 30, 2025 |
| A Poem that help LLM improve quality & reduce 50% overhead | 0 | 38 | April 29, 2025 |
| Gradio Chatbox - no api found | 1 | 97 | April 29, 2025 |
| Sudden Loss Drop and Poor Performance During Model Training | 0 | 95 | April 28, 2025 |
| 🔧 Optimizing Phi-4 MM Instruct Vision Model (ONNX Inference) | 1 | 61 | April 24, 2025 |
| Can anybody recommend a good image filename generating AI? | 1 | 34 | April 24, 2025 |
| Arabic to French Word embedding Using skip-gram needs new Ideas in the data part | 0 | 33 | April 23, 2025 |
| Cache Proxy - Like with Docker Registries | 1 | 772 | October 21, 2024 |
| The model did not return a loss from the inputs, only the following keys: logits. For reference, the inputs it received are input_values | 23 | 41705 | April 23, 2025 |