Text-to-Sql model keeps missing "<" token
|
|
3
|
31
|
June 11, 2025
|
Downloading larger models with xet fails on macOS
|
|
3
|
291
|
June 5, 2025
|
How to implement bind_tools to custom LLM from huggingface pipeline(Llama-3) for a custom agent
|
|
3
|
1295
|
June 9, 2025
|
Simplifying Hugging Face Spaces API calls in Flutter using hugging_face_chat_gradio package
|
|
4
|
40
|
June 8, 2025
|
Do we need a new programming language optimized for AI to write code?
|
|
2
|
86
|
June 6, 2025
|
Regarding the Image Generation
|
|
1
|
27
|
June 6, 2025
|
Consensus Validation for LLM Outputs: Applying Blockchain-Inspired Models to AI Reliability
|
|
0
|
196
|
June 5, 2025
|
Stateful PEFT adapter
|
|
0
|
13
|
June 5, 2025
|
Fine tuning LLM for text classification -- error with SFTTrainer
|
|
2
|
1374
|
June 3, 2025
|
Crisp AI to AI language the road to AGI
|
|
1
|
24
|
May 29, 2025
|
Why do custom development?
|
|
4
|
49
|
May 28, 2025
|
Trouble fine-tuning Flan-T5 (with LoRA) for structured map generation – model repeats prompt or instructions
|
|
1
|
31
|
May 26, 2025
|
Why is the memory quickly filled up in the first few iterations when using Trainer of transformers to train the network, and then drops to a very low level as the training progresses?
|
|
0
|
11
|
May 25, 2025
|
Dario Schiraldi : How can I set up a commercially viable workflow in ComfyUI to perform accurate face-swapping?
|
|
0
|
44
|
May 22, 2025
|
How to forbade Gemma 2 from using a certain phrase and use another one in its place?
|
|
7
|
21
|
May 21, 2025
|
Dedicated endpoint getting 429 errors
|
|
4
|
309
|
May 21, 2025
|
429 for Kokoro-82M model
|
|
1
|
51
|
May 19, 2025
|
GradioUI + Smolagents + MCP "Event loop is closed"
|
|
1
|
97
|
May 16, 2025
|
🚀 New tool for AI manga creators: **MangaBuilder** (buildmanga.com)
|
|
2
|
46
|
May 16, 2025
|
Handling Extreme Class Imbalance for Multi-Class Classification
|
|
1
|
58
|
May 14, 2025
|
Matching Single Shoes with Computer Vision – Alternatives to Cosine Similarity and Siamese Networks need advice
|
|
3
|
13
|
May 12, 2025
|
Resize embeddings on Peft model
|
|
4
|
598
|
May 12, 2025
|
Blip2 peft training
|
|
2
|
222
|
May 9, 2025
|
How to setup JSON based workflow/flowchart generation based on user prompt?
|
|
1
|
50
|
May 9, 2025
|
Cuda OOM on 4 A6000s (142 GB of VRAM) even after using Zero3, Qlora, Accelerate, Max_token_length
|
|
1
|
94
|
May 8, 2025
|
How do i batch in streaming of data set
|
|
1
|
43
|
May 3, 2025
|
Help with Quantizing phi-4 MM Fine-Tuned Vision Model and Converting to ONNX
|
|
3
|
70
|
May 2, 2025
|
Checking if two column have the language i want
|
|
1
|
26
|
May 1, 2025
|
Strange pyarrow error when extracting rows from a public dataset
|
|
2
|
38
|
April 30, 2025
|
A Poem that help LLM improve quality & reduce 50% overhead
|
|
0
|
25
|
April 29, 2025
|