|
Implementation of Two Distinct Datasets with HuggingFace Trainer Module
|
|
5
|
77
|
June 18, 2025
|
|
Translation with marianmt. Early stopping stucked
|
|
4
|
56
|
June 17, 2025
|
|
Cannot get tools to work: InferenceClient + hf-inference + Qwen/Qwen3-235B-A22B -- Internal Server Error
|
|
3
|
75
|
June 17, 2025
|
|
AI House material change
|
|
0
|
21
|
June 12, 2025
|
|
Copyright policy regarding youtube datasets
|
|
1
|
28
|
June 12, 2025
|
|
Text-to-Sql model keeps missing "<" token
|
|
3
|
45
|
June 11, 2025
|
|
Downloading larger models with xet fails on macOS
|
|
3
|
1455
|
June 5, 2025
|
|
How to implement bind_tools to custom LLM from huggingface pipeline(Llama-3) for a custom agent
|
|
3
|
1565
|
June 9, 2025
|
|
Simplifying Hugging Face Spaces API calls in Flutter using hugging_face_chat_gradio package
|
|
4
|
127
|
June 8, 2025
|
|
Do we need a new programming language optimized for AI to write code?
|
|
2
|
122
|
June 6, 2025
|
|
Regarding the Image Generation
|
|
1
|
51
|
June 6, 2025
|
|
Consensus Validation for LLM Outputs: Applying Blockchain-Inspired Models to AI Reliability
|
|
0
|
300
|
June 5, 2025
|
|
Stateful PEFT adapter
|
|
0
|
19
|
June 5, 2025
|
|
Fine tuning LLM for text classification -- error with SFTTrainer
|
|
2
|
1430
|
June 3, 2025
|
|
Crisp AI to AI language the road to AGI
|
|
1
|
46
|
May 29, 2025
|
|
Why do custom development?
|
|
4
|
81
|
May 28, 2025
|
|
Trouble fine-tuning Flan-T5 (with LoRA) for structured map generation – model repeats prompt or instructions
|
|
1
|
133
|
May 26, 2025
|
|
Why is the memory quickly filled up in the first few iterations when using Trainer of transformers to train the network, and then drops to a very low level as the training progresses?
|
|
0
|
29
|
May 25, 2025
|
|
Dario Schiraldi : How can I set up a commercially viable workflow in ComfyUI to perform accurate face-swapping?
|
|
0
|
111
|
May 22, 2025
|
|
How to forbade Gemma 2 from using a certain phrase and use another one in its place?
|
|
7
|
48
|
May 21, 2025
|
|
Dedicated endpoint getting 429 errors
|
|
4
|
688
|
May 21, 2025
|
|
429 for Kokoro-82M model
|
|
1
|
109
|
May 19, 2025
|
|
GradioUI + Smolagents + MCP "Event loop is closed"
|
|
1
|
164
|
May 16, 2025
|
|
🚀 New tool for AI manga creators: **MangaBuilder** (buildmanga.com)
|
|
2
|
142
|
May 16, 2025
|
|
Handling Extreme Class Imbalance for Multi-Class Classification
|
|
1
|
221
|
May 14, 2025
|
|
Matching Single Shoes with Computer Vision – Alternatives to Cosine Similarity and Siamese Networks need advice
|
|
3
|
35
|
May 12, 2025
|
|
Resize embeddings on Peft model
|
|
4
|
1077
|
May 12, 2025
|
|
Blip2 peft training
|
|
2
|
358
|
May 9, 2025
|
|
How to setup JSON based workflow/flowchart generation based on user prompt?
|
|
1
|
120
|
May 9, 2025
|
|
Cuda OOM on 4 A6000s (142 GB of VRAM) even after using Zero3, Qlora, Accelerate, Max_token_length
|
|
1
|
325
|
May 8, 2025
|