Implementation of Two Distinct Datasets with HuggingFace Trainer Module
|
|
5
|
45
|
June 18, 2025
|
Translation with marianmt. Early stopping stucked
|
|
4
|
29
|
June 17, 2025
|
Cannot get tools to work: InferenceClient + hf-inference + Qwen/Qwen3-235B-A22B -- Internal Server Error
|
|
3
|
40
|
June 17, 2025
|
AI House material change
|
|
0
|
15
|
June 12, 2025
|
Copyright policy regarding youtube datasets
|
|
1
|
13
|
June 12, 2025
|
Text-to-Sql model keeps missing "<" token
|
|
3
|
32
|
June 11, 2025
|
Downloading larger models with xet fails on macOS
|
|
3
|
804
|
June 5, 2025
|
How to implement bind_tools to custom LLM from huggingface pipeline(Llama-3) for a custom agent
|
|
3
|
1381
|
June 9, 2025
|
Simplifying Hugging Face Spaces API calls in Flutter using hugging_face_chat_gradio package
|
|
4
|
65
|
June 8, 2025
|
Do we need a new programming language optimized for AI to write code?
|
|
2
|
94
|
June 6, 2025
|
Regarding the Image Generation
|
|
1
|
33
|
June 6, 2025
|
Consensus Validation for LLM Outputs: Applying Blockchain-Inspired Models to AI Reliability
|
|
0
|
264
|
June 5, 2025
|
Stateful PEFT adapter
|
|
0
|
14
|
June 5, 2025
|
Fine tuning LLM for text classification -- error with SFTTrainer
|
|
2
|
1391
|
June 3, 2025
|
Crisp AI to AI language the road to AGI
|
|
1
|
28
|
May 29, 2025
|
Why do custom development?
|
|
4
|
50
|
May 28, 2025
|
Trouble fine-tuning Flan-T5 (with LoRA) for structured map generation – model repeats prompt or instructions
|
|
1
|
64
|
May 26, 2025
|
Why is the memory quickly filled up in the first few iterations when using Trainer of transformers to train the network, and then drops to a very low level as the training progresses?
|
|
0
|
15
|
May 25, 2025
|
Dario Schiraldi : How can I set up a commercially viable workflow in ComfyUI to perform accurate face-swapping?
|
|
0
|
64
|
May 22, 2025
|
How to forbade Gemma 2 from using a certain phrase and use another one in its place?
|
|
7
|
22
|
May 21, 2025
|
Dedicated endpoint getting 429 errors
|
|
4
|
547
|
May 21, 2025
|
429 for Kokoro-82M model
|
|
1
|
62
|
May 19, 2025
|
GradioUI + Smolagents + MCP "Event loop is closed"
|
|
1
|
128
|
May 16, 2025
|
🚀 New tool for AI manga creators: **MangaBuilder** (buildmanga.com)
|
|
2
|
67
|
May 16, 2025
|
Handling Extreme Class Imbalance for Multi-Class Classification
|
|
1
|
91
|
May 14, 2025
|
Matching Single Shoes with Computer Vision – Alternatives to Cosine Similarity and Siamese Networks need advice
|
|
3
|
20
|
May 12, 2025
|
Resize embeddings on Peft model
|
|
4
|
777
|
May 12, 2025
|
Blip2 peft training
|
|
2
|
286
|
May 9, 2025
|
How to setup JSON based workflow/flowchart generation based on user prompt?
|
|
1
|
72
|
May 9, 2025
|
Cuda OOM on 4 A6000s (142 GB of VRAM) even after using Zero3, Qlora, Accelerate, Max_token_length
|
|
1
|
161
|
May 8, 2025
|