Hugging Face Forums

Topic	Replies	Views	Activity
Injecting Multiple Modalities into a Transformer Decoder via Cross-Attention 🤗Transformers	1	94	March 9, 2025
ZeroGPU illegal duration. The requested GPU duration (300s) is larger than the maximum allowed Spaces	5	246	March 9, 2025
Does the model "Qwen/Qwen2.5-Coder-32B-Instruct" free? Course	1	162	March 9, 2025
How to Train an Image Captioning Model for specific language Beginners	3	18	March 9, 2025
Delving in to AI: asking for advice Beginners	0	24	March 8, 2025
Support for LLaMA in EncoderDecoder framework 🤗Transformers	1	523	March 8, 2025
AANN: Agents As Neural Networks Research	0	45	March 8, 2025
Training LLM model for asking questions 🤗Datasets	4	92	March 8, 2025
Unable to run Application Beginners	2	21	March 8, 2025
Guidance on getting started with fine tuned uncensored model Beginners	2	847	March 8, 2025
How to train this model Models	0	75	March 8, 2025
Freezing layers with SFTTrainer Intermediate	2	265	March 8, 2025
Giving personality to LLM Beginners	1	535	March 8, 2025
Train bus routes csv Beginners	3	19	March 8, 2025
I can’t get Email confirmation link Site Feedback	79	8787	March 7, 2025
SFTTrainer Doubling Speed on a Single GPU with DeepSpeed: Proposal for an Update to the Official Documentation and Verification Report DeepSpeed	1	58	March 7, 2025
Issue with ALLaM-7B Model in Inference API - Size Limitation Error Inference Endpoints on the Hub	1	54	March 7, 2025
Use microsoft phi-2 Beginners	3	23	March 7, 2025
After fine tuning openai whisper model, there shows OSError WinError 123 🤗Transformers	1	14	March 7, 2025
Dataset of course is not available anymore Course	1	28	March 7, 2025
About Hyperparameter Search with Ray Tune 🤗Transformers	2	21	March 7, 2025
Is this needed: bnb 4bit use double quant = True? Beginners	3	2510	March 7, 2025
Cannot upload large files for a single repository 🤗Hub	3	1127	March 7, 2025
Is there any difference between GPT-J and GPT-2? Models	3	2751	March 7, 2025
Trainer.train() runs for long and appears to be stuck. How do I know that it's processing and not in loop 🤗Transformers	2	593	March 7, 2025
Benchmarking Vision Models for Specific Use Cases Beginners	0	52	March 7, 2025
Cursor vs. Local IDE with Extensions Beginners	0	45	March 7, 2025
Google Colab vs. Local IDE for Vision Projects Beginners	0	20	March 7, 2025
Hugging face inference support and quota Inference Endpoints on the Hub	3	112	March 7, 2025
As of transformers v4.44, default chat template is no longer allowed 🤗Transformers	2	3499	March 7, 2025