Sudden Loss Drop and Poor Performance During Model Training
|
|
0
|
13
|
April 28, 2025
|
Gradio 4.26.0 Space no longer starts - TypeError: argument of type 'bool' is not iterable (working in 5.27.0)
|
|
1
|
15
|
April 28, 2025
|
Bringing Supercomputer-Grade AI Performance to Local CPUs: Purem Benchmarks Now Public
|
|
0
|
5
|
April 28, 2025
|
Runtime Identity Drift in LLMs - Can We Stabilize Without Memory?
|
|
4
|
83
|
April 28, 2025
|
Colab cannot find HuggingFace dataset
|
|
7
|
4148
|
April 28, 2025
|
Please tell me the top 3 models on the market for chatting WITHOUT censorship
|
|
1
|
191
|
December 24, 2024
|
Isn't there a simpler way to run LLMs / models locally?
|
|
3
|
224
|
April 28, 2025
|
How to deploy smolagents locally
|
|
2
|
24
|
April 28, 2025
|
How can I create a new page theme that still inherits the home page style?
|
|
1
|
15
|
April 28, 2025
|
Research survey: Ethics-Based Auditing of Generative AI - share your knowledge
|
|
0
|
7
|
April 28, 2025
|
Getting "502 Server Error: Bad Gateway for url: https://api-inference.huggingface.co/models/meta-llama/Llama-3.2-3B-Instruct" error
|
|
8
|
118
|
April 28, 2025
|
Best way to make psychedelic visual videos?
|
|
6
|
29
|
April 28, 2025
|
How Can I Accurately Summarize Long Japanese Texts?
|
|
1
|
12
|
April 28, 2025
|
Unable to push agent to personal space from google colab value error
|
|
5
|
29
|
April 28, 2025
|
Can't access HF discord after verification, been trying over a month
|
|
35
|
467
|
April 28, 2025
|
Dataset preparation for LayoutLM and LiLT
|
|
1
|
41
|
April 27, 2025
|
Can I get clarification on what exactly transformers does vs what the model does?
|
|
2
|
33
|
April 27, 2025
|
How to write custom TrainerCallback functions with custom arguments?
|
|
4
|
30
|
April 28, 2025
|
Attention mask shape (custom attention masking)
|
|
3
|
506
|
April 27, 2025
|
Error message occurring for the past 12 hours
|
|
1
|
16
|
April 27, 2025
|
Error code 137 - cache error
|
|
16
|
33
|
April 27, 2025
|
Fine Tuning Llava 1.5 7b for Classification
|
|
1
|
15
|
April 27, 2025
|
Allow huggingface noreply email in GPG keys
|
|
1
|
12
|
April 27, 2025
|
Help making object detection dataset
|
|
4
|
18
|
April 26, 2025
|
Issue with Deploying LoRA-adapted Model on Hugging Face Endpoint
|
|
10
|
85
|
April 26, 2025
|
How to fine-tune a pretrained LLM on custom code libraries?
|
|
3
|
6289
|
April 26, 2025
|
If I want to find an app/model that does "inpainting" how do I search?
|
|
2
|
18
|
April 26, 2025
|
11B model gets OOM after using deepspeed zero 3 setting with 8 32G V100
|
|
2
|
1146
|
April 26, 2025
|
"Expected all tensors to be on the same device" with SFTTrainer
|
|
2
|
17
|
April 26, 2025
|
Agent won't respond
|
|
6
|
204
|
April 26, 2025
|