Which LLM Works Best for Prompt and Response Generation in Chinese (Simplified and Traditional)
|
|
3
|
1198
|
January 23, 2025
|
How do I report spam on this site?
|
|
6
|
293
|
November 15, 2024
|
Exception in ASGI application
|
|
3
|
3670
|
January 29, 2025
|
How to use website search functionality with my LLM
|
|
1
|
5249
|
January 11, 2025
|
Introduce Our New Paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use"
|
|
1
|
313
|
March 21, 2025
|
Whoami rate limited
|
|
2
|
1290
|
February 9, 2025
|
Fine tuning on qwen3
|
|
2
|
1311
|
May 19, 2025
|
Facing issue using a model hosted on HuggingFace Server and talking to it using API_KEY
|
|
7
|
495
|
May 12, 2025
|
RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1 Compile with `TORCH_USE_CU
|
|
2
|
1260
|
November 1, 2024
|
Training fails on multiple GPUs with RuntimeError 'chuck expects at least a 1-dimensional array'
|
|
2
|
415
|
December 19, 2024
|
How to prevent catastrophic forgetting in fine tuned large language models?
|
|
3
|
1277
|
January 11, 2025
|
Smolagents Error: probability tensor contains either `inf`, `nan` or element < 0
|
|
5
|
908
|
April 4, 2025
|
Torchrun, trainer, dataset setup
|
|
4
|
994
|
December 20, 2024
|
Invalid credentials in Authorization header - HfApiModel
|
|
5
|
2759
|
August 6, 2025
|
OOM issue with large dataset streaming
|
|
6
|
162
|
March 15, 2025
|
The used dataset had no length, returning gathered tensors. You should drop the remainder yourself
|
|
4
|
328
|
December 26, 2024
|
How to track dataset downloads over time?
|
|
3
|
1051
|
November 19, 2024
|
Using huggingface CLI with a certificate
|
|
3
|
3841
|
September 30, 2024
|
New Project - Echo Nova
|
|
3
|
116
|
September 3, 2025
|
How to initialize a model with random weights
|
|
3
|
1052
|
October 28, 2024
|
How do I resume training a finetuned model from the epoch it has ended
|
|
3
|
1025
|
October 31, 2024
|
How to set different learning rates for different parameters in the model?
|
|
7
|
440
|
December 17, 2024
|
requests.exceptions.HTTPError: 429 Client Error: Too Many Requests
|
|
3
|
3391
|
November 19, 2024
|
Starting with AI and assistants
|
|
2
|
680
|
May 11, 2025
|
AI Tools You Need to Master Hugging Face Daily Papers!
|
|
3
|
146
|
January 16, 2025
|
Fine-tuning "reasoning" models
|
|
1
|
1410
|
January 23, 2025
|
Isn't there a simpler way to run LLMs / models locally?
|
|
3
|
958
|
April 28, 2025
|
Research Project Ideas on LLMs
|
|
6
|
253
|
August 2, 2025
|
New function in Kaggle
|
|
2
|
58
|
July 16, 2025
|
Chainlit WebSocket Issue on Hugging Face Spaces: Missing websockets in Requirements?
|
|
4
|
158
|
June 12, 2025
|
Simple Model to rewrite/paraphrase
|
|
7
|
723
|
March 19, 2025
|
Loading a dataset cached in a LocalFileSystem is not supported
|
|
3
|
931
|
July 23, 2025
|
How to enter longer prompt words
|
|
3
|
1056
|
January 19, 2025
|
How to run agents from `smolagents` locally?
|
|
4
|
867
|
May 27, 2025
|
Downloading larger models with xet fails on macOS
|
|
3
|
904
|
June 5, 2025
|
Reward Hacking Solutions
|
|
0
|
62
|
August 9, 2025
|
SmolAgents: Try to run Agent with local model (mistral)
|
|
3
|
536
|
March 24, 2025
|
What is the best AMD Ryzen 5 laptops for running machine learning models and libraries like TensorFlow or PyTorch?
|
|
4
|
791
|
September 29, 2024
|
How do I access my models I saved inside a folder?
|
|
1
|
677
|
November 12, 2024
|
Resize embeddings on Peft model
|
|
4
|
811
|
May 12, 2025
|
Remove causal mask from Llama decoder
|
|
5
|
816
|
October 22, 2024
|
Should I just get more RAM?
|
|
4
|
2603
|
December 22, 2024
|
AutoModel from_pretrained does not recursively download relative imports
|
|
0
|
56
|
March 11, 2025
|
How can I obtain the logits via model.generate()?
|
|
2
|
3154
|
October 8, 2024
|
Guidance Scale for Flux LoRA
|
|
2
|
967
|
February 12, 2025
|
Introducing The AGI Framework: Open-Source Modular Architecture for Artificial General Intelligence Development
|
|
0
|
336
|
January 28, 2025
|
GRPO Trainer for VLM?
|
|
5
|
387
|
July 7, 2025
|
Offering a Technical Deep Dive on GRPO/DAPO/Dr. GRPO Algorithms
|
|
2
|
538
|
May 11, 2025
|
RL Course Unit 1: "python setup.py egg_info did not run successfully"
|
|
3
|
194
|
August 20, 2025
|
Access to LlaMa 2 denied
|
|
2
|
506
|
January 12, 2025
|