Bitsandbytes and CUBLAS_STATUS_NOT_INITIALIZED error
|
|
21
|
56
|
August 29, 2025
|
Model responses are random ignoring my dataset
|
|
12
|
51
|
July 28, 2025
|
How to evaluate a tranied model?
|
|
10
|
75
|
September 25, 2025
|
How to separate predictions map to create id2label.json
|
|
13
|
48
|
December 15, 2024
|
Models give nonsensical answers, how do I improve them?
|
|
9
|
56
|
September 18, 2025
|
OneFormer ID/Labels for FineTuning
|
|
13
|
49
|
July 8, 2025
|
CPU Dominance and an Open Challenge to the AI Community
|
|
21
|
47
|
September 15, 2025
|
Payment Issues - Langchain/ Iterative Summarisation
|
|
9
|
48
|
July 6, 2025
|
Model for evaluation of scanned survey data
|
|
9
|
39
|
January 19, 2025
|
How to solve factual inconsistency when fine tuning
|
|
10
|
72
|
October 3, 2024
|
Calculus Ratiocinator vs. Characteristica Universalis? The Two Traditions in Logic: An open conversation
|
|
11
|
92
|
September 25, 2025
|
Is there such thing as getting masks length and width?
|
|
10
|
30
|
January 16, 2025
|
How to build a tokenizer from a vocab subset of a BPE tokenizer
|
|
9
|
28
|
September 27, 2025
|
Junior AI Engineer (RAG / LLM) â DeepTech Mental Health Project
|
|
11
|
46
|
September 15, 2025
|
Padding side in instruction fine-tuning using SFTT
|
|
1
|
1828
|
December 9, 2024
|
TypeError: SentenceTransformerTrainer.compute_loss() got an unexpected keyword argument 'num_items_in_batch'
|
|
6
|
6714
|
February 10, 2025
|
Multi-Latent Attention (MLA) Implementation from DeepSeek-V2
|
|
1
|
1330
|
February 11, 2025
|
Problem in AI Agents course - Smolagents
|
|
2
|
931
|
April 20, 2025
|
Perfect LoRA Training parameters human character
|
|
1
|
10932
|
March 24, 2025
|
Alternative options for API endpoints
|
|
4
|
397
|
July 25, 2025
|
New battle in AI field
|
|
3
|
337
|
January 25, 2025
|
Save accelerate model
|
|
4
|
1105
|
February 5, 2025
|
The ChatDEAF Project Has Officially Launched!
|
|
4
|
137
|
April 24, 2025
|
"Load Diffusion Model" and "Unet Loader (GGUF)" null/undefined
|
|
8
|
7326
|
March 22, 2025
|
Trainer warning with the new version
|
|
2
|
5915
|
January 2, 2025
|
Cannot copy out of meta tensor; no data!
|
|
4
|
7186
|
February 28, 2025
|
Image Fusing Tool
|
|
4
|
1197
|
July 23, 2025
|
Problem access public model?
|
|
2
|
1024
|
January 30, 2025
|
But is there even a single model working here?!
|
|
4
|
416
|
May 10, 2025
|
Unauthorized 401
|
|
4
|
1119
|
February 26, 2025
|
AI Agent - How to create
|
|
7
|
2808
|
July 14, 2025
|
How do you use Beam Search in Whisper correctly?
|
|
3
|
1810
|
December 15, 2024
|
Tutorial: Implementing Transformer from Scratch - A Step-by-Step Guide
|
|
5
|
7612
|
May 1, 2025
|
Making a model "think" before doing a tool call (ReAct paper)
|
|
2
|
331
|
April 4, 2025
|
Request to download / access Llama 3.3 rejected
|
|
4
|
2136
|
January 19, 2025
|
ValueError: Unrecognized model in ./trained_model. Should have a `model_type` key in its config.json
|
|
3
|
8765
|
January 7, 2025
|
[Help wanted] Common Crawl needs help to be richer & more multilingual
|
|
1
|
88
|
January 27, 2025
|
John6666 the man
|
|
2
|
143
|
June 15, 2025
|
Running an LLM with high output quality locally
|
|
5
|
3022
|
February 22, 2025
|
LM Studio compatible Text To Image Models, click and go
|
|
1
|
8896
|
April 20, 2025
|
Keep hitting 500 Internal server error when trying to launch gradio app in Spaces
|
|
6
|
2572
|
December 13, 2024
|
Impossible to train a model using both bf16 mixed precision training and torch compile, RuntimeError: expected mat1 and mat2 to have the same dtype
|
|
8
|
2289
|
October 28, 2024
|
Pro Account $2 inference limit
|
|
8
|
1259
|
March 23, 2025
|
How to use llm model's api?
|
|
2
|
3647
|
November 14, 2024
|
Too many requests for URL
|
|
5
|
4686
|
May 25, 2025
|
An error occurred while fetching the blob
|
|
1
|
1297
|
November 14, 2024
|
How to train a Model for Erotic Story Writing with Explicit Details?
|
|
5
|
4929
|
June 19, 2025
|
Translation model to 100+ Languages
|
|
4
|
2392
|
January 25, 2025
|
Best model for music generation
|
|
3
|
2547
|
December 31, 2024
|
ChatDEAF Project â First Open ISL/TİD Dataset for Sign Language Accessibility
|
|
1
|
149
|
April 20, 2025
|