[RuntimeError] DPOTrainer - "element 0 of tensors does not require grad and does not have a grad_fn" on 8x A100 GPUs | 1 | 37 | May 20, 2025
Column Mapping in Autotrain | 1 | 34 | April 11, 2025
504 error with serverless HF Inference API | 1 | 35 | March 17, 2025
Qwen embedding mode inference does not work | 5 | 21 | July 2, 2025
What does it mean if a hardware option I need is greyed out in Spaces | 5 | 25 | April 30, 2025
Loading the Mdeberta-v3-base | 5 | 19 | March 13, 2025
torch.nn.DataParallel Mistral-7B-Instruct RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0! | 1 | 64 | August 20, 2024
Download the images I get in (.webp) format | 2 | 64 | August 20, 2024
Always 【initializing】 until time out without any error log | 3 | 45 | August 27, 2024
Build Queued at 1970-01-01 00:00:00 | 3 | 53 | October 23, 2024
PermissionError: [Errno 13] Permission denied while training | 0 | 84 | September 24, 2024
RetrievalQA output repeats prompt and context sources | 0 | 82 | July 26, 2024
Problem with API | 1 | 33 | May 15, 2025
Trainer + Datasets + Pytorch Dataloader Workers - how to manage memory usage? | 1 | 36 | April 29, 2025
What's the email I message to change my username? | 1 | 32 | March 25, 2025
Spaces and nginx: docs are out of date? | 1 | 33 | February 27, 2025
Tracking resource utilization per process with callbacks | 1 | 33 | October 4, 2024
Inference endpoint | 1 | 33 | August 11, 2024
Is a Pro account 5x or 8x ZeroGPU Quota? | 2 | 33 | June 28, 2025
Bot / Garbage Accounts? | 3 | 26 | April 1, 2025
.bat file to launch llamafile --server with yaml config file | 2 | 28 | February 26, 2025
No matter what I do the HF… | 2 | 28 | January 13, 2025
How to make or customize a model for someone that has no idea what he's doing | 2 | 28 | October 10, 2024
Access information across conversations | 2 | 29 | October 3, 2024
I have a problem with Preparing Space | 3 | 50 | July 30, 2024
What will you do with surplus or used GPUs? | 0 | 81 | November 16, 2024
Accelerate Distributed Randomly Hangs | 0 | 81 | September 11, 2024
Tokenization for overlapping tokens | 1 | 10 | July 13, 2025
How to add VAT (tax ID) into my billing address? | 4 | 26 | June 10, 2025
How can I dynamically update the system configuration for different users using my demo? | 6 | 31 | August 28, 2024
How is duplicate data in dataset splits/subsets handled in the hub | 1 | 63 | August 17, 2024
HF Accelerate uses multiple GPUs even when setting `num_processes` to 1 | 0 | 81 | August 2, 2024
A fine-tuning problem in localhost | 0 | 81 | July 20, 2024
Hugging Face Paid Plans and Features | 0 | 15 | July 11, 2025
📊 Automated SLA Report Generator from JIRA Tickets – Open for Feedback & Use Cases | 0 | 14 | July 4, 2025
Introduction to Hinduja Family Swiss (Switzerland) | 0 | 15 | June 20, 2025
AI House material change | 0 | 14 | June 12, 2025
Chronicles of AI Conversations | 0 | 20 | May 17, 2025
The Alpha Version of TrixGenius Has Arrived — Supercharge Your Trix Editing Experience with AI! | 0 | 14 | April 10, 2025
Does HuggingFace have a free (preferably CDN hosted) code analyzer? | 0 | 15 | March 21, 2025
Fine-tuning vs. RAG for Maths tutor in German | 0 | 16 | January 14, 2025
HuggingFaceModel() Sagemaker invoke error in Stable Diffusion 2.1 | 0 | 16 | December 17, 2024
Datacamp course token error | 0 | 14 | December 13, 2024
Test if a sentence is different from the training data | 0 | 16 | November 11, 2024
Is there any way to fine-tune a model with an existing embedding? | 0 | 15 | November 7, 2024
Add the missing independent feature in graph | 0 | 17 | October 27, 2024
Please advise on parameter settings! | 0 | 15 | October 15, 2024
How to use Cache with message API | 0 | 16 | October 13, 2024
GPU Status Pending | 3 | 28 | May 21, 2025
TGI & guidance making a strange behavior | 3 | 25 | March 24, 2025