Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
|
|
1
|
889
|
September 20, 2023
|
Encoder Decoder Embedding layer shared in BartModel code
|
|
1
|
340
|
September 20, 2023
|
Improving Whisper for Inference
|
|
11
|
3815
|
September 20, 2023
|
How to save predictions for each epoch by Trainer?
|
|
4
|
2234
|
September 19, 2023
|
What should be indicated in the payload
|
|
0
|
297
|
September 19, 2023
|
What is the loss Function when fine-tuning LlamaV2
|
|
0
|
2123
|
September 19, 2023
|
How to stop a step2step generation model while streaming
|
|
0
|
190
|
September 19, 2023
|
Docker space rebuilds with "NotFound" error
|
|
12
|
619
|
September 19, 2023
|
Order between optimization and quantization
|
|
1
|
507
|
September 19, 2023
|
Questions about the connection between tokenizer and the model
|
|
0
|
307
|
September 19, 2023
|
Condtional_download how to for huggingface resources
|
|
6
|
1959
|
September 19, 2023
|
Different loss values during training
|
|
0
|
210
|
September 19, 2023
|
Improve use of .then - Example chat text and audio code
|
|
0
|
415
|
September 19, 2023
|
OpenAPI key compromised
|
|
1
|
436
|
September 19, 2023
|
Git-base-vatex: input pixel_value dimension mismatch (blocking issue)
|
|
0
|
223
|
September 19, 2023
|
GPT4all in a personal server to be access by many users
|
|
0
|
901
|
September 19, 2023
|
Licensing limitations of âtransformersâ?
|
|
0
|
376
|
September 19, 2023
|
What are 'min_duration_off' and 'threshold' means (segmentation)
|
|
1
|
917
|
September 19, 2023
|
Multiple tasks for one fine-tuned LLM
|
|
2
|
6508
|
September 18, 2023
|
Cannot install orbax-checkpoint due to uvicorn error
|
|
2
|
488
|
September 18, 2023
|
How to perform finetuning on llama2 adapters
|
|
0
|
324
|
September 15, 2023
|
When training Llama for sequence classification, should the final token be an EOS?
|
|
2
|
566
|
September 18, 2023
|
The output of dataframe
|
|
8
|
2353
|
September 18, 2023
|
Loading weights straight to GPU & Training support
|
|
0
|
214
|
September 18, 2023
|
Adding a contributor
|
|
1
|
2729
|
September 18, 2023
|
Question about FP16/32, LoRA and GPU Memory Usage
|
|
1
|
3708
|
September 18, 2023
|
gr.Dataframe(): changes to height, scrolling initialization
|
|
5
|
2778
|
September 18, 2023
|
How to remove biasness and censorship while training LLms?
|
|
0
|
265
|
September 18, 2023
|
Selection for suitable compute metrics in SFTTrainer for QA
|
|
0
|
666
|
September 18, 2023
|
Default argument '-1'
|
|
0
|
106
|
September 18, 2023
|