Topic | Replies | Views | Activity
Creating DPO Dataset Using Llama | 0 | 319 | July 5, 2024
Git push running but not appearing online | 1 | 394 | August 21, 2024
Feasibility of Fine-Tuning GPT2-XL Model on 3060 RTX GPU for Academic Misinformation Identification | 0 | 59 | August 22, 2024
Different padding behaviour of data collator | 0 | 68 | August 23, 2024
Finetuning a pre-trained model | 0 | 52 | August 21, 2024
Continuous Crash | 1 | 131 | July 3, 2024
Regarding Tokenizer | 0 | 49 | July 5, 2024
Storing and loading KV cache | 6 | 1331 | October 21, 2024
How to use pytorch to process variance sequence | 0 | 6 | August 24, 2024
How to use fine tuned a pre-trained text to image model? | 0 | 35 | August 22, 2024
User Study with AI researchers and development team members | 0 | 52 | August 21, 2024
OutOfMemoryError: CUDA out of memory. Tried to allocate 2.00 MiB. GPU | 6 | 1533 | July 7, 2024
Problems with Jupyter | 0 | 66 | July 3, 2024
GPU memory usage of optimizer's states when using LoRA | 4 | 665 | July 5, 2024
Qwen-VL Parallel GPU run not able to solve | 1 | 1037 | February 20, 2024
UserWarning: Was asked to gather along dimension 0, but all input tensors were scalars; will instead unsqueeze and return a vector when running trainer | 2 | 1452 | September 10, 2024
The model EleutherAI/gpt-j-6B is too large to be loaded automatically | 1 | 1474 | August 22, 2024
Collapse duplicates in dataset and treat it as usual | 5 | 236 | July 5, 2024
Convert model clip to onnx | 0 | 187 | July 5, 2024
Hiring Developer | 0 | 50 | July 5, 2024
Excluding Prompt from Language Model's Response | 1 | 946 | July 3, 2024
ValueError: expected sequence of length 1024 at dim 1 (got 507) | 1 | 869 | May 26, 2024
Updating model and tokenizers inside Trainer.train | 0 | 33 | August 23, 2024
GPT-NEO 1.3 always gives same output | 0 | 13 | August 22, 2024
Image dataype issue | 0 | 52 | July 3, 2024
SFTTrainer takes up so much ram that it breaks an A100 GPU | 0 | 191 | July 6, 2024
Need an AI Generator for a Fashion Brand | 1 | 505 | July 8, 2024
Training Diffuser Model on Colab GPU - 'nvidia-smi' Error & Feasibility | 1 | 59 | August 23, 2024
Natural language query to Elasticsearch query conversion | 0 | 259 | August 22, 2024
Llama-3 70b - Probability outputs appear "quantized" using non-quantized model (but not with quantized model) | 0 | 62 | August 22, 2024