Hugging Face Forums

Topic	Replies	Views	Activity
Creating DPO Dataset Using Llama Beginners	0	319	July 5, 2024
Git push running but not appearing online 🤗Hub	1	394	August 21, 2024
Feasibility of Fine-Tuning GPT2-XL Model on 3060 RTX GPU for Academic Misinformation Identification 🤗Transformers	0	59	August 22, 2024
Different padding behaviour of data collator 🤗Transformers	0	68	August 23, 2024
Finetuning a pre-trained model Intermediate	0	52	August 21, 2024
Continuous Crash Beginners	1	131	July 3, 2024
Regarding Tokenizer Beginners	0	49	July 5, 2024
Storing and loading KV cache 🤗Transformers	6	1331	October 21, 2024
How to use pytorch to process variance sequence Beginners	0	6	August 24, 2024
How to use fine tuned a pre-trained text to image model? 🧨 Diffusers	0	35	August 22, 2024
User Study with AI researchers and development team members Research	0	52	August 21, 2024
OutOfMemoryError: CUDA out of memory. Tried to allocate 2.00 MiB. GPU Spaces	6	1533	July 7, 2024
Problems with Jupyter Spaces	0	66	July 3, 2024
GPU memory usage of optimizer's states when using LoRA DeepSpeed	4	665	July 5, 2024
Qwen-VL Parallel GPU run not able to solve Models	1	1037	February 20, 2024
UserWarning: Was asked to gather along dimension 0, but all input tensors were scalars; will instead unsqueeze and return a vector when running trainer Beginners	2	1452	September 10, 2024
The model EleutherAI/gpt-j-6B is too large to be loaded automatically 🤗Hub	1	1474	August 22, 2024
Collapse duplicates in dataset and treat it as usual 🤗Datasets	5	236	July 5, 2024
Convert model clip to onnx Models	0	187	July 5, 2024
Hiring Developer Community Calls	0	50	July 5, 2024
Excluding Prompt from Language Model's Response Beginners	1	946	July 3, 2024
ValueError: expected sequence of length 1024 at dim 1 (got 507) Beginners	1	869	May 26, 2024
Updating model and tokenizers inside Trainer.train Models	0	33	August 23, 2024
GPT-NEO 1.3 always gives same output Beginners	0	13	August 22, 2024
Image dataype issue Beginners	0	52	July 3, 2024
SFTTrainer takes up so much ram that it breaks an A100 GPU 🤗Transformers	0	191	July 6, 2024
Need an AI Generator for a Fashion Brand Community Calls	1	505	July 8, 2024
Training Diffuser Model on Colab GPU - 'nvidia-smi' Error & Feasibility Beginners	1	59	August 23, 2024
Natural language query to Elasticsearch query conversion Beginners	0	259	August 22, 2024
Llama-3 70b - Probability outputs appear "quantized" using non-quantized model (but not with quantized model) Beginners	0	62	August 22, 2024