Hugging Face Forums

Topic	Replies	Views	Activity
When using SGD: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn 🤗Transformers	0	1894	October 9, 2023
Aggregate AB test dataset 🤗Hub	0	258	October 9, 2023
Problem with custom metric for custom T5 model Beginners	1	762	October 9, 2023
About intermediate variable text 🔒 Gradio	1	179	October 9, 2023
CUDA is out of memory Beginners	3	3300	October 9, 2023
How to set trust_remote_code=True for prompt-tuning fine-tuning for local deployment models 🤗AutoTrain	1	1507	October 9, 2023
I want to create a small model that could optimize code. What are your suggestions? Beginners	3	526	October 9, 2023
Does accelerate API support FSDP on TPU Pods? (accelerate config doesn't seem to allow this) 🤗Accelerate	0	400	October 8, 2023
[Errno 2] No such file or directory: 'ffmpeg' Beginners	4	3629	October 9, 2023
Gradio message limit is there a way? 🔒 Gradio	4	734	October 8, 2023
Get model downloads count oneach month Models	0	241	October 8, 2023
Add additional trainable layers to StableDiffusion for fine-tuning 🧨 Diffusers	0	1007	October 8, 2023
Intermediate features from a Huggingface pretrained model 🤗Transformers	0	323	October 8, 2023
Error with BertTokenizerFast: AttributeError - 'function' object has no attribute 'get' Beginners	0	635	October 8, 2023
Tried to download Mistral 7B but got an error message 🤗Transformers	3	13271	October 8, 2023
Where to find documentation on dataset format for finetuning Beginners	0	277	October 7, 2023
Converting pytorch_model.bin (Whisper )to .pt Beginners	0	643	October 8, 2023
Saving a Bert model Beginners	2	645	October 8, 2023
torch.cuda.OutOfMemoryError when evaluate while traning 🤗Transformers	0	510	October 8, 2023
Vocab-Transformers vs Sentence-Transformers Site Feedback	0	631	October 8, 2023
Can I use sentence-transformers with tensorflow? 🤗Transformers	1	341	October 8, 2023
Single batch training on multi-gpu 🤗Accelerate	1	986	October 8, 2023
Trained a tokenizer from scratch but problem when loading 🤗Transformers	0	478	October 8, 2023
Qunatized model with LORA takes much more GPU memory than the un-quantized model with LORA for the (E-5-Large Embedding Transformer) 🤗Transformers	4	1723	October 8, 2023
I was using huugginfface meta-llama/Llama-2-7b-chat-hf and im facing an error 🤗Tokenizers	2	2565	October 8, 2023
LayoutLMV3 on dataset other than english Beginners	0	201	October 8, 2023
TrainingArgument 🤗Transformers	3	8215	October 8, 2023
How to join separate strings to translate them together for better speed? Models	0	228	October 7, 2023
Fine-tuning: Merge or chain? Beginners	2	954	October 7, 2023
Using fine-tuned model that wasn't explicitly saved Beginners	2	1054	October 7, 2023