Intermediate

Topic	Replies	Views	Activity
How is it possible to get GPU memory errors when increasing the gradient_accumulation steps?	1	1350	January 22, 2024
Why does Hugging Face's push_to_hub convert saved models to .bin instead of using safetensor mode?	2	1955	September 6, 2023
AttributeError: 'NoneType' object has no attribute 'keys'	0	3345	April 3, 2023
MLflowCallback TypeError: can only concatenate list (not "type") to list	3	1644	November 16, 2021
Found some inconsistency on CLIPTokenizer, but how should we fix this?	0	583	October 6, 2022
What are the limits on saving private models and datasets on the hub?	4	1463	April 29, 2024
Huggingface.co site scam people	3	912	January 10, 2023
Multi-gpu batch processing fails when using Peft Lora with Huggingface	1	1276	March 8, 2024
Token Classification Label order	0	564	November 11, 2022
Batched pipeline for Question-Answering	0	557	April 28, 2022
FSDP with Trainer class: AlgorithmError: ValueError('Cannot flatten integer dtype tensors'), exit code: 1	0	549	June 13, 2024
Can't save the tensorflow model of nvidia/mit-b5	3	154	December 19, 2024
Suggestions for hugging face transformer models for Code and Formal Languages	2	1746	May 3, 2022
Zero shot classification pipeline customization	2	1747	April 27, 2022
Generate desired text output based on model training	3	263	December 17, 2024
What is ViTImageProcessor doing?	3	1479	April 18, 2024
Calculate Impact of Input Tokens on BERT Output Probability	1	2060	July 24, 2020
Why does tokenizer.apply_chat_template() add multiple eos tokens?	4	734	September 19, 2024
Fine-tune model with CoT	1	363	January 27, 2025
Transforming Pushed Hugging Face Models into Usable GGUF Models for Local Colab Use	2	1644	March 15, 2024
Extracting HuBERT hidden units	1	1127	July 26, 2022
Sentence Pair Classification	1	1991	May 4, 2022
Upload a TF model to Huggingface	6	1063	September 1, 2021
Regression with Graph Convolutional Networks	0	500	February 22, 2022
Help with Tokenizer Word Length Limit	2	1613	July 16, 2023
Save a Bert model with custom forward function and heads on Hugginface	1	1967	June 7, 2022
Combining OpenAI Embeddings and OpenAI CLIP embeddings?	1	1963	July 10, 2023
BLIP How to combine embeddings for multimodal search?	1	1959	January 11, 2024
Adding Preprocessing to Hosted Inference API	4	1215	April 14, 2022
How to finetune/instruction-tune a large language model on a QA corpus?	1	1912	January 20, 2024
Does fine-tuning a language model modify its hidden weights?	1	594	August 10, 2021
What is the best way to tackle OOV	0	472	April 6, 2022
What is the correct way to create a feature extractor for a hugging face (HF) ViT model?	1	1049	April 6, 2023
BERT Multilabel - Different Training Dataset For Each Label?	3	1300	December 27, 2021
Token classification on custom BERT and data	2	1499	December 28, 2020
RuntimeError: CUDA out of memory	1	1020	April 15, 2021
Using Roberta for Sentence2Vec	3	1258	April 11, 2021
Parallelise pipelines on a single GPU?	3	692	October 31, 2024
Change saving metric in Trainer	2	1417	May 18, 2024
Huggingface_hub.client giving error on list_deployed_models()	2	79	March 3, 2025
How to pass table structure to LLM model	2	1402	May 1, 2024
How to save model in Colab during TPU training with Accelerate	2	1385	November 19, 2021
Finetuning of conversational model without train data in conversation style	1	1696	February 2, 2024
How to fix RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn	1	1690	June 10, 2024
Zero shot classification and Onnx	2	1376	April 27, 2022
Out of memory training 3B param model on 8 GPU (320GB memory) with FSDP	1	1680	July 28, 2023
Calculating perplexity from hidden_states	2	1368	March 21, 2023
[LMM Fine Tuning] Supervised Fine Tuning Trainer (SFTTrainer) vs transformers Trainer	1	1667	November 29, 2023
Run_mlm.py using --sharded_ddp "zero_dp_3 offload" gives AssertionError	3	1174	April 21, 2021
Combining tokenizer.decode and model.generate scores for probability prediction	2	1330	March 1, 2023