Beginners

Topic	Replies	Views	Activity
Multiple GPU in SFTTrainer	4	3050	December 27, 2024
Keep hitting 500 Internal server error when trying to launch gradio app in Spaces	6	2560	December 13, 2024
Fine-tunening a multimodal model	4	5375	December 25, 2024
Pro Account $2 inference limit	8	1255	March 23, 2025
Trainer only doing 3 epochs no matter the TrainingArguments!	5	15185	June 20, 2022
Predicting On New Text With Fine-Tuned Multi-Label Model	4	5184	December 23, 2021
Getting cannot import name 'is_npu_available' from 'accelerate.utils'	2	6684	July 6, 2024
How to generate a samples of summaries with Pegasus?	3	1024	October 16, 2023
Looking for good package for summarizing large quantities of qualitative survey responses	0	364	May 22, 2023
How to use the fine-tuned model for actual prediction after re-loading it	5	14606	August 10, 2022
How to use llm model's api?	2	3627	November 14, 2024
Good models for few-shot multi-label text classification	0	1947	March 23, 2022
Datasets: Limit the number of rows?	4	8672	December 17, 2023
How can I go about building Grammarly for my local language?	1	1358	November 7, 2020
Not sure how to compute BLEU through compute_metrics	5	4400	November 3, 2023
Save, load and do inference with fine-tuned model	3	17044	March 8, 2024
Handling long text in BERT for Question Answering	7	12032	March 10, 2022
How to revert to an earlier commit on a repo?	4	4755	January 26, 2024
Error in fine-tuning BERT	8	6268	February 21, 2022
HTTPError: 429 Client Error: Too Many Requests for url	0	1869	January 12, 2023
How do I increase max_new_tokens	3	29535	August 19, 2023
ERROR: Access denied: repository is gated and you are not in the authorized list	4	4642	May 9, 2025
4bit finetuning LLM: "No inf checks were recorded for this optimizer." If I don't use Abirate/english_quotes	2	3359	April 15, 2024
Error " ModuleNotFoundError: No module named 'gradio'"	0	10277	September 21, 2023
An error occurred while fetching the blob	1	1291	November 14, 2024
Problem with loading custom dataset from jsonl file	1	12878	May 5, 2023
How to load training_args	5	7430	September 5, 2025
Getting error when trying to log into hugging face account	2	3282	September 10, 2025
Problem access public model?	2	1022	January 30, 2025
ValueError: You need to specify either `text` or `text_target` when using evaluator	1	3952	August 27, 2024
[tool] easy branch rebase	0	312	September 17, 2020
Replacing last layer of a fine-tuned model to use different set of labels	6	6613	December 23, 2021
Can I create a folder in the repo trough the website?	1	3912	May 4, 2023
ModuleNotFoundError: No module named 'transformers.modeling_outputs'	2	10057	May 16, 2023
EvalPrediction returning one less prediction than label id for each batch	7	6156	June 19, 2024
What's the difference between bart-base tokenizer and bart-large tokenizer	6	2064	December 6, 2020
Repository Not Found for url: https://huggingface.co/bigscience/bloom-1b3/resolve/main/config.json	3	26477	September 21, 2023
Loading a model in an app when using HF Spaces	0	1672	November 26, 2023
Using Token to Access Llama2	3	14825	February 21, 2024
Training with varying lengths of sequences	0	1649	May 31, 2023
[Tokenizers]What this max_length number?	3	2536	March 3, 2025
Question-Answering/Text-generation/Summarizing: Fine-tune on multiple answers	8	5304	November 20, 2021
Scores in generate()	6	10615	May 26, 2023
ValueError: The batch received was empty, your model won't be able to train on it. Double-check that your training dataset contains keys expected by the model: args,kwargs,label_ids,label	8	5208	June 11, 2025
How to finetune a bert model to a Summarizer	2	5015	March 7, 2022
Is there a way to correctly load a pre-trained transformers model without the configuration file?	6	18056	August 13, 2021
Your space is on error, check its status on hf.co	6	5684	September 10, 2024
Best LLMs that can run on 4gb VRAM	2	4813	January 22, 2025
Character-level tokenizer	6	9824	May 8, 2024
Anywhere where I can read more about the `device_map` kwarg in `from_pretrained`?	2	14981	January 5, 2024