| Topic | Replies | Views | Activity |
|---|---|---|---|
| Multiple GPU in SFTTrainer | 4 | 3050 | December 27, 2024 |
| Keep hitting 500 Internal server error when trying to launch gradio app in Spaces | 6 | 2560 | December 13, 2024 |
| Fine-tuning a multimodal model | 4 | 5375 | December 25, 2024 |
| Pro Account $2 inference limit | 8 | 1255 | March 23, 2025 |
| Trainer only doing 3 epochs no matter the TrainingArguments! | 5 | 15185 | June 20, 2022 |
| Predicting On New Text With Fine-Tuned Multi-Label Model | 4 | 5184 | December 23, 2021 |
| Getting cannot import name 'is_npu_available' from 'accelerate.utils' | 2 | 6684 | July 6, 2024 |
| How to generate samples of summaries with Pegasus? | 3 | 1024 | October 16, 2023 |
| Looking for good package for summarizing large quantities of qualitative survey responses | 0 | 364 | May 22, 2023 |
| How to use the fine-tuned model for actual prediction after re-loading it | 5 | 14606 | August 10, 2022 |
| How to use llm model's api? | 2 | 3627 | November 14, 2024 |
| Good models for few-shot multi-label text classification | 0 | 1947 | March 23, 2022 |
| Datasets: Limit the number of rows? | 4 | 8672 | December 17, 2023 |
| How can I go about building Grammarly for my local language? | 1 | 1358 | November 7, 2020 |
| Not sure how to compute BLEU through compute_metrics | 5 | 4400 | November 3, 2023 |
| Save, load and do inference with fine-tuned model | 3 | 17044 | March 8, 2024 |
| Handling long text in BERT for Question Answering | 7 | 12032 | March 10, 2022 |
| How to revert to an earlier commit on a repo? | 4 | 4755 | January 26, 2024 |
| Error in fine-tuning BERT | 8 | 6268 | February 21, 2022 |
| HTTPError: 429 Client Error: Too Many Requests for url | 0 | 1869 | January 12, 2023 |
| How do I increase max_new_tokens | 3 | 29535 | August 19, 2023 |
| ERROR: Access denied: repository is gated and you are not in the authorized list | 4 | 4642 | May 9, 2025 |
| 4bit finetuning LLM: "No inf checks were recorded for this optimizer." If I don't use Abirate/english_quotes | 2 | 3359 | April 15, 2024 |
| Error "ModuleNotFoundError: No module named 'gradio'" | 0 | 10277 | September 21, 2023 |
| An error occurred while fetching the blob | 1 | 1291 | November 14, 2024 |
| Problem with loading custom dataset from jsonl file | 1 | 12878 | May 5, 2023 |
| How to load training_args | 5 | 7430 | September 5, 2025 |
| Getting error when trying to log into hugging face account | 2 | 3282 | September 10, 2025 |
| Problem access public model? | 2 | 1022 | January 30, 2025 |
| ValueError: You need to specify either `text` or `text_target` when using evaluator | 1 | 3952 | August 27, 2024 |
| [tool] easy branch rebase | 0 | 312 | September 17, 2020 |
| Replacing last layer of a fine-tuned model to use different set of labels | 6 | 6613 | December 23, 2021 |
| Can I create a folder in the repo through the website? | 1 | 3912 | May 4, 2023 |
| ModuleNotFoundError: No module named 'transformers.modeling_outputs' | 2 | 10057 | May 16, 2023 |
| EvalPrediction returning one less prediction than label id for each batch | 7 | 6156 | June 19, 2024 |
| What's the difference between bart-base tokenizer and bart-large tokenizer | 6 | 2064 | December 6, 2020 |
| Repository Not Found for url: https://huggingface.co/bigscience/bloom-1b3/resolve/main/config.json | 3 | 26477 | September 21, 2023 |
| Loading a model in an app when using HF Spaces | 0 | 1672 | November 26, 2023 |
| Using Token to Access Llama2 | 3 | 14825 | February 21, 2024 |
| Training with varying lengths of sequences | 0 | 1649 | May 31, 2023 |
| [Tokenizers] What is this max_length number? | 3 | 2536 | March 3, 2025 |
| Question-Answering/Text-generation/Summarizing: Fine-tune on multiple answers | 8 | 5304 | November 20, 2021 |
| Scores in generate() | 6 | 10615 | May 26, 2023 |
| ValueError: The batch received was empty, your model won't be able to train on it. Double-check that your training dataset contains keys expected by the model: args,kwargs,label_ids,label | 8 | 5208 | June 11, 2025 |
| How to finetune a bert model to a Summarizer | 2 | 5015 | March 7, 2022 |
| Is there a way to correctly load a pre-trained transformers model without the configuration file? | 6 | 18056 | August 13, 2021 |
| Your space is on error, check its status on hf.co | 6 | 5684 | September 10, 2024 |
| Best LLMs that can run on 4gb VRAM | 2 | 4813 | January 22, 2025 |
| Character-level tokenizer | 6 | 9824 | May 8, 2024 |
| Anywhere where I can read more about the `device_map` kwarg in `from_pretrained`? | 2 | 14981 | January 5, 2024 |