Change model download folder?
|
|
1
|
10235
|
October 17, 2023
|
Some clarification on Conv1D
|
|
1
|
1789
|
May 1, 2024
|
Adding custom vocabularies on Whisper
|
|
7
|
27344
|
March 25, 2025
|
Training a model for a PDF with OCR - where to begin?
|
|
4
|
10883
|
October 27, 2024
|
Missing config.json file after AutoTraining
|
|
7
|
8425
|
April 10, 2024
|
Downloading a dataset files locally
|
|
3
|
37564
|
November 4, 2024
|
Loading and saving a model
|
|
2
|
13378
|
September 14, 2024
|
Trainer never invokes compute_metrics
|
|
7
|
7624
|
July 25, 2025
|
New battle in AI field
|
|
3
|
336
|
January 25, 2025
|
Multi-GPU LLM inference data parallelism (llama)
|
|
1
|
14415
|
October 25, 2023
|
Clear GPU memory of transformers.pipeline
|
|
6
|
24314
|
March 19, 2025
|
Get word embeddings from transformer model
|
|
1
|
13939
|
June 17, 2021
|
Using pipelin on pytorch mps device
|
|
0
|
1963
|
July 4, 2022
|
Is it possible to clone a dataset/repo from huggingface over ssh?
|
|
2
|
19803
|
April 19, 2023
|
Recommended hardware for running LLMs locally
|
|
2
|
34904
|
December 18, 2023
|
K fold cross validation
|
|
5
|
13060
|
July 29, 2023
|
Thoughts on quantity of training data for fine tuning
|
|
6
|
20742
|
March 10, 2022
|
Speeding up GPT2 generation
|
|
3
|
4812
|
October 29, 2020
|
Private repo wget download not working why?
|
|
6
|
6031
|
April 20, 2025
|
How do you use the whoami endpoint?
|
|
1
|
1998
|
March 18, 2022
|
Custom embedding / prompt tuning
|
|
0
|
1583
|
September 20, 2021
|
Resume Training with Lower Learning Rate
|
|
3
|
1405
|
January 5, 2025
|
Generating train split slow?
|
|
0
|
1540
|
June 22, 2022
|
Getting error OSError: Looks like you do not have git-lfs installed,
|
|
3
|
4250
|
February 17, 2024
|
Saving-Loading Model in Colab and Making Predictions
|
|
2
|
15425
|
June 15, 2021
|
Two errors (Xformers not installed correctly; 'RWForCausalLM' not supported) on my first attempt
|
|
5
|
10834
|
November 29, 2024
|
How to train T5 with Tensorflow
|
|
8
|
4944
|
October 27, 2022
|
Evaluation became slower and slower during Trainer.train()
|
|
8
|
4691
|
February 3, 2025
|
Suport for Julia
|
|
0
|
1394
|
February 4, 2022
|
Run models on a desktop computer?
|
|
7
|
84690
|
June 16, 2024
|
Can not find adapter_config.json using PeftConfig.from_pretrained
|
|
8
|
13990
|
July 11, 2024
|
Create own dataset for NER
|
|
3
|
6352
|
November 22, 2023
|
What is the difference between forward() and generate()?
|
|
3
|
11116
|
December 25, 2023
|
"Load Diffusion Model" and "Unet Loader (GGUF)" null/undefined
|
|
8
|
7248
|
March 22, 2025
|
Use custom LogitsProcessor in `model.generate()`
|
|
2
|
6900
|
March 14, 2023
|
Multiple tasks for one fine-tuned LLM
|
|
2
|
6766
|
September 18, 2023
|
Defining a custom dataset for fine-tuning translation
|
|
4
|
5097
|
July 10, 2021
|
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation
|
|
5
|
46305
|
September 24, 2024
|
Extract data from text and parse it as a JSON
|
|
6
|
23955
|
August 6, 2024
|
Why is the lm_head layer in GPT2LMHeadModel not a parameter?
|
|
5
|
8170
|
September 29, 2023
|
Push_to_hub usage errors?
|
|
8
|
11675
|
August 22, 2023
|
I don't understand the difference between asymmetric retrieval, sentence similarity, and semantic search
|
|
2
|
6364
|
July 28, 2023
|
How to plot learning curves from Trainer
|
|
0
|
614
|
June 29, 2022
|
Need advice: PytorchStreamReader failed reading zip archive
|
|
6
|
12927
|
January 24, 2025
|
Is large language model and foundation model the same thing?
|
|
4
|
8343
|
August 22, 2022
|
Running transformer models on mps instead of cpu on mac
|
|
1
|
2341
|
January 18, 2025
|
What is best way to compute document similarity?
|
|
1
|
4148
|
June 21, 2022
|
SSL Error - Max retries
|
|
7
|
20390
|
March 17, 2025
|
Get number of parameters for different parts of a model
|
|
0
|
5763
|
May 10, 2021
|
Does fine-tuning mean retraining the entire model?
|
|
2
|
5861
|
November 22, 2022
|