Trainer's `save_model` isn't saving the entire state_dict and is only saving the embedding/encoder
|
|
1
|
1420
|
January 2, 2024
|
Allow Multiple Processes at Once
|
|
0
|
291
|
January 2, 2024
|
Need Help! Open-Source models for funtion calling
|
|
0
|
472
|
January 2, 2024
|
Contextual Recommendation of Adages, Allusions, Anecdotes, Aphorisms, Jokes, Proverbs, Quotes, Lyrics, Poems, Stories, and Witticisms
|
|
1
|
272
|
January 15, 2024
|
Continue pretraining on a released model
|
|
0
|
785
|
January 1, 2024
|
Training Uncensored AI w/ My Data Set
|
|
0
|
694
|
January 1, 2024
|
AutoTrain (UI) error
|
|
0
|
382
|
January 1, 2024
|
Create custom character
|
|
0
|
195
|
January 1, 2024
|
Accessing logits when using Trainer API
|
|
1
|
325
|
January 1, 2024
|
Many ambiguous unicode characters for trained tokenizer
|
|
0
|
364
|
December 31, 2023
|
CUDA out of memory when using the trainer model_init
|
|
0
|
246
|
December 31, 2023
|
How the hugging face embeddings size is too low?
|
|
0
|
225
|
December 31, 2023
|
How can we store multiple embeddings and combine into single embedding?
|
|
0
|
438
|
December 31, 2023
|
What's the pipeline task string for paraphrase sentence detection
|
|
0
|
186
|
December 31, 2023
|
Confusing (and possibly misleading) PPO Trainer Code from TRL API Doc Tutorial
|
|
2
|
451
|
January 2, 2024
|
Accelerate stalls when using Tensor Dataset
|
|
0
|
309
|
December 31, 2023
|
When using an SDXL base and refiner, should LORAs be sent to both?
|
|
0
|
721
|
December 30, 2023
|
Llamma index Saving and Loading
|
|
1
|
743
|
January 2, 2024
|
Optimizing LLM Inference with One Base LLM and Multiple LoRA Adapters for Memory Efficiency
|
|
1
|
4529
|
January 20, 2024
|
Fine-tuning not producing results with default settings on HF
|
|
1
|
740
|
January 31, 2024
|
SpeechT5 Text to Speech fine tuning runtime error
|
|
1
|
307
|
January 12, 2024
|
Datasets-cli test failed when generating metadata due to the use of Array2D
|
|
4
|
303
|
January 6, 2024
|
Getting model.save_pretrained() to save the best iteration when fine tuning a model with keras (using callbacks)?
|
|
0
|
244
|
December 30, 2023
|
Forebidden Embedding
|
|
3
|
585
|
January 3, 2024
|
Tokenizer.add_tokens automatically convert ESM2 new token to special
|
|
1
|
340
|
January 8, 2024
|
Openai/whisper-large-v3 ONNX validation
|
|
2
|
1091
|
December 30, 2023
|
Colab CUDA OOM using Llama-2-7b-chat-hf even with 40GPU RAM
|
|
0
|
896
|
December 29, 2023
|
GPT2TokenizerFast tokenzied output
|
|
0
|
153
|
December 29, 2023
|
Connection Error when Accessing Dataset URL on Hugging Face
|
|
5
|
4819
|
February 2, 2024
|
Can't deploy my custom endpoint with my Concrete ML package
|
|
3
|
231
|
December 29, 2023
|