| Topic | Replies | Views | Date |
|---|---|---|---|
| How can I get the Trainer to log steps in a dictionary? | 0 | 135 | May 17, 2024 |
| How to control the GPU id for loading model weights when fine-tuning a Llama 8B model with the Trainer? | 0 | 76 | May 17, 2024 |
| My Hugging Face account was deleted | 1 | 307 | May 17, 2024 |
| Speech to text on constrained hardware (embedded) | 0 | 101 | May 16, 2024 |
| Cannot import name 'WhisperForAudioClassification' | 0 | 161 | May 16, 2024 |
| What model will fit better for Email Parsing and Data Extraction | 1 | 679 | May 16, 2024 |
| Environment Variables now supported on endpoints? | 2 | 261 | May 16, 2024 |
| Feature Request: Elastic Launch Support in `notebook_launcher` | 0 | 127 | May 16, 2024 |
| Isn't KV cache influenced by position encoding in inference? | 3 | 930 | May 16, 2024 |
| HuggingFace account randomly deleted | 1 | 422 | May 16, 2024 |
| Dataset with no splits | 4 | 3545 | May 16, 2024 |
| Infinite Building with Langflow | 1 | 274 | May 16, 2024 |
| AutoTrain to support KTO | 0 | 150 | May 16, 2024 |
| An avoidable billing error | 0 | 104 | May 16, 2024 |
| Trainer.train(resume_from_checkpoint=True) | 9 | 15384 | May 16, 2024 |
| Finetuning DistilBERT for NER | 0 | 147 | May 16, 2024 |
| Fine-tuning an LLM | 2 | 4460 | May 16, 2024 |
| Finetuning T5 series models with my own data | 0 | 143 | May 16, 2024 |
| ModuleNotFoundError: No module named 'transformers.agents' | 2 | 752 | May 16, 2024 |
| Can't change max_input_length of Text Generation Inference | 0 | 137 | May 15, 2024 |
| Questions about Mistral and apply_chat_template with Text Generation Inference, OpenAI API and messages API | 0 | 175 | May 15, 2024 |
| Why follow Flan-T5 template when T5 tokenizer ignores multiple newlines | 0 | 114 | May 15, 2024 |
| Decoder only model - how to have it not include the prompt in its output? | 3 | 668 | May 15, 2024 |
| How to pretrain randomized language model with custom dataset | 0 | 64 | May 15, 2024 |
| PEFT prompt tuning for SEQ_CLS with BERT causes unexpected keyword argument 'label' | 0 | 268 | May 15, 2024 |
| Can any model actually write current Rust? | 2 | 508 | May 15, 2024 |
| Question regarding adding a 4080 (and 3080?) to a 4090 rig for AI | 2 | 491 | May 15, 2024 |
| ValueError: Unrecognized configuration class <class 'transformers.models.whisper.configuration_whisper.WhisperConfig'> | 0 | 246 | May 15, 2024 |
| How to train an LLM on a native language | 0 | 375 | May 15, 2024 |
| Uploading a large trained model | 6 | 2043 | May 15, 2024 |