Conversion from finetune m2m_100 model to huggingface format
|
|
0
|
111
|
April 22, 2024
|
What is the difference between tokenizer.eos_token_id, model.config.eos_token_id and model.generation_config.eos_token_id?
|
|
1
|
1578
|
April 22, 2024
|
ETA for training time is 60k hours for first generation
|
|
0
|
104
|
April 22, 2024
|
How to load_dataset from local directory?
|
|
0
|
263
|
April 22, 2024
|
Loading datasets on MacOS X causing segmentation fault
|
|
1
|
397
|
April 22, 2024
|
How to create a `log` to record the app's usage?
|
|
0
|
60
|
April 22, 2024
|
TypeError: '>' not supported between instances of 'NoneType' and 'int' - Error while training distill bert
|
|
6
|
8318
|
April 22, 2024
|
Error after 501 steps
|
|
2
|
509
|
April 22, 2024
|
Query regarding Generating Network Commands using "Intention" of users , using Large Language Models
|
|
0
|
168
|
April 21, 2024
|
Inference input token number set as the max length always?
|
|
10
|
1228
|
April 21, 2024
|
Issue with Optuna visualization in web browser
|
|
0
|
173
|
April 20, 2024
|
Pipeline device issue, torch_xla generation() bug, flax models malloc errors
|
|
0
|
166
|
April 21, 2024
|
Unbiased chat or LLM required. Or even a model that I can retrain without using code. Please
|
|
0
|
329
|
April 21, 2024
|
Failed to import Trainer: unhashable type 'list
|
|
2
|
1183
|
April 21, 2024
|
500 internal error at huggingface chat
|
|
0
|
195
|
April 21, 2024
|
Config parameters for custom models
|
|
0
|
105
|
April 21, 2024
|
Can I use fine-tuned model with TGI?
|
|
0
|
189
|
April 21, 2024
|
Total downloads not showing
|
|
0
|
196
|
April 21, 2024
|
FineTune LLM for regex
|
|
3
|
2062
|
April 21, 2024
|
Unable to create a space with zero gpu
|
|
0
|
444
|
April 21, 2024
|
Interface API deployment
|
|
0
|
89
|
April 21, 2024
|
How to set different sizes for `input` and `output` on `gr.Interface`
|
|
0
|
57
|
April 21, 2024
|
Model Parallism
|
|
0
|
181
|
April 21, 2024
|
Policy to get inactive usernames?
|
|
0
|
150
|
April 21, 2024
|
How to run single-node, multi-GPU training with HF Trainer and deepspeed?
|
|
1
|
1429
|
April 21, 2024
|
How to show figure which is plotted by `matplotlib`?
|
|
0
|
72
|
April 21, 2024
|
Where does .tokens() come from/inherit from in hugging face
|
|
3
|
130
|
April 21, 2024
|
Which Open Source LLM is suitable for training? Mistral-7B or Llama2-7B?
|
|
0
|
881
|
April 21, 2024
|
KeyError: 'length' when using using load_dataset on Sagemaker
|
|
3
|
1757
|
April 21, 2024
|
KeyError: 'length' when loading dataset by load_from_disk
|
|
1
|
1062
|
April 21, 2024
|