Best model for human like conversations?
|
|
1
|
3414
|
April 11, 2024
|
How many GB of RAM do I need to train DBRX?
|
|
2
|
231
|
April 11, 2024
|
Moving tokenizer outputs to CUDA taking way too long
|
|
7
|
1883
|
April 11, 2024
|
AttributeError: 'list' object has no attribute '__module__' when loading model from file system with from_pretrained
|
|
0
|
245
|
April 11, 2024
|
How to have no preset values sent into .compute() in Huggingface evaluate metrics?
|
|
2
|
418
|
April 11, 2024
|
Model Parallelism and Pipelining for Model Training
|
|
3
|
3082
|
April 11, 2024
|
Tensor size error when generating embeddings for documents using pre-trained models
|
|
3
|
483
|
April 11, 2024
|
Dataset download limits
|
|
0
|
234
|
April 11, 2024
|
🔬 Exploring Reinforcement Learning for Molecule Generation with GPT-Based Models; Loss Fluctuations
|
|
2
|
272
|
April 11, 2024
|
AI Chatbot Can be used in meditation app development
|
|
0
|
81
|
April 11, 2024
|
Need help to harness the power of generative AI for product images
|
|
0
|
189
|
April 11, 2024
|
Getting torch import error
|
|
1
|
335
|
April 11, 2024
|
Evaluate installation issues
|
|
0
|
104
|
April 11, 2024
|
How to train a proper StyleGan Model
|
|
0
|
230
|
April 11, 2024
|
Hugging Face UI
|
|
0
|
183
|
April 11, 2024
|
How to resume training from checkpoint
|
|
0
|
533
|
April 11, 2024
|
What is the behaviour of cosine scheduler and warm up steps when setting using epochs?
|
|
1
|
258
|
April 10, 2024
|
How to display dataset feature on datasetcard?
|
|
0
|
145
|
April 10, 2024
|
I cant get past this any ideas
|
|
0
|
163
|
April 10, 2024
|
QLoRA trained Mixtral 8x7B deployment error on Sagemaker using text generation inference image
|
|
0
|
303
|
April 10, 2024
|
Search models by tokenizer
|
|
0
|
91
|
April 10, 2024
|
Trainer.predict return predictions=None
|
|
1
|
214
|
April 10, 2024
|
Computing log probability of an arbitrary sequence given another sequence
|
|
1
|
1951
|
April 10, 2024
|
What happened to the openchat model in the huggingfacechat
|
|
0
|
232
|
April 10, 2024
|
Using evaluate.evaluator on a PEFT model
|
|
1
|
236
|
April 10, 2024
|
Missing config.json file after AutoTraining
|
|
7
|
8218
|
April 10, 2024
|
Turkish NLP - Introductions
|
|
31
|
5594
|
April 10, 2024
|
Fine-tune a Llama 2 7b hf take 160 hours on RTX 4070?
|
|
1
|
1210
|
April 10, 2024
|
Can not resume my endpoints, always receiving download Error
|
|
1
|
450
|
April 10, 2024
|
Preprocessing of dataset
|
|
0
|
172
|
April 10, 2024
|