Create a continuous set of generated images - same style and characters
|
|
0
|
158
|
May 20, 2024
|
What model and approach should I use for my use case?
|
|
2
|
164
|
May 20, 2024
|
Build pretrained Hugging Face Whisper for TensorRT-LLM
|
|
0
|
324
|
May 20, 2024
|
How to use `inputs_embed` and `attention_mask` together?
|
|
1
|
957
|
May 19, 2024
|
What is the correct way to parse data for DPO? Do you separate out the prompt or not?
|
|
0
|
212
|
May 19, 2024
|
GPU memory usage is twice (2x) what I calculated based on number of parameters and floating point precision
|
|
5
|
457
|
May 18, 2024
|
Change saving metric in Trainer
|
|
2
|
1485
|
May 18, 2024
|
Huggingface token returning an invalid token
|
|
1
|
1453
|
May 17, 2024
|
Can't change max_input_length of Text Generation Inference
|
|
0
|
137
|
May 15, 2024
|
Questions about Mistral and apply_chat_template with Text Generation Inference, openai API and messages API
|
|
0
|
175
|
May 15, 2024
|
Question regarding adding a 4080 (and 3080?) to a 4090 rig for AI
|
|
2
|
488
|
May 15, 2024
|
Getting NaN while fine-tuning BLIP-2 and weird output
|
|
0
|
148
|
May 14, 2024
|
Push model to Hugging Face Hub without Trainer
|
|
7
|
1416
|
May 14, 2024
|
Implement few-shot inference for question-answering with DistilBERT
|
|
0
|
156
|
May 13, 2024
|
Negative KL-divergence RLHF implementation
|
|
1
|
1595
|
May 13, 2024
|
Llama 2 tools instruction weird response
|
|
2
|
163
|
May 8, 2024
|
Adding another head to Vision encoder decoder model
|
|
4
|
343
|
May 7, 2024
|
Quantize a Model before loading it for pre-training?
|
|
0
|
134
|
May 7, 2024
|
GPT-2 Data Preparation for Parsing Trees
|
|
0
|
124
|
May 6, 2024
|
Tokenizer from a GGUF file in Python?
|
|
1
|
844
|
May 6, 2024
|
Train model for Question Answering
|
|
3
|
320
|
May 6, 2024
|
Stochastic Sampling with Trainer.evaluate() Logits
|
|
3
|
328
|
May 6, 2024
|
Trouble loading checkpoint shards for microsoft/Phi-3-mini-4k-instruct
|
|
1
|
1097
|
May 5, 2024
|
CUDA OOM on model(inputs) but not on model.generate(inputs), but doesn't generate use model(inputs)?
|
|
4
|
256
|
May 4, 2024
|
Giving attention mask to ppo_trainer
|
|
0
|
241
|
May 4, 2024
|
Less Trainable Parameters after quantization
|
|
14
|
4474
|
May 2, 2024
|
Fine-tuning RoBERTa: got an unexpected keyword argument 'labels'
|
|
2
|
1049
|
May 1, 2024
|
Fine-tuning with Different Model Heads
|
|
4
|
811
|
April 30, 2024
|
What are the limits on saving private models and datasets on the hub?
|
|
4
|
1539
|
April 29, 2024
|
Finetuning T5 for Summarisation - Poor results
|
|
1
|
545
|
April 28, 2024
|