Mistral trouble when fine-tuning : Don't set pad_token_id = eos_token_id
|
|
8
|
5360
|
August 28, 2024
|
Space won't start - logs not found
|
|
19
|
2318
|
August 27, 2024
|
Always 【initializing】 until time out without any error log
|
|
3
|
39
|
August 27, 2024
|
Text generation using SetFit
|
|
1
|
885
|
August 27, 2024
|
I'm doing Yolov8 model training but the accuracy rate is 70%
|
|
0
|
40
|
August 27, 2024
|
How may i upload my json version of the King James version of the Holy Bible?
|
|
1
|
543
|
August 27, 2024
|
FutureWarning close
|
|
4
|
2253
|
August 27, 2024
|
How do I use a trained LORA, unmerged?
|
|
0
|
19
|
August 27, 2024
|
LlamaIndex for PDF parsing
|
|
2
|
2108
|
August 27, 2024
|
Converting bert_large_uncased_whole_word_masking to onnx
|
|
0
|
15
|
August 27, 2024
|
How to choose dataset_text_field in SFTTrainer hugging face for my LLM model
|
|
1
|
451
|
August 27, 2024
|
What is the implement for text2vid VAE encoder in diffusers?
|
|
0
|
10
|
August 27, 2024
|
Commit Message Generation Model
|
|
0
|
35
|
August 27, 2024
|
Clarification on Classification Token
|
|
0
|
16
|
August 27, 2024
|
Advice on an email classification problem
|
|
3
|
372
|
August 27, 2024
|
ValueError: You need to specify either `text` or `text_target` when using evaluator
|
|
1
|
3628
|
August 27, 2024
|
Type error in StableDiffusionImg2ImgPipeline
|
|
0
|
102
|
August 26, 2024
|
How to use SegFormer encoder and decoder?
|
|
2
|
30
|
August 26, 2024
|
Account Recovery Request
|
|
3
|
36
|
August 27, 2024
|
Does a PRO subscription add Memory to HF Spaces?
|
|
3
|
74
|
August 26, 2024
|
Convert slow XLMRobertaTokenizer to fast one
|
|
3
|
1164
|
August 26, 2024
|
How to Fine-Tune mBART or mT5 for Transliteration from Romanized Text to Native Script?
|
|
0
|
23
|
August 26, 2024
|
How to prepare ckpt.pth model checkpoint file as .bin or safetensors?
|
|
0
|
286
|
August 26, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)
|
|
5
|
3405
|
August 26, 2024
|
How to convert hf model to optimized model with kv-caching
|
|
0
|
71
|
August 26, 2024
|
[SOLVED] What's the right way to do GPU paralellism for inference (not training) on AutoModelForCausalLM?
|
|
1
|
201
|
August 26, 2024
|
Inference workflow in compile mode using transformers.pipeline()
|
|
0
|
29
|
August 26, 2024
|
Incorrect logits shape for GIT model
|
|
2
|
17
|
August 26, 2024
|
Fine-tune a 7B parameter LLM efficiently and affordably?
|
|
2
|
637
|
August 26, 2024
|
Phi-3-mini-128k-instruct not working with pro inference api
|
|
14
|
2223
|
August 26, 2024
|