🤗Transformers

Topic	Replies	Views	Activity
How could I define a LogitsProcessorList with multi parameters? 🤗Transformers	0	91	May 10, 2024
Argmax of Generation Probabilities doesn't match with Generated Sequence Tokens 🤗Transformers	2	953	May 10, 2024
How could I fusion the logits from different models and then convert it to Token? 🤗Transformers	0	102	May 10, 2024
Finetune_rag.py won't save checkpoints 🤗Transformers	0	120	May 9, 2024
CLIP: The `backend_tokenizer` provided does not match the expected format 🤗Transformers	3	257	May 9, 2024
What does the `use_cache` in `generate` actually do? 🤗Transformers	1	2443	May 9, 2024
AWD-LSTM beats finetuned BERT as train ds decreases?! :person_shrugging:t4: 🤗Transformers	2	127	May 9, 2024
How to count how many forward passes were done in model.generate when using assistant_model 🤗Transformers	0	86	May 9, 2024
How to pass multiple datasets into Trainer for Knowledge distillation in NMT 🤗Transformers	3	335	May 9, 2024
Trainer doesn't show the loss at each step 🤗Transformers	20	35734	May 9, 2024
Lazy model initialization 🤗Transformers	3	988	May 8, 2024
Getting zero gradients for image patch embeddings when implementing GRADCAM for ViLT 🤗Transformers	0	94	May 8, 2024
Input to reshape is a tensor with 3763200 values, but the requested shape requires a multiple of 20384 🤗Transformers	0	87	May 8, 2024
Having multiple candidate labels in a zero shot classification model 🤗Transformers	3	602	May 8, 2024
Why eval_accumulation_steps takes so much memory 🤗Transformers	5	1628	May 8, 2024
Add metrics to object detection example 🤗Transformers	12	3946	May 8, 2024
Runtime error: NotImplementedError: Cannot copy out of meta tensor; no data! 🤗Transformers	0	2137	May 7, 2024
Llama-2 significantly slower than other models on huggingface 🤗Transformers	2	979	May 7, 2024
Retraining the SAM model on the color image database in order to segment multiple classes in the image‏ 🤗Transformers	0	363	May 7, 2024
Cuda Out of Memory when fine tuning llm model 🤗Transformers	3	1188	May 7, 2024
Lower Memory Usage for TF GPT-J 🤗Transformers	1	810	May 7, 2024
How to stream responses from AutoModelforCausalLM? 🤗Transformers	0	468	May 7, 2024
Fine tuning T5 Encoder and T5 Decoder separately 🤗Transformers	1	754	May 6, 2024
AttributeError: module 'torch' has no attribute 'chalf' 🤗Transformers	8	1063	May 6, 2024
Why activations memory is computed through an experiment rather formulating it for DeepSpeed autotuner DeepSpeed	0	81	May 6, 2024
Issues with Downloading Llama2 in Jupyter Notebook 🤗Transformers	1	560	May 5, 2024
Good Arabic embeddings are needed 🤗Transformers	0	131	May 4, 2024
# [ImportError: `llama-index-readers-file` package not found ] 🤗Transformers	0	272	May 4, 2024
ImportError: `llama-index-readers-file` package not found 🤗Transformers	0	182	May 4, 2024
How to use hugging face transformers for testing a dataset 🤗Transformers	1	273	May 4, 2024