🤗Transformers

Topic	Replies	Views	Activity
Hyper params search for model config 🤗Transformers	0	175	February 22, 2024
QLoRA with CPU offload 🤗Transformers	1	951	February 22, 2024
Fine-tuning throws "index out of range in self" 🤗Transformers	6	10332	February 21, 2024
Same checkpoint produces different output 🤗Transformers	0	148	February 20, 2024
Llama-2 Sequence Classification: Much lower accuracy on inference from checkpoint compared to model 🤗Transformers	5	5973	February 20, 2024
It says that `bfloat16.enabled` without `auto` needs to be specified when training T5; is anyone aware of how to solve that? DeepSpeed	0	257	February 20, 2024
Adding categorical and numerical data to Bert model 🤗Transformers	0	1004	February 20, 2024
Gradually increasing CPU load on using sentence embeddings model with kmeans 🤗Transformers	0	537	February 20, 2024
Cannot see training accuracy, only validation accuracy 🤗Transformers	2	1305	February 20, 2024
Hallucination with trainer.evaluate() on LLMs 🤗Transformers	1	679	February 19, 2024
Running ASR inference pipeline on multiple GPUs 🤗Transformers	0	137	February 19, 2024
Generate() returns full prompt plus answer 🤗Transformers	1	6320	February 19, 2024
Pipelines without a tokenizer 🤗Transformers	1	643	February 19, 2024
Token level representations 🤗Transformers	0	190	February 17, 2024
Repetition_penalty not working? 🤗Transformers	1	209	February 18, 2024
How to set stopping criteria in model.generate() when a certain word appears 🤗Transformers	3	3789	February 18, 2024
Fine tuning using LOFTQ - CUDA out of memory error 🤗Transformers	4	382	February 18, 2024
Which hidden states have the highest score in beam search? 🤗Transformers	0	106	February 18, 2024
Any model's size is huge when saved as opposed to downloading from hub pretrained 🤗Transformers	3	380	February 17, 2024
Decoder_start_token_id per sample or per batch during training 🤗Transformers	0	231	February 16, 2024
Some Roberta weights are not initializing from the checkpoint 🤗Transformers	0	791	February 16, 2024
From Transformers version v4.12.0 onwards, the example Colab BERT2BERT is wrong. (Things to keep in mind when using from transformers import EncoderDecoderModel) 🤗Transformers	0	272	February 16, 2024
How to force bos_token_id for each example individually in MBart? 🤗Transformers	3	1216	February 16, 2024
What on earth is point_batch_size for the transformers SamModel? 🤗Transformers	0	321	February 15, 2024
T5-xxl mlm distributed training? 🤗Transformers	1	372	February 15, 2024
Finetune LLaMA2 model with datasets missing labels 🤗Transformers	0	373	February 15, 2024
Trainer not starting on multi-GPU setting 🤗Transformers	1	1076	February 15, 2024
Difference between CausalLMWithValueHead vs ModelForCausalLM 🤗Transformers	2	3461	February 15, 2024
Using Owl ViT Embeddings with cosine similarity 🤗Transformers	1	566	February 15, 2024
Trainer freezes after all steps are complete (multi-gpu setting) 🤗Transformers	4	1589	February 14, 2024