🤗Transformers

Topic	Category	Replies	Views	Activity
How to use the generation_utils.generate?	🤗Transformers	0	283	April 28, 2022
Constantly running out of memory fine-tuning Wav2Vec2	DeepSpeed	1	977	April 28, 2022
Use GPT2LMHeadModel to start a new sentence	🤗Transformers	0	311	April 28, 2022
How to get a fixed size embedding from the last hidden state of vision models?	🤗Transformers	0	806	April 28, 2022
How can I load specific checkpoint of trained model	🤗Transformers	0	615	April 28, 2022
TypeError: cross_entropy_loss(): argument 'input' (position 1) must be Tensor, not SequenceClassifierOutput	🤗Transformers	2	6917	April 26, 2022
Optimum Pruning and Quantization Current Limitation	🤗Transformers	4	989	April 26, 2022
Use two sentences as inputs for sentence classification	🤗Transformers	7	20336	April 21, 2022
Difference between accelerate/torch_distributed/deepspeed	DeepSpeed	0	1418	April 25, 2022
TF transformers model inputs and outputs showing none?	🤗Transformers	1	1148	April 25, 2022
Sequence masking	🤗Transformers	0	382	April 25, 2022
How to get predictions after evaluation phase in a callback	🤗Transformers	1	1145	April 25, 2022
Multi gpu training	🤗Transformers	3	6029	April 24, 2022
(Solved) Model esm-1b is not defined	🤗Transformers	0	1436	April 23, 2022
Anyone have idea how we can finetune a model using Trainer API?	🤗Transformers	0	448	April 22, 2022
Pre-train BERT with HF Trainer	🤗Transformers	0	741	April 22, 2022
Batch size in trainer eval loop	DeepSpeed	3	4581	April 22, 2022
No skipping steps after loading from checkpoint	🤗Transformers	16	7585	April 21, 2022
TextDataset can't set max_seq_length?	🤗Transformers	2	1856	April 21, 2022
Can Processors/FeatureExtractors be used within custom DataCollators or DataLoaders?	🤗Transformers	0	388	April 21, 2022
Error on Evaluating GPT2LMHeadModel after training from scratch	🤗Transformers	0	294	April 20, 2022
Different results between pipeline and model() with multiple inputs	🤗Transformers	0	549	April 20, 2022
Code review: compute_metrics for WER with Wav2Vec2ProcessorWithLM	🤗Transformers	4	1041	April 19, 2022
Model training without downloading data on a local storage	🤗Transformers	0	431	April 19, 2022
Wandb for Huggingface Trainer saves only first model	🤗Transformers	0	445	April 19, 2022
How to run inference for T5 tensorrt model deployed on nvidia triton?	🤗Transformers	0	1269	April 19, 2022
Possibly incorrect sequence length warning for sequences greater than model_max_length	🤗Transformers	0	1393	April 18, 2022
Inference/prediction ValueError using BART	🤗Transformers	0	312	April 17, 2022
How to use label smoothing for single label classification in hugging face models	🤗Transformers	1	3643	April 16, 2022
GPT2 Training History?	🤗Transformers	0	419	April 15, 2022