🤗Transformers

Topic	Replies	Views	Activity
MPS Tensor to float64 dtype as the MPS framework doesn't support float64 Error 🤗Transformers	0	1746	June 11, 2023
How to use the output of first several layers as the input of the last few layers in Bert/DistillBert 🤗Transformers	0	823	June 10, 2023
How to solve ValueError: expected sequence of length 15 at dim 1 (got 18) error in python 🤗Transformers	3	7940	June 10, 2023
Adding Custom label names for BART training through Trainer Function 🤗Transformers	1	335	June 10, 2023
[BART]Why am I generating out-of-label classification results 🤗Transformers	0	257	June 10, 2023
Runtime Error - Failed to import transformers.models.bart.modeling_tf_bart 🤗Transformers	0	1376	June 9, 2023
Loss values change but accuracy, f1 and recall remain the same 🤗Transformers	0	635	June 9, 2023
How do I use a fine-tuned Trainer model for inference correctly? 🤗Transformers	0	988	June 9, 2023
Loss exploding/increasing in pretraining 🤗Transformers	1	1121	June 8, 2023
Is this possible to export MMS to TorchScript? 🤗Transformers	0	164	June 8, 2023
Boost inference speed of T5 models up to 5X & reduce the model size by 3X 🤗Transformers	2	5627	June 8, 2023
Any suggested model to perform semantic linking? 🤗Transformers	1	340	June 8, 2023
Len(trainer.model.state_dict().keys()) reduced after calling trainer.train() 🤗Transformers	0	275	June 8, 2023
How to change dropout in pre trained model for fine tunning gpt 🤗Transformers	0	896	June 7, 2023
Streaming token output from models like T5 🤗Transformers	7	12237	June 7, 2023
Decoding Modified Sentence Embeddings 🤗Transformers	0	2323	June 7, 2023
Model checkpoints on a worker node in multi-node training 🤗Transformers	0	739	June 7, 2023
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 256.00 MiB (GPU 0; 39.56 GiB total capacity; 37.84 GiB already allocated; 242.56 MiB free; 37.96 GiB reserved in total by PyTorch) 🤗Transformers	2	5369	June 7, 2023
Using PyTorch model in TensorFlow 🤗Transformers	2	2302	June 7, 2023
Preventing every dropout in the GPT2DoubleHeadsModel 🤗Transformers	4	1390	June 7, 2023
Loading a trained model gives an error that weights are randomly initialized 🤗Transformers	0	474	June 6, 2023
How to using LION optimizer? 🤗Transformers	0	806	June 6, 2023
Use sentence transformers with different embeddings size 🤗Transformers	0	293	June 6, 2023
Fine Tuning Git Model for Malayalam Image Captioning 🤗Transformers	0	508	June 6, 2023
Error on pipeline with docquery (Transformers) 🤗Transformers	2	2031	June 5, 2023
[Help appreciated] GPT2 Finetuning results in Only Padding output 🤗Transformers	2	1620	June 5, 2023
Is there any way to create or adjust a conversational model using Transformers? 🤗Transformers	0	160	June 5, 2023
How to `push_to_hub` after training but preserve training logs? 🤗Transformers	0	254	June 5, 2023
Swapping GPT-2 Attention with Flash Attention 🤗Transformers	3	3026	June 4, 2023
How to use the wav2vec2-large-TIMIT-IPA2 model? 🤗Transformers	0	285	June 4, 2023