🤗Transformers

Topic	Replies	Views	Activity
T5-small trained with small dataset not inferring anything 🤗Transformers	0	212	April 25, 2023
T5 for classification task 🤗Transformers	0	489	April 25, 2023
RTX 6000 Ada slower than 3090 🤗Transformers	0	614	April 25, 2023
Question about using trainer with DeepSpeed 🤗Transformers	0	465	April 25, 2023
Issues in finetuning t5-large model 🤗Transformers	1	462	April 25, 2023
How to use FSDP + DDP in Trainer 🤗Transformers	1	1019	April 24, 2023
Is detokenize available in transformer lib? 🤗Transformers	2	2806	April 24, 2023
Tied weights for encoder and decoder vocab matrix hard coded in T5? 🤗Transformers	0	901	April 24, 2023
About fill-mask pipeline with [mask] made up of multiple tokens 🤗Transformers	0	325	April 24, 2023
Generation using contrastive search 🤗Transformers	0	179	April 24, 2023
Trade off between max_length vs loss 🤗Transformers	0	198	April 23, 2023
Mt5 fine-tuning using fp16 yields zero loss 🤗Transformers	1	643	April 23, 2023
Issues loading NLLB 54B MoE model for multi-GPU inferencing using accelerate 🤗Transformers	0	902	April 22, 2023
Support for ASR inference on longer audio files or on live transcription? 🤗Transformers	2	481	April 21, 2023
Whisper on long audio files -- support for chunking? 🤗Transformers	3	5810	April 21, 2023
What happens when loading shards? 🤗Transformers	0	2521	April 21, 2023
How can I introspect the input and output keys for an arbitrary model? 🤗Transformers	1	439	April 21, 2023
Question about Bloom pretrain 🤗Transformers	0	166	April 21, 2023
Is it true that DeepSpeed currently does not support regression tasks and only supports softmax-based classification tasks? DeepSpeed	0	275	April 21, 2023
How to prevent redownloading in from_pretrained caused by hash? 🤗Transformers	0	583	April 21, 2023
How does Segformer handle image size differences? 🤗Transformers	5	4128	April 20, 2023
Does anyone else observe RoBERTa fine-tuning instability? 🤗Transformers	8	3140	April 20, 2023
Image classification tutorial bug 🤗Transformers	0	216	April 20, 2023
LayoutLMv3 Onnx Conversion 🤗Transformers	1	815	April 20, 2023
Trying to understand the task-specific head for diff. models + Transformers AutoModel 🤗Transformers	0	436	April 20, 2023
Fusion-in-Decoder models 🤗Transformers	3	2984	April 20, 2023
How to get the score of the response of the model? 🤗Transformers	0	201	April 19, 2023
Same model GPT-NEO-XT behaves differently with same prompts & different context 🤗Transformers	0	277	April 19, 2023
Force word embeddings for a specific language with facebook/m2m100_418M 🤗Transformers	0	213	April 19, 2023
Type hinting inconsistency in beam_search.py 🤗Transformers	0	189	April 19, 2023