Intermediate

Topic	Replies	Views	Activity
Fine tuning facebook/bart-large-mnli zeroshot classifier	2	910	June 30, 2023
Reformer - attention data format	1	400	June 29, 2023
Instruction Fine-Tuning StarCoder Model	0	620	June 28, 2023
Stucked on "Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding."	0	2072	June 27, 2023
falcon-40B inference on older version of torch	0	229	June 27, 2023
Why evaluation in multiple nodes for distributed mode	0	172	June 25, 2023
Finetuning for feature-extraction? I.e. unsupervised fine tuning?	10	5573	June 25, 2023
API: Quota exceeded for machine error	0	1389	June 22, 2023
Best practice for finetune LLM	0	653	June 21, 2023
Back Translation Using T5	0	329	June 21, 2023
How to create the fsdp_config json file for Trainer?	4	2968	June 19, 2023
Loading extra memory in GPU 0 using DDP	0	388	June 18, 2023
Prompt loss weight instead of masking in generative models	1	2213	June 18, 2023
iIlegal text classification	0	164	June 15, 2023
Pythia Tuning Question	0	300	June 14, 2023
Save LORA weights only in intermediate checkpoints	0	1833	June 14, 2023
Seq2SeqTrainer Error	0	466	June 12, 2023
Is there any way to avoid CPU bottlenecks when doing single prompt inference?	1	985	June 12, 2023
Trainer for MT with source and target tokenizers	0	211	June 9, 2023
How to get intermediate features from HF pretrained model?	0	271	June 7, 2023
Finetuning Segment Anything and automatic prediction	2	5829	June 7, 2023
Xlm-roberta-base predicting always same class, other models don't	2	1110	June 7, 2023
Error while Fine tuning Zero shot classification model fb-bart-large-mnli	0	526	June 6, 2023
How to restrict T5 model to generate tokens only from the input text?	0	424	June 6, 2023
Training "don't know" and "don't understand" responses	0	212	May 31, 2023
Is it possible to use BART model for question answering purpose which responses like a human like conversation	0	290	May 31, 2023
SportsBot Training Data and Modal	1	346	May 30, 2023
Trainer fails to resume training from a checkpoint, claiming there's not enough samples in the dataset	1	1641	May 29, 2023
Alibi and Extrapolation	0	450	May 29, 2023
Loading adapter merged models	0	474	May 29, 2023