| Topic | Replies | Views | Activity |
|---|---|---|---|
| How does DDP + huggingface Trainer handle input data? | 3 | 398 | May 18, 2023 |
| Timm & HuggingFace | 0 | 65 | May 16, 2023 |
| Sampling strategies | 1 | 106 | April 4, 2023 |
| A specific documents AI API for Hugging Face? | 0 | 70 | May 12, 2023 |
| How to fine tune BertForSequenceClassification with PEFT? | 0 | 107 | May 10, 2023 |
| Learning sets and disabling positional embedding knowledge? | 0 | 65 | May 10, 2023 |
| Using alpaca with local embedding | 0 | 215 | May 10, 2023 |
| Is there any way to avoid CPU bottlenecks when doing single prompt inference? | 0 | 116 | May 10, 2023 |
| Baffling performance issue on most NVidia GPUs with simple transformers + pytorch code | 2 | 237 | May 9, 2023 |
| Finetuned MT5 model generating the same first token for any input | 0 | 73 | May 9, 2023 |
| Import HuggingFace PatentSBERTa Model support in EMR and PySpark | 0 | 61 | May 8, 2023 |
| Machine Translation using Hugging Face problem | 0 | 139 | May 8, 2023 |
| How to run an end to end example of distributed data parallel with hugging face's trainer api (ideally on a single node multiple gpus)? | 15 | 4445 | May 8, 2023 |
| Using Transformers with DistributedDataParallel — any examples? | 11 | 6496 | May 8, 2023 |
| Fine-tuning with LoRA; can't learn | 0 | 81 | May 7, 2023 |
| TypeError: Repository.__init__() got an unexpected keyword argument 'token' | 4 | 2505 | May 7, 2023 |
| Accessing model from a callback to predict between epochs | 0 | 95 | May 5, 2023 |
| Implementing one prompt recommender | 0 | 60 | May 3, 2023 |
| Train Roberta from scratch for custom dataset | 1 | 656 | May 2, 2023 |
| Typical sampling decoding technique | 1 | 925 | April 28, 2023 |
| ValueError: Unable to generate dummy inputs for the model. Please provide a tokenizer or a preprocessor | 0 | 80 | April 28, 2023 |
| Plotting separate loss curves for different datasets | 0 | 78 | April 28, 2023 |
| Training large language models to consider two texts to generate output text | 0 | 67 | April 26, 2023 |
| Generation is always CPU limited | 0 | 102 | April 21, 2023 |
| Repeatedly decoding tokens multiple times after PEFT fine-tuning whisper | 1 | 100 | April 21, 2023 |
| ValueError: You should supply an encoding or a list of encodings to this method that includes input_ids, but you provided ['tokens', 'id', 'space_after', 'ner_tags', 'ner_ids'] | 2 | 262 | April 21, 2023 |
| Is it possible to add simple custom pytorch-crf layer on top of TokenClassification model. It will make the model more robust | 4 | 1396 | April 19, 2023 |
| BPEDecoder no spaces after special tokens | 4 | 832 | April 19, 2023 |
| Converting AlignTTS (text-to-speech) model to ONNX | 0 | 89 | April 18, 2023 |
| How to get sentence embedding using a fine-tuned model | 0 | 80 | April 18, 2023 |