| Continue from pretrained | 1 | 746 | May 21, 2023 |
| Retrain T5 using unsupervised learning with MLM | 0 | 251 | May 21, 2023 |
| Generation Config for ByT5 | 0 | 788 | May 20, 2023 |
| Transformer for numeric dataset | 0 | 662 | May 20, 2023 |
| How to **properly** prompt the decoder? | 0 | 833 | May 20, 2023 |
| Fine-tune GPT2 for translation conditioned on a dictionary of medical terms? | 0 | 248 | May 18, 2023 |
| Question on HuggingFace's T5 documentation | 0 | 321 | May 18, 2023 |
| LM example run_clm.py isn't distributing data across multiple GPUs as expected | 10 | 2712 | May 17, 2023 |
| Converting SWIN Transformers from Pytorch through ONNX or Others | 0 | 490 | May 16, 2023 |
| Defining custom compute_metrics for multiclass classification | 0 | 963 | May 16, 2023 |
| IndexError: index -1 is out of bounds for dimension 1 with size 0 | 0 | 965 | May 16, 2023 |
| Augmenting a token classifier class to concatenate metadata to the hidden representation of the text, before the output classification layer | 2 | 831 | May 16, 2023 |
| How to use additional input features for Abstractive summarization? Seq2Seq | 0 | 254 | May 15, 2023 |
| Transformers Agent | 2 | 1016 | May 15, 2023 |
| How to Create one Process But Using Multi GPU? | 0 | 722 | May 15, 2023 |
| Transformers Agent Chat State | 0 | 179 | May 15, 2023 |
| Warning when loading T5 encoders | 3 | 2009 | May 15, 2023 |
| Using a custom GenerationMixin with T5ForConditionalGeneration | 0 | 259 | May 14, 2023 |
| Large Text regeneration with Hugging Face and python | 0 | 495 | May 14, 2023 |
| Slow training time in current version | 0 | 262 | May 14, 2023 |
| Is it possible to use HfAgent Transformers for Agents used in NodeJS? | 0 | 269 | May 13, 2023 |
| DeepSpeed config file not found | 0 | 605 | May 13, 2023 |
| Unable to upload agent tools | 0 | 397 | May 12, 2023 |
| How to Train Model Using CPU with MultiProcess Each With Some Number of Thread? | 0 | 983 | May 12, 2023 |
| Use torch.optim.lr_scheduler.CyclicLR with Trainer | 0 | 432 | May 12, 2023 |
| How to load a ORTModelForVision2Seq model? | 0 | 384 | May 12, 2023 |
| Does model supports partial `past_key_values`? | 0 | 436 | May 12, 2023 |
| GPT-2 shift logits and labels | 5 | 5907 | May 12, 2023 |
| Seeding Data Collator | 0 | 225 | May 12, 2023 |
| Asked for "Draw me a picture of a circle" | 0 | 266 | May 11, 2023 |