Error with aggregation_strategy="max", TypeError: Can't convert [' In'] to PyString
|
|
0
|
449
|
April 1, 2022
|
How to run on CPU?
|
|
1
|
10318
|
April 1, 2022
|
Understanding how Ray "objective" works with the trainer
|
|
1
|
1781
|
April 1, 2022
|
Huggingface longformer memory issues
|
|
0
|
541
|
March 31, 2022
|
Is it correct to replace a class after self.post_init() function?
|
|
0
|
1067
|
March 31, 2022
|
Gradients verification between JAX/Flax models and PyTorch
|
|
2
|
1190
|
March 31, 2022
|
How to fine-tune on the CoLA dataset using Transformers and PyTorch?
|
|
0
|
310
|
March 30, 2022
|
Get sentence "B" predicted, given sentence "A" using Next Sentence Prediction model
|
|
1
|
931
|
March 30, 2022
|
Can't load tokenizer with added special tokens
|
|
0
|
832
|
March 29, 2022
|
How to make Trainer train the model one epoch at a time?
|
|
1
|
1836
|
March 29, 2022
|
Are Word Embeddings by BERT generated for long sequences better than ones generated for short sequences?
|
|
0
|
238
|
March 29, 2022
|
Training loss increases suddenly at the beginning of each epoch
|
|
1
|
1682
|
March 29, 2022
|
Learning rate and checkpoints
|
|
0
|
441
|
March 29, 2022
|
Is the trainer's seed reset at every model_init?
|
|
4
|
1258
|
March 28, 2022
|
RuntimeError: params[0] in this process with sizes [253991, 1024] appears not to match sizes of the same param in process 0
|
|
0
|
653
|
March 28, 2022
|
Huggingface transformer sequence classification
|
|
3
|
495
|
March 26, 2022
|
Controlled Text Generation
|
|
2
|
2588
|
March 26, 2022
|
Transformers replacing loss function
|
|
0
|
3385
|
March 26, 2022
|
Using weights with transformers huggingface - running on GPUs
|
|
0
|
1521
|
March 26, 2022
|
Optimum Transformers: accelerated NLP pipelines with Infinity speed
|
|
4
|
665
|
March 25, 2022
|
How to train model without shuffling data with multi-GPU
|
|
0
|
241
|
March 25, 2022
|
How to add a customized logger to Trainer?
|
|
0
|
656
|
March 25, 2022
|
Huggingface sequence classification unfreezing layers
|
|
2
|
1319
|
March 24, 2022
|
Bug: Fine-tuning XLM-RoBERTa-large on XNLI gets 0.33 accuracy while XLM-RoBERTa-base works fine
|
|
0
|
343
|
March 23, 2022
|
GPT2LMHeadModel not loading from pretrained
|
|
2
|
4808
|
March 23, 2022
|
Combine BertForSequenceClassification with Additional Features
|
|
3
|
9495
|
March 23, 2022
|
What is rust_model.ot and how can I use it?
|
|
4
|
6005
|
March 23, 2022
|
How to find the beam search score for any target output? (BartForConditionalGeneration)
|
|
0
|
1449
|
March 22, 2022
|
Decoder attention mask in text2text/seq2seq generation encoder-decoder models
|
|
1
|
1642
|
March 22, 2022
|
Flexible "Keep" threshold for DETR model
|
|
0
|
228
|
March 22, 2022
|