Transformers Huge Community Feedback
|
|
5
|
4117
|
September 19, 2022
|
Appropriate tokenizer for particular dataset?
|
|
0
|
237
|
September 18, 2022
|
BLOOM models don't run on my GPU using Transformers
|
|
1
|
1663
|
September 18, 2022
|
Pinned model loading message
|
|
0
|
249
|
September 18, 2022
|
Hugging Face stickers?
|
|
9
|
2331
|
September 17, 2022
|
Always same response?
|
|
1
|
638
|
September 16, 2022
|
Generate constraint words within the output sentence and not at its start/end
|
|
1
|
1291
|
September 16, 2022
|
Does HF support joining multiple sentences into one sequence with <eos> when training GPT, to save compute resources?
|
|
0
|
167
|
September 16, 2022
|
Resume Training, but reset epochs
|
|
0
|
944
|
September 16, 2022
|
LayoutLMv3 transformers.onnx
|
|
1
|
1354
|
September 15, 2022
|
In trainer, how to get something from model?
|
|
0
|
211
|
September 15, 2022
|
Transformer domain Adaptation
|
|
0
|
1124
|
September 15, 2022
|
Sentence transformer poor performance after fine tuning
|
|
1
|
1615
|
September 11, 2022
|
Shape mismatching between `sequences` and `scores` in beam search generation
|
|
1
|
554
|
September 14, 2022
|
Effect of selecting both `do_sample` and `num_beams` in summarization pipeline
|
|
0
|
3148
|
September 13, 2022
|
Segmentation Fault while running example from token classification
|
|
0
|
1085
|
September 13, 2022
|
How to understand the bias term in language model head (when we tie the word embeddings)?
|
|
0
|
937
|
September 12, 2022
|
Can't load config for 'google/pegasus-pubmed'
|
|
3
|
3394
|
September 12, 2022
|
Pickle scanning
|
|
1
|
566
|
September 11, 2022
|
Word2vec is not included in tokenizer?
|
|
0
|
188
|
September 11, 2022
|
Fine-tuning BART on SQuAD
|
|
0
|
214
|
September 10, 2022
|
Issues with using DeepSpeed on multiple GPUs
|
|
2
|
2569
|
September 9, 2022
|
Not a valid JSON file
|
|
5
|
10078
|
September 9, 2022
|
Larger Sum(Logits) != Larger Sum(Probability)
|
|
2
|
434
|
September 9, 2022
|
I cannot import something from transformers
|
|
0
|
954
|
September 9, 2022
|
Why does setting `--fp16 True` not save memory as expected?
|
|
2
|
2716
|
September 9, 2022
|
How to convert the new t5x models to huggingface transformers
|
|
4
|
1750
|
September 8, 2022
|
Understanding adjusting Transformer max length
|
|
0
|
1455
|
September 8, 2022
|
Pretraining an MT5 model for summarisation
|
|
3
|
534
|
September 8, 2022
|
BERT2BERT for CNN/Dailymail example not working
|
|
0
|
229
|
September 8, 2022
|