How to use transformer attention model when the input is features
|
|
1
|
1218
|
October 12, 2020
|
What is the license of /nlptown/bert-base-multilingual-uncased-sentiment?
|
|
3
|
508
|
October 12, 2020
|
PYTORCH-TRANSFORMERS vs Transformers
|
|
2
|
2070
|
October 12, 2020
|
How to best deal with numbers?
|
|
2
|
1402
|
October 12, 2020
|
Extend load_from_disk and save_to_disk to remote storage
|
|
3
|
522
|
October 12, 2020
|
Dataset for fake news detection, fine tune or pre-train
|
|
7
|
1720
|
October 12, 2020
|
What does `tokenizers.normalizer.normalize` do?
|
|
5
|
3463
|
October 12, 2020
|
RAG Example and Word-Level contributions
|
|
4
|
1914
|
October 12, 2020
|
Strange error when using the Longformer (HuggingFace developers, please reply)
|
|
8
|
1796
|
October 12, 2020
|
RuntimeError: Error in void faiss::gpu::allocMemorySpace
|
|
16
|
8382
|
October 12, 2020
|
T5 fine tuning, loss difference when using labels and decoder_input_ids
|
|
2
|
1168
|
October 12, 2020
|
Checkpoint vs model weight
|
|
2
|
4728
|
October 12, 2020
|
Pplm runtime error with finetuned model
|
|
1
|
557
|
October 12, 2020
|
What could be causing " line 51, in write_predictions_to_file if not preds_list[example_id]: IndexError: list index out of range" in token-classification?
|
|
2
|
526
|
October 13, 2020
|
Using LongformerForMultipleChoice for processing multiple-choice questions with the 4 options
|
|
1
|
659
|
October 13, 2020
|
Bug: Share my uploaded models publicly by default
|
|
0
|
541
|
October 13, 2020
|
Longformer for sequenceclassification
|
|
5
|
473
|
October 13, 2020
|
Warning occured when trying to load checkpoint to continue training
|
|
5
|
2268
|
October 13, 2020
|
Reddit data - GDPR
|
|
0
|
547
|
October 13, 2020
|
Training DistilGPT2
|
|
4
|
2384
|
October 13, 2020
|
T5: ignore sentinel indices for unsupervised denoising / masking objective?
|
|
0
|
372
|
October 13, 2020
|
Customizing GenerationMixin to output attentions
|
|
4
|
1792
|
September 10, 2020
|
Finetuning Pegasus for summarization task
|
|
3
|
1045
|
October 14, 2020
|
[RFC] Transformers Pipeline v2
|
|
4
|
1850
|
October 14, 2020
|
I'm getting "nan" value for loss, while following a tutorial from the documentatin
|
|
0
|
665
|
October 14, 2020
|
Is there any way to control the input of a `Longformer` layer?
|
|
1
|
253
|
October 14, 2020
|
T5: Tips for finetuning on crossword clues (clue => answer)
|
|
1
|
625
|
October 14, 2020
|
Distillation: create student model from a different base model than teacher
|
|
9
|
2057
|
October 14, 2020
|
Getting predictions
|
|
1
|
284
|
October 15, 2020
|
Keeping some tokens untranslated
|
|
0
|
557
|
October 15, 2020
|