Why is there no pooler representation for XLNet or a consistent use of sequence_summary()?
|
|
16
|
2210
|
December 7, 2020
|
Productionalize the model
|
|
0
|
353
|
December 4, 2020
|
Zero shot learning
|
|
0
|
301
|
December 4, 2020
|
Using summarization models for paraphrasing
|
|
0
|
400
|
December 4, 2020
|
Seq2Seq fnetuning wandb issue
|
|
1
|
877
|
December 3, 2020
|
Should Google Colab Be Updated With New Model Uploading?
|
|
1
|
285
|
December 3, 2020
|
BART for sequence classification
|
|
1
|
319
|
December 3, 2020
|
Are BERT models pretrained with Whole Word Masking?
|
|
1
|
453
|
November 30, 2020
|
Naming inconsistency in Distilbert config
|
|
1
|
504
|
November 30, 2020
|
Additional features as input to TFBert?
|
|
1
|
368
|
November 30, 2020
|
Some unintended things happen in Seq2SeqTrainer example
|
|
3
|
1585
|
November 30, 2020
|
Arguments in encode_plus
|
|
1
|
269
|
November 29, 2020
|
Improving performance results for BERT
|
|
2
|
973
|
November 25, 2020
|
[Help] GPU with query answering
|
|
0
|
330
|
November 25, 2020
|
T5 for Named Entity Recognition
|
|
2
|
6351
|
November 24, 2020
|
Accuracy changes dramatically
|
|
0
|
564
|
November 23, 2020
|
Token classification probability and scoring
|
|
0
|
755
|
November 23, 2020
|
How to train TFT5ForConditionalGeneration model?
|
|
5
|
3340
|
November 21, 2020
|
How to create the warmup and decay from the BERT/Roberta papers?
|
|
2
|
7470
|
November 18, 2020
|
Initializing the weights of the final layer of e.g. BertForTokenClassification with a manual seed
|
|
2
|
7990
|
October 6, 2020
|
Convert mT5 to HF weights?
|
|
6
|
994
|
November 17, 2020
|
mBART finetuning tips/post-mortem
|
|
6
|
2652
|
November 17, 2020
|
Abbreviation expansions
|
|
0
|
749
|
November 17, 2020
|
Evaluation metrics
|
|
1
|
2013
|
November 16, 2020
|
Learning rate setting
|
|
1
|
2045
|
November 16, 2020
|
New Model sharing and uploading is extremely slow
|
|
2
|
3564
|
November 16, 2020
|
GPT2 with TensorFlow?
|
|
1
|
372
|
November 14, 2020
|
Distributed Training on Databricks
|
|
0
|
901
|
November 14, 2020
|
Custom DistilBertTokenizer training
|
|
3
|
659
|
November 13, 2020
|
DPR retriever module
|
|
1
|
838
|
November 6, 2020
|