Resume Training / Finetune a language model and further finetune a classifier
|
|
1
|
1267
|
October 19, 2020
|
Hyperparameter for distil bert
|
|
0
|
670
|
October 19, 2020
|
Transformer for Abstractive Summarization for Chats Based on Performance
|
|
3
|
1952
|
October 9, 2020
|
Obtaining BERT-base from BERT-large
|
|
3
|
460
|
October 2, 2020
|
How I fine-tune BART for summarization using large texts?
|
|
3
|
3993
|
September 28, 2020
|
New seq2seq tool: search hparam space with run_eval.py
|
|
5
|
347
|
September 17, 2020
|
Not all BLEU scores were created equal
|
|
0
|
315
|
September 15, 2020
|
Bertology-like Analysis for BART, T5?
|
|
0
|
669
|
August 31, 2020
|
BART question, it seems that pretraining is not work for a small model?
|
|
6
|
564
|
August 3, 2020
|
Why are segment and position embeddings so large?
|
|
2
|
1547
|
August 2, 2020
|
Understanding what went wrong in attention
|
|
5
|
1653
|
July 31, 2020
|
ACL 2020 highlights – Joe
|
|
3
|
1596
|
July 30, 2020
|
Debiasing models by HEX projection
|
|
1
|
523
|
July 28, 2020
|
What does it mean to prime a GPT model?
|
|
5
|
4175
|
July 27, 2020
|
Attaching TF models to CNN features
|
|
1
|
445
|
July 24, 2020
|
Is it reasonableto pretrain by masking certain dimensions of each vector, rather than the individual token?
|
|
3
|
462
|
July 21, 2020
|
Print All Tokens Over a Certain Probability Threshold
|
|
3
|
1116
|
July 21, 2020
|
Building a custom Squad 2.0 style dataset, is it worth it?
|
|
3
|
1000
|
July 20, 2020
|
State of the art technique for initializing Embedding Matrix?
|
|
3
|
5052
|
July 19, 2020
|
Modern NLP for "Economics of Innovation" (Open Research Project using Patent Data)
|
|
4
|
758
|
July 14, 2020
|
ACL 2020 - Some personal highlights - Victor
|
|
4
|
1367
|
July 14, 2020
|
ICLR 2020 highlights - Yacine
|
|
1
|
1748
|
July 11, 2020
|
About the Research category
|
|
2
|
454
|
July 11, 2020
|
ACL 2020 highlights – Canwen
|
|
1
|
915
|
July 10, 2020
|
ACL 2020 highlights - Yacine
|
|
0
|
1406
|
July 10, 2020
|
Paper Discussion: Weight Poisoning Attacks on Pre-trained Models
|
|
0
|
1029
|
July 8, 2020
|