New seq2seq tool: search hparam space with run_eval.py | 5 | 347 | September 17, 2020
Not all BLEU scores were created equal | 0 | 313 | September 15, 2020
Bertology-like Analysis for BART, T5? | 0 | 668 | August 31, 2020
BART question, it seems that pretraining does not work for a small model? | 6 | 563 | August 3, 2020
Why are segment and position embeddings so large? | 2 | 1542 | August 2, 2020
Understanding what went wrong in attention | 5 | 1639 | July 31, 2020
ACL 2020 highlights – Joe | 3 | 1592 | July 30, 2020
Debiasing models by HEX projection | 1 | 521 | July 28, 2020
What does it mean to prime a GPT model? | 5 | 4163 | July 27, 2020
Attaching TF models to CNN features | 1 | 445 | July 24, 2020
Is it reasonable to pretrain by masking certain dimensions of each vector, rather than the individual token? | 3 | 459 | July 21, 2020
Print All Tokens Over a Certain Probability Threshold | 3 | 1105 | July 21, 2020
Building a custom Squad 2.0 style dataset, is it worth it? | 3 | 996 | July 20, 2020
State of the art technique for initializing Embedding Matrix? | 3 | 4982 | July 19, 2020
Modern NLP for "Economics of Innovation" (Open Research Project using Patent Data) | 4 | 757 | July 14, 2020
ACL 2020 - Some personal highlights - Victor | 4 | 1363 | July 14, 2020
ICLR 2020 highlights - Yacine | 1 | 1745 | July 11, 2020
About the Research category | 2 | 447 | July 11, 2020
ACL 2020 highlights – Canwen | 1 | 914 | July 10, 2020
ACL 2020 highlights - Yacine | 0 | 1403 | July 10, 2020
Paper Discussion: Weight Poisoning Attacks on Pre-trained Models | 0 | 1029 | July 8, 2020