Topic	Replies	Views	Activity
New seq2seq tool: search hparam space with run_eval.py Research	5	347	September 17, 2020
Not all BLEU scores were created equal Research	0	313	September 15, 2020
Bertology-like Analysis for BART, T5? Research	0	668	August 31, 2020
BART question, it seems that pretraining is not work for a small model? Research	6	563	August 3, 2020
Why are segment and position embeddings so large? Research	2	1542	August 2, 2020
Understanding what went wrong in attention Research	5	1639	July 31, 2020
ACL 2020 highlights – Joe Research	3	1592	July 30, 2020
Debiasing models by HEX projection Research	1	521	July 28, 2020
What does it mean to prime a GPT model? Research	5	4163	July 27, 2020
Attaching TF models to CNN features Research	1	445	July 24, 2020
Is it reasonable to pretrain by masking certain dimensions of each vector, rather than the individual token? Research	3	459	July 21, 2020
Print All Tokens Over a Certain Probability Threshold Research	3	1105	July 21, 2020
Building a custom Squad 2.0 style dataset, is it worth it? Research	3	996	July 20, 2020
State of the art technique for initializing Embedding Matrix? Research	3	4982	July 19, 2020
Modern NLP for "Economics of Innovation" (Open Research Project using Patent Data) Research	4	757	July 14, 2020
ACL 2020 - Some personal highlights - Victor Research	4	1363	July 14, 2020
ICLR 2020 highlights - Yacine Awesome paper	1	1745	July 11, 2020
About the Research category Research	2	447	July 11, 2020
ACL 2020 highlights – Canwen Research	1	914	July 10, 2020
ACL 2020 highlights - Yacine Research	0	1403	July 10, 2020
Paper Discussion: Weight Poisoning Attacks on Pre-trained Models Awesome paper	0	1029	July 8, 2020
