Language model to search an answer in a huge collection of (unrelated) paragraphs | 1 | 100 | November 27, 2020
Meta Persona an abstract adaptive neural construct | 0 | 48 | November 25, 2020
Adding learnable coefficients for multi-objective losses? | 2 | 72 | November 25, 2020
Inference on constrained devices | 0 | 38 | November 21, 2020
Is there an easy way to apply layer-wise decaying learning rate in huggingface trainer for RobertaMaskedForLM? | 2 | 95 | November 16, 2020
Pre-Train BERT (from scratch) | 39 | 1672 | November 16, 2020
What are some popular datasets for domain adaptation in NLP | 1 | 85 | November 12, 2020
Carrying Gradients Through Generate | 4 | 216 | November 2, 2020
Adding features to a pretrained language model | 3 | 548 | October 28, 2020
Bart-base rouge scores | 11 | 310 | October 27, 2020
Load/save HF block sparse model | 1 | 62 | October 21, 2020
Resume Training / Finetune a language model and further finetune a classifier | 1 | 98 | October 19, 2020
Hyperparameter for distil bert | 0 | 76 | October 19, 2020
ELECTRA training reimplementation and discussion | 9 | 1995 | October 18, 2020
GPT2 for QA Pair Generation | 8 | 284 | October 13, 2020
Transformer for Abstractive Summarization for Chats Based on Performance | 3 | 398 | October 9, 2020
Evaluation metrics for BERT-like LMs | 3 | 160 | October 5, 2020
Obtaining BERT-base from BERT-large | 3 | 83 | October 2, 2020
How I fine-tune BART for summarization using large texts? | 3 | 388 | September 28, 2020
New seq2seq tool: search hparam space with run_eval.py | 5 | 87 | September 17, 2020
Not all BLEU scores were created equal | 0 | 74 | September 15, 2020
How to use T5 for sentence embedding? | 4 | 358 | September 12, 2020
Finetuning for fp16 compatibility | 0 | 94 | September 3, 2020
Bertology-like Analysis for BART, T5? | 0 | 153 | August 31, 2020
[Help needed] Extending Trainer for Meta learning | 2 | 225 | August 19, 2020
BART question, it seems that pretraining is not work for a small model? | 6 | 182 | August 3, 2020
Why are segment and position embeddings so large? | 2 | 322 | August 2, 2020
Understanding what went wrong in attention | 5 | 271 | July 31, 2020
ACL 2020 highlights – Joe | 3 | 787 | July 30, 2020
Finetuning German BERT for QA on biomedical domain | 1 | 141 | July 28, 2020