RAG for FEVER Dataset
|
|
0
|
419
|
February 8, 2021
|
Transfer learning to explore tasks' information requirements?
|
|
0
|
388
|
February 5, 2021
|
Model or Dataset available for classifying a grammatical sentence?
|
|
1
|
1690
|
February 3, 2021
|
Generating coherent related text with generative model (GPT2 etc.)
|
|
0
|
521
|
January 28, 2021
|
RoBERTa trained on NSP
|
|
0
|
634
|
January 12, 2021
|
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
|
|
1
|
1607
|
January 20, 2021
|
Multilingual token, phrase and sentence representations for text similarity
|
|
0
|
491
|
January 13, 2021
|
Classification problem difficulty when going from 3 classes to 5 classes?
|
|
1
|
365
|
January 11, 2021
|
Text to Text Transformer - T5
|
|
2
|
1101
|
January 4, 2021
|
Shortformer: Better Language Modeling using Shorter Inputs
|
|
0
|
467
|
December 31, 2020
|
Don't Stop Pretraining BART
|
|
1
|
906
|
December 29, 2020
|
Pre-training with Lamb optimizer
|
|
7
|
4321
|
December 28, 2020
|
About the encoder and generator used in the RAG model
|
|
2
|
862
|
December 25, 2020
|
MRPC Reproducibility with transformers-4.1.0
|
|
0
|
500
|
December 19, 2020
|
Using transformers (BERT, RoBERTa) without embedding layer
|
|
8
|
4149
|
December 16, 2020
|
What are some recommended pretrained models for extracting semantic feature on single sentence?
|
|
4
|
1501
|
December 14, 2020
|
BORT: Optimal Subarchitecture Extraction for BERT
|
|
1
|
541
|
December 5, 2020
|
Training generative models based on "rewards"
|
|
0
|
288
|
December 4, 2020
|
EMNLP Picks from the Hugging Face Science Team
|
|
1
|
4067
|
December 2, 2020
|
Meta Persona an abstract adaptive neural construct
|
|
0
|
714
|
November 25, 2020
|
Adding learnable coefficients for multi-objective losses?
|
|
2
|
760
|
November 25, 2020
|
Inference on constrained devices
|
|
0
|
295
|
November 21, 2020
|
What are some popular datasets for domain adaptation in NLP
|
|
1
|
471
|
November 12, 2020
|
Adding features to a pretrained language model
|
|
3
|
3875
|
October 28, 2020
|
Bart-base rouge scores
|
|
11
|
1730
|
October 27, 2020
|
Load/save HF block sparse model
|
|
1
|
400
|
October 21, 2020
|
Resume Training / Finetune a language model and further finetune a classifier
|
|
1
|
1267
|
October 19, 2020
|
Hyperparameter for distil bert
|
|
0
|
670
|
October 19, 2020
|
Transformer for Abstractive Summarization for Chats Based on Performance
|
|
3
|
1952
|
October 9, 2020
|
Obtaining BERT-base from BERT-large
|
|
3
|
460
|
October 2, 2020
|