Transfer learning to explore tasks' information requirements? | 0 | 388 | February 5, 2021
Model or Dataset available for classifying a grammatical sentence? | 1 | 1670 | February 3, 2021
Generating coherent related text with generative model (GPT2 etc.) | 0 | 521 | January 28, 2021
RoBERTa trained on NSP | 0 | 633 | January 12, 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity | 1 | 1602 | January 20, 2021
Multilingual token, phrase and sentence representations for text similarity | 0 | 489 | January 13, 2021
Classification problem difficulty when going from 3 classes to 5 classes? | 1 | 361 | January 11, 2021
Text to Text Transformer - T5 | 2 | 1099 | January 4, 2021
Shortformer: Better Language Modeling using Shorter Inputs | 0 | 467 | December 31, 2020
Don't Stop Pretraining BART | 1 | 902 | December 29, 2020
Pre-training with Lamb optimizer | 7 | 4290 | December 28, 2020
About the encoder and generator used in the RAG model | 2 | 859 | December 25, 2020
MRPC Reproducibility with transformers-4.1.0 | 0 | 499 | December 19, 2020
Using transformers (BERT, RoBERTa) without embedding layer | 8 | 4116 | December 16, 2020
What are some recommended pretrained models for extracting semantic feature on single sentence? | 4 | 1481 | December 14, 2020
BORT: Optimal Subarchitecture Extraction for BERT | 1 | 539 | December 5, 2020
Training generative models based on "rewards" | 0 | 288 | December 4, 2020
EMNLP Picks from the Hugging Face Science Team | 1 | 4062 | December 2, 2020
Meta Persona an abstract adaptive neural construct | 0 | 712 | November 25, 2020
Adding learnable coefficients for multi-objective losses? | 2 | 757 | November 25, 2020
Inference on constrained devices | 0 | 293 | November 21, 2020
What are some popular datasets for domain adaptation in NLP | 1 | 471 | November 12, 2020
Adding features to a pretrained language model | 3 | 3875 | October 28, 2020
Bart-base rouge scores | 11 | 1727 | October 27, 2020
Load/save HF block sparse model | 1 | 397 | October 21, 2020
Resume Training / Finetune a language model and further finetune a classifier | 1 | 1260 | October 19, 2020
Hyperparameter for distil bert | 0 | 669 | October 19, 2020
Transformer for Abstractive Summarization for Chats Based on Performance | 3 | 1948 | October 9, 2020
Obtaining BERT-base from BERT-large | 3 | 452 | October 2, 2020
How I fine-tune BART for summarization using large texts? | 3 | 3968 | September 28, 2020