Research

Topic	Replies	Views	Activity
Collaborative Training Experiment Round 2 with Yandex and HuggingFace Research	0	566	September 1, 2021
Tutorial / codebase for models interacting while training? Research	0	494	August 29, 2021
10_000 samples & 10_000 labels Research	0	510	July 31, 2021
Best way to infer continuously with Transformer? Research	0	557	July 26, 2021
The (hidden) meaning behind the embedding of the padding token? Awesome paper	2	6280	July 14, 2021
Language model to search an answer in a huge collection of (unrelated) paragraphs Research	4	1511	July 6, 2021
Address extraction and formatting using Places API (Google Maps API) Research	0	1730	July 4, 2021
Finetuning for fp16 compatibility Research	2	1698	June 17, 2021
What can transformers learn without position encoding? Research	1	3144	June 10, 2021
Project Description Research	1	366	May 29, 2021
Does it make sense to generate sentences with Transformer's encoder? Research	0	379	May 22, 2021
PEGASUS model overfitting Research	2	464	May 19, 2021
Classification Heads in BERT and DistilBERT for Sequence Classification Research	2	1185	May 13, 2021
Collaborative Training Experiment of an Albert Model for Bengali Research	1	1308	May 6, 2021
Task-specific fine-tuning of GPT2 Research	0	1045	April 22, 2021
Is causal language modeling (CLM) vs masked language modeling (MLM) a common distinction in NLP research? Research	0	2181	April 21, 2021
Any ways to visualize attention of the LXMERT? Research	0	497	April 17, 2021
Human Evaluation and Statistical significance Research	0	418	April 8, 2021
How to instill auxiliary information coupled with words into transformers? Research	0	341	March 19, 2021
Zero-shot and distillation - Improved distilled model over teacher model Research	0	1103	March 18, 2021
XLSR-53: To group tokens or not to group tokens Research	1	548	March 18, 2021
NER for 2D text Research	0	429	March 16, 2021
Dealing with Imbalanced Datasets? Research	1	5468	March 11, 2021
How does BERT actually answer questions? Research	1	801	March 11, 2021
FDA Label Document Embedding Research	9	1472	February 19, 2021
Likelihood input sequence came from training set Research	0	337	February 17, 2021
Why are embedding / pooler layers excluded from pruning comparisons? Research	7	789	February 16, 2021
Debugging the RAG question encoder Research	2	578	February 10, 2021
Question about maximum number of tokens Research	1	6194	February 9, 2021
Science Tuesday: MARGE Awesome paper	7	3743	February 8, 2021