How to approach NLG problem, mainly generating summaries from a table/chart using trasnformers based models
|
|
0
|
109
|
March 6, 2023
|
Carrying Gradients Through Generate
|
|
5
|
1527
|
January 29, 2023
|
Model Adaptation
|
|
0
|
140
|
January 24, 2023
|
Swapping out self-attention layer in BERT
|
|
0
|
186
|
January 11, 2023
|
Why are huge batch sizes used for pretraining and small ones for finetuning?
|
|
3
|
3187
|
January 10, 2023
|
How to load only a few parameters
|
|
0
|
187
|
January 7, 2023
|
Domain-specific word similarity problem
|
|
1
|
287
|
January 6, 2023
|
Encoder-Decoder vs Decoder Only Architecture Models
|
|
0
|
664
|
December 18, 2022
|
Train BERT with sentence embeddings
|
|
0
|
267
|
December 14, 2022
|
Is the evaluate-metric/accuracy the same as macro-accuracy?
|
|
0
|
289
|
December 13, 2022
|
Understanding FLOPs-per-token estimates from OpenAI's scaling laws
|
|
5
|
3159
|
December 13, 2022
|
ConformerCTC for streaming
|
|
1
|
310
|
December 12, 2022
|
Sequence classification
|
|
0
|
251
|
December 11, 2022
|
Individually Logging All The Layer/Neuron Outputs
|
|
0
|
338
|
December 1, 2022
|
Incremental decoding with T5
|
|
0
|
511
|
November 29, 2022
|
Is it possible to split a Bert-alike model's output into different task?
|
|
0
|
349
|
November 28, 2022
|
Privacy enhancing technologies in model development
|
|
0
|
380
|
November 22, 2022
|
Conversational QA pretrained model?
|
|
0
|
420
|
November 21, 2022
|
Composition Training/Validation Split of AutoTrain
|
|
0
|
311
|
November 18, 2022
|
Do the common tricks in transformers help with RNNs?
|
|
0
|
384
|
November 10, 2022
|
Rust applications
|
|
3
|
1233
|
November 10, 2022
|
Metadata of NLP datasets
|
|
0
|
441
|
November 5, 2022
|
I'd like to understand on how to train a neural net with agents and evolution
|
|
0
|
416
|
November 1, 2022
|
How to annotate these type of data for custom tr-ocr training
|
|
0
|
415
|
October 30, 2022
|
Online/streaming speech recognition
|
|
2
|
1729
|
October 26, 2022
|
Exploring contexts of occurrence of particular words in large datasets
|
|
2
|
616
|
October 19, 2022
|
Explaining medical diagnosis
|
|
0
|
403
|
October 19, 2022
|
Attention mask and token ids
|
|
1
|
1250
|
October 18, 2022
|
Text generation using SetFit
|
|
0
|
558
|
October 17, 2022
|
BERT from scratch without self-supervised learning
|
|
0
|
451
|
October 13, 2022
|