Models

Topic	Replies	Views	Activity
Not getting a good model at first try	0	365	April 14, 2022
Compute the BLEU using pretrained T5-small	2	4000	April 13, 2022
Teaching Transformers to Sum Numbers	0	479	April 11, 2022
NLP Model Deployment and input transformation	0	344	April 10, 2022
3-dimensional attention_mask in LongformerSelfAttention	0	819	April 5, 2022
Creating Batch Sizes for Video Transcription Dataset	0	690	April 5, 2022
Fine-tuning BERT with sequences longer than 512 tokens	7	28009	April 4, 2022
Loss to zero in the training	0	2180	February 17, 2022
Question about GPT2LMHeadModel, GPT2ForSequenceClassification	2	4641	April 1, 2022
Extracting and adding document clustering features to a document classification model	0	788	March 30, 2022
How to select models efficently for fine-tuning?	0	599	March 30, 2022
Hosted Inference API with SpeechBrain returns arror	7	529	March 29, 2022
Microsoft WavLM-Base-Plus for Speaker Verification is corrupted	3	765	March 28, 2022
Further pre-train language model in transformers like BERT	3	1117	March 27, 2022
Web demo broken on ST5	0	415	March 26, 2022
Using Attention matrix to explain a classification problem?	0	648	March 25, 2022
Do I need to worry about this bert.dense.pooler training warning for my usecase?	0	823	March 25, 2022
Why is BigBird Pegasus/Pegasus Repeating the Same Sentence for Summarization?	1	831	March 24, 2022
Demand on Text Regression Pipeline/Application	0	899	March 22, 2022
Wav2vec2-xls-r-2b-22-to-16 sample code not running	1	703	March 18, 2022
T5 Temperature-scaled mixing	0	687	March 18, 2022
Finetuning longformer	2	1434	March 18, 2022
Learning rate for XLM-R followed by linear layers	0	519	March 16, 2022
Wrong tokenizer paths in I-BERT-Large models	0	652	March 15, 2022
What is the difference between lm_labels and decoder_input_ids	0	516	March 13, 2022
About parameter sharing in t5-v1.1	0	364	March 12, 2022
Is there any more tokenizer-free language model available?	0	563	March 12, 2022
Freeze weights of a zero shot model	0	392	March 11, 2022
Pegasus dropping Non-ASCII Chars	6	1181	March 11, 2022
“Confidence “ metric for text to text generation pipeline	0	461	March 9, 2022