Using Transformers(?) for Tibetan-English Translation | 0 | 397 | June 21, 2023
Medical NER based on Bert in Norwegian | 0 | 211 | June 21, 2023
A criticism of instruction fine-tuning datasets | 2 | 1605 | June 20, 2023
Can we access attention component and feed-forward component of a Bert layer? | 1 | 830 | June 17, 2023
Forward-Forward algorithm by Geoffrey Hinton | 10 | 3906 | June 17, 2023
Language model gradients sensitive to target value/length | 0 | 275 | June 16, 2023
Leather Jacket UK | 0 | 167 | June 16, 2023
Masked Language Model Scoring | 5 | 2412 | June 15, 2023
Modification of self attention in BERT without pretraining | 1 | 295 | June 15, 2023
Fine tuning gpt-neo via ppo | 1 | 1285 | June 11, 2023
Multi-Task Model - OCR + Object Detection | 0 | 538 | June 8, 2023
How to use T5 for sentence embedding? | 6 | 13071 | May 27, 2023
My QUESTION is how to run a very big model like bloom on a cluster of machines? | 0 | 239 | May 26, 2023
Few-shot learning vs Fine-Tuning | 0 | 1409 | May 26, 2023
Finetuning on a recent topic/domain | 2 | 386 | May 25, 2023
Opcodeo Tokenizer | 0 | 236 | May 17, 2023
Importance of sentinel token placement in T5? | 0 | 457 | May 16, 2023
Integration with Public-sector Data Portals | 0 | 288 | May 16, 2023
Multi-GPU Machine Setup Guide and QnA | 6 | 4086 | May 1, 2021
Help me with my PhD research on voice dataset documentation by completing this survey | 1 | 407 | May 13, 2023
Feeding a Knowledge Base into Transformer model | 1 | 1173 | May 2, 2023
Model that generates comments for the AITA subreddit | 0 | 363 | April 29, 2023
Cost Effective LLM - For Small Guys | 0 | 961 | April 27, 2023
Civic Technology Community Group | 1 | 353 | April 25, 2023
Fine-tuned MLM based RoBERTa not improving performance | 2 | 793 | April 20, 2023
A complete survey on ChatGPT: One Small Step for Generative AI, One Giant Leap for AGI | 0 | 1008 | April 5, 2023
Continue pre-training GPT2 | 0 | 395 | March 26, 2023
NLP: Infer intent of finalising a transaction in a dialogue/chat system | 0 | 222 | March 22, 2023
Conversational Budget Analytics | 1 | 453 | March 19, 2023
TRL loss blowing up | 2 | 469 | March 16, 2023