Research on Hyperparameters for Fine Tuning
|
|
2
|
247
|
February 26, 2024
|
Copying mechanism for transformer
|
|
9
|
5894
|
February 23, 2024
|
AI for Low-Budget film making (an experiment)
|
|
0
|
257
|
February 14, 2024
|
Open source psychology project using HF sentence transformers
|
|
0
|
189
|
February 14, 2024
|
Conversational Search and Analysis of Collections of Letters and Comments
|
|
3
|
505
|
February 3, 2024
|
Free Access for Masters Dissertation
|
|
1
|
274
|
February 2, 2024
|
Tech skill embeddings
|
|
0
|
141
|
January 29, 2024
|
Profiling all layers of a model
|
|
0
|
385
|
January 26, 2024
|
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
|
|
1
|
2482
|
June 13, 2023
|
How to get the probabilities of each class when we use T5 or Flan
|
|
0
|
240
|
January 23, 2024
|
Combinatorial Optimization with LLMs/Transformers
|
|
4
|
1039
|
January 23, 2024
|
Researching ways to speed up WhisperAI startup
|
|
0
|
245
|
January 20, 2024
|
Strategies for Enhancing LLM's Understanding of a Complex Novel for Improved Question Answering
|
|
1
|
639
|
January 19, 2024
|
Facing issues in fine-tuning Vicuna-7b model
|
|
0
|
307
|
January 18, 2024
|
Contextual Recommendation of Adages, Allusions, Anecdotes, Aphorisms, Jokes, Proverbs, Quotes, Lyrics, Poems, Stories, and Witticisms
|
|
1
|
212
|
January 15, 2024
|
Inference optimization with HPC
|
|
2
|
328
|
January 8, 2024
|
Best use of a large dataset
|
|
0
|
181
|
January 6, 2024
|
Theme Extraction from Text
|
|
1
|
503
|
December 29, 2023
|
How can I replicate the research paper?
|
|
1
|
290
|
December 22, 2023
|
PPO using TRL: optimal strategy for reward calculation?
|
|
1
|
391
|
December 20, 2023
|
Special Digit Recognizer
|
|
0
|
232
|
December 18, 2023
|
Looking for OCR post-processing for Visual Document Understanding
|
|
0
|
293
|
December 15, 2023
|
Using Google's Gemini for scientific literature
|
|
0
|
1010
|
December 14, 2023
|
Translation task for scarce language
|
|
1
|
174
|
December 14, 2023
|
Classifying NLP tasks based on prompts?
|
|
0
|
229
|
December 14, 2023
|
Jacket shop usa
|
|
1
|
248
|
December 13, 2023
|
How do i choose a optimal LLM for Pentesting
|
|
2
|
557
|
December 13, 2023
|
LongLora fine-tuned model
|
|
0
|
215
|
December 11, 2023
|
What to Monitor during training Val_Loss or Val_Accuracy?
|
|
0
|
235
|
December 5, 2023
|
LayoutLMV3 information extraction from invoice
|
|
1
|
462
|
November 28, 2023
|