Using Cosine LR scheduler via TrainingArguments in Trainer
|
|
8
|
5099
|
May 16, 2024
|
Running transformer models on mps instead of cpu on mac
|
|
0
|
35
|
May 16, 2024
|
Fine tuning llm model
|
|
2
|
2146
|
May 16, 2024
|
Document Similarity of long documents e.g. legal contracts
|
|
5
|
6789
|
May 16, 2024
|
Guide/Tutorial to write an inference endpoint for custom models
|
|
1
|
173
|
May 16, 2024
|
Finetuning T5 series models with my own data
|
|
0
|
38
|
May 16, 2024
|
Error 403! What to do about it?
|
|
31
|
24302
|
May 16, 2024
|
ModuleNotFoundError: No module named 'transformers.agents'
|
|
2
|
103
|
May 16, 2024
|
Can't change max_input_length of Text Generation Inference
|
|
0
|
31
|
May 15, 2024
|
Questions about Mistral and apply_chat_template with Text Generation Inference, openai API and messages API
|
|
0
|
29
|
May 15, 2024
|
Why follow Flan-T5 template when T5 tokenizer ignores multiple newlines
|
|
0
|
33
|
May 15, 2024
|
Decoder only model - how to have it not include the prompt in its output?
|
|
3
|
61
|
May 15, 2024
|
How to pretrain randomized language model with custom dataset
|
|
0
|
23
|
May 15, 2024
|
PEFT prompt tuning for SEQ_CLS with BERT causes unexpected keyword argument 'label'
|
|
0
|
40
|
May 15, 2024
|
Is it possible to get the data that is seen by the model during training?
|
|
0
|
41
|
May 15, 2024
|
Can any model actually write current Rust?
|
|
2
|
250
|
May 15, 2024
|
Question regarding adding a 4080 (and 3080?) to a 4090 rig for AI
|
|
2
|
44
|
May 15, 2024
|
How to limit response to generated output only? Using ChatML
|
|
3
|
79
|
May 15, 2024
|
Load_dataset can't find hosted public .parquet files?
|
|
3
|
747
|
May 15, 2024
|
ValueError: Unrecognized configuration class <class 'transformers.models.whisper.configuration_whisper.WhisperConfig'>
|
|
0
|
39
|
May 15, 2024
|
How to train a LLM model on a Native language
|
|
0
|
32
|
May 15, 2024
|
Robot Prophet , Sophia AI
|
|
0
|
64
|
May 15, 2024
|
Uploading a large trained model
|
|
6
|
646
|
May 15, 2024
|
Beginners help how do I block or remove a follower
|
|
0
|
35
|
May 15, 2024
|
Proposal: AI-Powered Video Generation from Single Images Using a Comprehensive Model Zoo
|
|
0
|
38
|
May 15, 2024
|
Regression outputs (list) for normal distribution output in regression problems
|
|
0
|
28
|
May 15, 2024
|
Training BERT for basic recommendation
|
|
0
|
33
|
May 15, 2024
|
Entropy tokenizer
|
|
0
|
33
|
May 15, 2024
|
Information to logical expression
|
|
0
|
39
|
May 15, 2024
|
[ Dataset.from_generator ] Prevent caching during upload
|
|
1
|
47
|
May 15, 2024
|