Can you help me interepret the results of my hyperparameter sweep for fine-tuning BLIP2-2.7?
|
|
0
|
47
|
October 22, 2024
|
Sequential Prefilling w/ Mamba
|
|
0
|
49
|
October 21, 2024
|
Masking task with BERT on time serires
|
|
0
|
24
|
October 21, 2024
|
429 Errors and Model Overloaded - Dedicated Endpoint
|
|
0
|
33
|
October 22, 2024
|
Fine tune LLM in our competition for mental health research - £500 ($648) available to win!
|
|
0
|
48
|
October 23, 2024
|
Load frozen layers from one checkpoint and new layers from second checkpoint?
|
|
0
|
40
|
October 23, 2024
|
How to use custom dataset in a model
|
|
0
|
26
|
October 23, 2024
|
Clone Git repo into app
|
|
0
|
8
|
May 16, 2025
|
SHAP Value [MASK] vs attention mask
|
|
0
|
76
|
October 24, 2024
|
The Tree Oil Painting
|
|
0
|
30
|
May 22, 2025
|
Classifier Dropout for *DecoderModel*ForSequenceClassification Classes
|
|
0
|
58
|
October 25, 2024
|
How to "rebuild TensorFlow", what "appropriate compiler flags" i need etc
|
|
0
|
595
|
November 15, 2024
|
Problem with returning decoder cross attentions through generate function
|
|
0
|
25
|
October 25, 2024
|
Multi-gpu huggingface training using trl
|
|
0
|
402
|
October 22, 2024
|
Idea: Iterative Residual Embeddings for Complex Image Understanding
|
|
0
|
12
|
May 21, 2025
|
Accelerator() causes Error
|
|
2
|
364
|
April 12, 2024
|
Managing Memory for Agents 2.0
|
|
0
|
38
|
October 26, 2024
|
Beyond Prompting: A Narrative-Centric Framework for Simulated Consciousness in LLMs
|
|
0
|
31
|
May 19, 2025
|
Input batch size not matching Target batch size
|
|
0
|
88
|
October 26, 2024
|
AERIS â Cognitive Reasoning Layer for Dialectical Evaluation (Demo + Baseline)
|
|
0
|
30
|
May 22, 2025
|
Process Reward Model compatibility with PPOTrainer
|
|
0
|
124
|
October 23, 2024
|
How to Log Accuracy with Metadata in a Sentence Regression Task?
|
|
0
|
16
|
October 26, 2024
|
Add the missing independent feature in graph
|
|
0
|
17
|
October 27, 2024
|
Help with preparing train data for fine-tuning llama 3.1 instruct model?
|
|
0
|
97
|
October 27, 2024
|
TypeError: DPODataCollator.__init__() got an unexpected keyword argument 'max_prompt_length'
|
|
0
|
66
|
October 28, 2024
|
Continuous execution lead to decreasing inference time
|
|
0
|
17
|
October 28, 2024
|
Are dataset "_id" safe to use?
|
|
0
|
155
|
November 15, 2024
|
Lost Van Gogh? AI-Driven Scientific Analysis Reveals Brushstroke secrets!
|
|
0
|
16
|
May 22, 2025
|
I want add thread.Thread in gradio app?
|
|
0
|
274
|
November 15, 2024
|
Finetuning a Large Language Model
|
|
0
|
78
|
October 23, 2024
|