List all available revision?
|
|
2
|
787
|
May 1, 2024
|
Loss.backward() producing nan values with 8-bit Llama-3-70B-Instruct
|
|
3
|
720
|
May 1, 2024
|
I cannot find the code that transformers trainer model_wrapped by deepspeed , i can find the theory about model_wrapped was wraped by DDP(Deepspeed(transformer model )) ,but i only find the code transformers model wrapped by ddp, where is the deepspeed wr
|
|
1
|
132
|
May 1, 2024
|
Wandb plot x-axis epoch instead of global steps?
|
|
2
|
1355
|
November 30, 2023
|
When deploying AutoTrained model: "Cannot access gated repo"
|
|
1
|
678
|
May 1, 2024
|
Help required in opening files of a dataset (.phys, .thermal, .pts, .ass extensions)
|
|
0
|
48
|
May 1, 2024
|
Llama3 incomplete answer
|
|
1
|
292
|
May 1, 2024
|
How to create a new Hugging face model by using already available hugging face models
|
|
2
|
151
|
May 1, 2024
|
Mistral or LLaMA?
|
|
3
|
3524
|
May 1, 2024
|
Not able to push dataset/model with write token
|
|
1
|
202
|
April 30, 2024
|
Azure cognitiveservices runtime error
|
|
1
|
373
|
April 30, 2024
|
Display of gradio in google colab is not good
|
|
2
|
790
|
April 30, 2024
|
No sentence-transformers model found with name sentence-transformers/all-MiniLM-L6-v2
|
|
2
|
3747
|
April 30, 2024
|
ImportError using AutoModelForCasualLM.from_pretrained
|
|
0
|
420
|
April 30, 2024
|
Memory Error While Fine-tuning AYA on 8 H100 GPUs
|
|
0
|
216
|
April 30, 2024
|
We are a startup non-profit civil liberties project looking to use an unrestricted model for summarizing legal ease for people being victimized
|
|
0
|
180
|
April 30, 2024
|
Fine-tuning with Different Model Heads
|
|
4
|
705
|
April 30, 2024
|
Empathetic Generative AI
|
|
1
|
361
|
April 30, 2024
|
Finetune SAM for instance segmentation to output segmenatation masks along with label names
|
|
0
|
216
|
April 30, 2024
|
Fine tuning gguf models?
|
|
1
|
1412
|
April 30, 2024
|
Regarding GGUF Quantize model
|
|
0
|
160
|
April 30, 2024
|
Fine tune of Mistral model
|
|
0
|
100
|
April 30, 2024
|
Diffusers Pipeline, Can't Connect to the Hub
|
|
0
|
291
|
April 30, 2024
|
meta-llama/Meta-Llama-3-8B is giving empty responses when I use with transformers
|
|
0
|
253
|
April 30, 2024
|
Causal text analysis using transformers
|
|
1
|
520
|
April 30, 2024
|
Japanese NLP - Introductions
|
|
13
|
4438
|
April 30, 2024
|
How can I integrate the InternVL-Chat-V1.5 model into a web page without specialized hardware or API?
|
|
0
|
116
|
April 29, 2024
|
Need to set re_entrant to true with latest transformers
|
|
1
|
1102
|
April 29, 2024
|
How to pass the api token using transformers candle (rust)?
|
|
1
|
159
|
April 29, 2024
|
Comparision of text documents using AlpacaEval
|
|
0
|
79
|
April 29, 2024
|