AttributeError: module 'torch' has no attribute 'chalf'
|
|
8
|
996
|
May 6, 2024
|
Why activations memory is computed through an experiment rather formulating it for DeepSpeed autotuner
|
|
0
|
81
|
May 6, 2024
|
Issues with Downloading Llama2 in Jupyter Notebook
|
|
1
|
547
|
May 5, 2024
|
Good Arabic embeddings are needed
|
|
0
|
126
|
May 4, 2024
|
# [ImportError: `llama-index-readers-file` package not found ]
|
|
0
|
269
|
May 4, 2024
|
ImportError: `llama-index-readers-file` package not found
|
|
0
|
179
|
May 4, 2024
|
How to use hugging face transformers for testing a dataset
|
|
1
|
265
|
May 4, 2024
|
Clarification on the attention_mask
|
|
4
|
23005
|
May 3, 2024
|
Retraining pre-trained NER model with new data samples
|
|
1
|
395
|
May 3, 2024
|
PerceiverIO Output Query Array Doubts
|
|
1
|
124
|
May 3, 2024
|
How do I avoid LLM rambling?
|
|
0
|
322
|
May 3, 2024
|
Tensor parallel in Pytorch 2.3
|
|
0
|
203
|
May 2, 2024
|
Convert Pytorch Model to Huggingface Transformer?
|
|
2
|
10755
|
May 2, 2024
|
Customizing T5 tokenizer for finetuning
|
|
1
|
612
|
May 2, 2024
|
Node: 'model/swin_transformer/tf_swin_model/swin/encoder/layers.1/blocks.0/Reshape_33' Input to reshape is a tensor with 3763200 values, but the requested shape requires a multiple of 20384
|
|
0
|
100
|
May 2, 2024
|
pre-train_BERT for a specific corpus
|
|
0
|
72
|
May 2, 2024
|
Inference API offline model limit
|
|
1
|
917
|
May 2, 2024
|
A model to extract email text body from html code
|
|
4
|
583
|
May 2, 2024
|
T5 generates repetitive sentences
|
|
3
|
770
|
May 2, 2024
|
Training multiple times in one script
|
|
0
|
199
|
May 2, 2024
|
Setting "num_beams" and using "past_key_values" when calling .generate()
|
|
0
|
213
|
May 2, 2024
|
I cannot find the code that transformers trainer model_wrapped by deepspeed , i can find the theory about model_wrapped was wraped by DDP(Deepspeed(transformer model )) ,but i only find the code transformers model wrapped by ddp, where is the deepspeed wr
|
|
1
|
134
|
May 1, 2024
|
How to create a new Hugging face model by using already available hugging face models
|
|
2
|
152
|
May 1, 2024
|
Fine tuning gguf models?
|
|
1
|
1422
|
April 30, 2024
|
meta-llama/Meta-Llama-3-8B is giving empty responses when I use with transformers
|
|
0
|
256
|
April 30, 2024
|
Need to set re_entrant to true with latest transformers
|
|
1
|
1168
|
April 29, 2024
|
How to pass the api token using transformers candle (rust)?
|
|
1
|
162
|
April 29, 2024
|
Learning rate for the `Trainer` in a multi gpu setup
|
|
4
|
584
|
April 29, 2024
|
Script stops upon setting the model
|
|
0
|
96
|
April 29, 2024
|
How to convert natural languages into vec?
|
|
2
|
95
|
April 29, 2024
|