Inference for a 7B model on A100 takes too long?
|
|
1
|
1761
|
March 15, 2024
|
Does merging of based model with LORA weight mandatory for LLAMA2?
|
|
1
|
778
|
March 15, 2024
|
Flowise flow - response from GPT-4 cut (?)
|
|
0
|
245
|
March 15, 2024
|
Inferences with DataParallel
|
|
3
|
5041
|
March 15, 2024
|
I am following a hugging face guide for fine tuning whisper but I run into error when training
|
|
0
|
171
|
March 15, 2024
|
Use embeddings etc. with php
|
|
1
|
1404
|
March 15, 2024
|
Asynchronous CPU-GPU computation
|
|
0
|
347
|
March 15, 2024
|
About temporary hold of adding pament method
|
|
2
|
284
|
March 15, 2024
|
Pipelines for Chat Generation with Memory
|
|
3
|
3753
|
March 15, 2024
|
Здорово! Contribute to Multilingual LLM!
|
|
0
|
310
|
March 15, 2024
|
How to use `broadcast` to send tensor from main process
|
|
0
|
294
|
March 15, 2024
|
Uploading an audio dataset keeps failing at "Uploading the dataset shards"
|
|
2
|
362
|
March 15, 2024
|
Wordy explanations to questions given details
|
|
1
|
78
|
March 15, 2024
|
Huggingface Training Containers
|
|
0
|
304
|
March 15, 2024
|
Image diffuser improver
|
|
0
|
125
|
March 15, 2024
|
Best Model for Question + Answer Embeddings
|
|
0
|
478
|
March 15, 2024
|
Transforming Pushed Hugging Face Models into Usable GGUF Models for Local Colab Use
|
|
2
|
1674
|
March 15, 2024
|
Mixtral 8x7B or any LLM evaluation
|
|
0
|
184
|
March 15, 2024
|
Is it ok to have max_length greater than context_length of the model
|
|
0
|
334
|
March 15, 2024
|
Reused tokenizer returns unk
|
|
1
|
521
|
March 14, 2024
|
Release timeline for 4.39.0 / mamba?
|
|
0
|
210
|
March 14, 2024
|
How to log predictions from evaluation set after each Trainer validation to wandb?
|
|
2
|
1099
|
March 14, 2024
|
How to remove/unclaim a paper authorship?
|
|
1
|
239
|
March 14, 2024
|
Error while using LILT model "index out of range in self"
|
|
5
|
703
|
March 14, 2024
|
RuntimeError: CUDA error: device-side assert triggeredCUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect
|
|
0
|
359
|
March 14, 2024
|
Error with pretrained model: 'dict' object has no attribute 'to_json_string'
|
|
2
|
1222
|
March 14, 2024
|
Error invoking DialoGPT-large via serverless inference endpoint - can only concatenate str (not "dict") to str"
|
|
3
|
953
|
March 14, 2024
|
Loss and results misunderstanding
|
|
0
|
144
|
March 14, 2024
|
NLP Training data
|
|
0
|
130
|
March 14, 2024
|
Need Help Separating PDF Content into Paragraphs Using OCR
|
|
0
|
364
|
March 14, 2024
|