Datasets: Limit the number of rows?
|
|
4
|
8287
|
December 17, 2023
|
Understanding repetition_penalty in LLaMA-2 Pretrained Model
|
|
0
|
5263
|
December 17, 2023
|
[Feature Request] Search parameters
|
|
0
|
282
|
December 18, 2023
|
Llama-2-70b-chat-hf get worse result than Llama-2-70B-Chat-GPTQ
|
|
0
|
489
|
December 18, 2023
|
How to choose optimal batch size for training LLMs?
|
|
4
|
18502
|
December 18, 2023
|
Models slow on M1 Pro 16gb
|
|
0
|
727
|
December 18, 2023
|
What is the difference between StableDiffusionPipeline and DiffusionPipeline
|
|
2
|
1105
|
December 18, 2023
|
Recommended hardware for running LLMs locally
|
|
2
|
32538
|
December 18, 2023
|
Load dataset from local files with existing builder (pubmed dataset)
|
|
2
|
254
|
December 18, 2023
|
Flash Attention 2 Error on Mistral Based Model
|
|
0
|
615
|
December 18, 2023
|
Remove PE/Encoder on BartModel
|
|
0
|
198
|
December 18, 2023
|
Mistral take ages
|
|
0
|
323
|
December 18, 2023
|
Questions re: Tokenizer pipeline composability / reuse outside of the HF ecosystem
|
|
0
|
213
|
December 18, 2023
|
Special Digit Recognizer
|
|
0
|
292
|
December 18, 2023
|
Slightly different embeddings for pandas series using a sentence transformer
|
|
1
|
617
|
December 18, 2023
|
What would be the minimum instance to deploy TheBloke/Phind-CodeLlama-34B-v2-GPTQ?
|
|
1
|
273
|
December 18, 2023
|
Searching documentation no longer works
|
|
2
|
289
|
December 18, 2023
|
Trainer: How can I log model outputs besides loss?
|
|
0
|
262
|
December 18, 2023
|
Deploy model in hugging face platform
|
|
0
|
256
|
December 18, 2023
|
🤗Transformer with Trainer API on TPU VMs and TPU Pods
|
|
0
|
407
|
December 18, 2023
|
Whisper Model: Validation loss decreasing but WER increasing/constant
|
|
0
|
267
|
December 18, 2023
|
How long are new users rate limited in HF discussions?
|
|
2
|
645
|
December 18, 2023
|
Getting LLaMA tokenizer from meta
|
|
0
|
119
|
December 19, 2023
|
Performing Whisper's "transcribe" with Transformer pipelines
|
|
2
|
2646
|
December 19, 2023
|
How to save audio dataset with parquet format on disk
|
|
2
|
2054
|
December 19, 2023
|
How can I multithreadedly download a HuggingFace dataset?
|
|
2
|
1399
|
December 19, 2023
|
Accelerate FSDP training || RuntimeError : Forward oder differ across ranks
|
|
0
|
444
|
December 19, 2023
|
Crash during training - rate limit
|
|
0
|
465
|
December 19, 2023
|
Prompt Tuning For Sequence Classification
|
|
5
|
2054
|
December 19, 2023
|
Use gradio with Curl
|
|
0
|
777
|
December 19, 2023
|