"attention_mask" + `pad_token_id
|
|
2
|
5225
|
June 6, 2024
|
Extracting information from bills, tax statements, etc: What ML model to use?
|
|
3
|
3138
|
August 28, 2024
|
Multigpu precompute dataset map function and share between processes
|
|
0
|
186
|
July 8, 2024
|
Training using multiple GPUs
|
|
20
|
19997
|
February 25, 2024
|
What is my batch size..?
|
|
2
|
2133
|
April 29, 2024
|
Why isn't quantization config reducing memory usage?
|
|
0
|
82
|
August 16, 2024
|
Why Predict Words Already in the Input?
|
|
0
|
8
|
August 16, 2024
|
How to pre-train or finetune LLM with structured dataset, so the LLM can reason the relationships between data objects
|
|
3
|
4646
|
December 5, 2023
|
How to pass table structure to LLM model
|
|
2
|
1307
|
May 1, 2024
|
Fin-eng-dataset
|
|
4
|
503
|
August 16, 2024
|
Pyabsa Databricks Job Cluster OutputSizeLimit Exceeded
|
|
0
|
138
|
August 16, 2024
|
LLAMA2 70b Inference api stuck on currently loading
|
|
4
|
1031
|
September 3, 2024
|
Add metrics to object detection example
|
|
12
|
3783
|
May 8, 2024
|
Does Trainer.predict preserve test dataset order?
|
|
1
|
177
|
May 2, 2024
|
Convert pre-trained MHA weights to GQA weights
|
|
1
|
375
|
September 29, 2024
|
Calculate precision, recall, f1 score for custom dataset for multiclass classification
|
|
13
|
8611
|
June 13, 2024
|
Trying to use AutoTokenizer with TensorFlow gives: `ValueError: text input must of type `str` (single example), `List[str]` (batch or single pretokenized example) or `List[List[str]]` (batch of pretokenized examples).`
|
|
11
|
19541
|
October 5, 2024
|
Removing tokens from the GPT tokenizer
|
|
2
|
1902
|
August 20, 2024
|
How to extract a specific paragraph from a text file
|
|
2
|
725
|
May 29, 2024
|
DeepSpeed Zero 3 with LoRA - Merging adapters
|
|
1
|
584
|
August 16, 2024
|
Expected mat1 and mat2 to have the same dtype, but got: c10::Half != float
|
|
3
|
1621
|
July 8, 2024
|
Greedy sampling with the new branch
|
|
0
|
128
|
July 8, 2024
|
Dataset.from_dict() killed
|
|
0
|
137
|
July 8, 2024
|
Inference Endpoints for text embeddings inference not working
|
|
2
|
186
|
August 16, 2024
|
Load Huggingface models into Golang?
|
|
2
|
10339
|
March 5, 2024
|
Error from CUDA on audio classification
|
|
3
|
1835
|
September 18, 2024
|
Help solving RuntimeError: NCCL Error 1: unhandled cuda error (run with NCCL_DEBUG=INFO in pods
|
|
1
|
621
|
August 16, 2024
|
Chat Templates for BlenderBot
|
|
5
|
1177
|
September 2, 2024
|
GPT2Tokenizer not putting bos/eos token
|
|
3
|
5380
|
March 31, 2024
|
Discrepancy between OpenAI CLIP and Huggingface CLIP models
|
|
2
|
1637
|
August 19, 2024
|