GPT-2 shift logits and labels | 5 | 5828 | May 12, 2023
Seeding Data Collator | 0 | 223 | May 12, 2023
OOM when I using torch.nn.parallel.DistributedDataParallel to train LLAMA-7B | 0 | 721 | May 12, 2023
Formatting data for the selected task is stuck. How to debug? | 0 | 327 | May 11, 2023
Finetuning options with SAM? | 4 | 5231 | May 11, 2023
Asked for "Draw me a picture of a circle" | 0 | 265 | May 11, 2023
60-minute+ Transcript Summarization | 0 | 488 | May 11, 2023
TransformerAgents - error running example notebook | 0 | 200 | May 11, 2023
For google/deplot, what should I input as header text for fine-tuning? | 7 | 1416 | May 11, 2023
Huggingface Distributed Training with Accelerate | 1 | 864 | May 11, 2023
Use alpaca with local embedding | 0 | 630 | May 11, 2023
Offset_alibi is not used in bloom | 0 | 243 | May 11, 2023
TypeError: __init__() got an unexpected keyword argument 'generator' | 0 | 739 | May 11, 2023
Problem with sharing models among processes via multiprocessing | 0 | 934 | May 11, 2023
Finetuning Transformers for Text Classification Issue | 2 | 706 | May 11, 2023
Is this correct approach to do Prompt Tuning on DollyV2 model | 0 | 593 | May 9, 2023
Spaces using CPU | 2 | 484 | May 11, 2023
My list of human preference datasets | 0 | 2651 | May 10, 2023
Issues replicating time series analysis with Transformers | 1 | 507 | May 10, 2023
How to fine tune BertForSequenceClassification with PEFT? | 0 | 943 | May 10, 2023
Use decoder_input_ids with deepspeed | 0 | 270 | May 9, 2023
Learning sets and disabling positional embedding knowledge? | 0 | 296 | May 10, 2023
BERT inference with Hugging Face Transformers and AWS Inferentia | 0 | 529 | May 10, 2023
How to do full page analysis with TrOCR (integrating with text segmentation analysis) | 0 | 2047 | May 10, 2023
Attaching a vision decoder to VisionTextDualEncoder | 0 | 264 | May 10, 2023
Cannot upload CSV or JSONLines To Autotrain | 2 | 895 | May 10, 2023
Seq2SeqTrainer, `push_to_hub` returns None | 2 | 395 | May 10, 2023
Scala/JVM Bindings for Tokenizers | 0 | 503 | May 10, 2023
requests.exceptions.HTTPError: 401 Client Error: | 1 | 912 | May 10, 2023
Error when training Deberta | 0 | 279 | May 10, 2023