Source and target vs input and labels for causal autoregressive language models
|
|
1
|
1744
|
July 27, 2022
|
How to train a model to do QnA in a language other than English?
|
|
2
|
726
|
July 27, 2022
|
IndexError when building Space
|
|
1
|
938
|
July 27, 2022
|
How exactly does datasets versioning work?
|
|
5
|
3391
|
July 27, 2022
|
How to upload a model card through the API?
|
|
2
|
678
|
July 27, 2022
|
How can I send the infer data to a roberta endpoint?
|
|
0
|
304
|
July 27, 2022
|
Pruning a model embedding matrix for memory efficiency
|
|
7
|
3452
|
July 27, 2022
|
I want to deploy Hugging Face with ONNX in JavaScript for question answering
|
|
3
|
1568
|
July 27, 2022
|
The first argument to `Layer.call` must always be passed
|
|
3
|
1549
|
July 27, 2022
|
Huggingface datasets streaming problem
|
|
6
|
1542
|
July 27, 2022
|
Model saved into a single .h5 file (or TensorFlow Lite)
|
|
5
|
6218
|
July 27, 2022
|
Class prediction in a zero/few-shot setting at inference time
|
|
0
|
401
|
July 27, 2022
|
Multi-node finetuning with Trainer
|
|
0
|
478
|
July 27, 2022
|
Optimum & RoBERTa: how far can we trust a quantized model against its pytorch version?
|
|
10
|
2406
|
July 27, 2022
|
Using Accelerate on an HPC (Slurm)
|
|
10
|
10390
|
July 27, 2022
|
Visualizing named entities
|
|
0
|
321
|
July 27, 2022
|
PreTrain T5 from scratch in Bengali
|
|
5
|
2207
|
July 26, 2022
|
Running mT5 on multiple GPUs
|
|
0
|
520
|
July 26, 2022
|
Why can't the bloom model be run (really slowly) on consumer hardware?
|
|
2
|
558
|
July 26, 2022
|
Tensorflow Models are way slower than Pytorch models, for autoregressive generation?
|
|
3
|
389
|
July 26, 2022
|
Boosting Wav2Vec2-xls-r with an N gram decoder using the transcripts used to train wav2vec2
|
|
1
|
985
|
July 26, 2022
|
Wav2vec2-large-xlsr-53
|
|
4
|
815
|
July 26, 2022
|
Extracting HuBERT hidden units
|
|
1
|
1146
|
July 26, 2022
|
Network is Unreachable Error
|
|
0
|
1559
|
July 26, 2022
|
There is an AdamW optimizer in the PyTorch version. Is there an AdamW in the TensorFlow 2 version?
|
|
1
|
283
|
July 26, 2022
|
How to add multiple metrics to Huggingface Transformers Trainer?
|
|
1
|
2071
|
July 26, 2022
|
T5 transformer tokens and scores
|
|
0
|
709
|
July 26, 2022
|
Inference Input for Vision Models
|
|
6
|
1312
|
July 26, 2022
|
Dynamic range quantization for HF models seems to be spurious
|
|
0
|
200
|
July 26, 2022
|
Anomaly Detection / Out of Domain Detection with BERT
|
|
0
|
964
|
July 26, 2022
|