Models

Topic	Replies	Views	Activity
Help regarding understanding Llama output without sampling	0	1092	April 17, 2023
High Level Philosophy Can Break/Jailbreak LLM's Real Bad	0	713	April 16, 2023
F222 and diffusers	1	6620	April 13, 2023
Confusion over use of -100 pad value for GPT2 Causal Modeling Fine-tuning	0	290	April 13, 2023
This model could not be loaded by the inference API	1	634	April 12, 2023
Using a DataCollator using native pytorch	0	235	April 12, 2023
Best Text to Speech in 2023	0	598	April 9, 2023
How to interpret metrics for a Seq2Seq task?	0	758	April 8, 2023
Creating word embeddings using BERT of machine generated sequential data	0	266	April 7, 2023
Flan-t5-xl generates only one sentence	3	4148	April 6, 2023
Give detailed information (paragraph) to CLIP about the classes	0	196	April 6, 2023
Validate document content against set of rules	0	200	April 6, 2023
The model doesn't work on HF, but it works locally	0	263	April 4, 2023
The loss plateau of pratraining Bert using run_mlm.py	4	1981	April 4, 2023
How to interpret the output of the segmentation model?	0	245	April 4, 2023
MarianMT model cross attention layers alignment problem!	0	338	April 3, 2023
Run split-GPU inference with GPT-NeoX-20B	1	744	April 3, 2023
New tool to improve performance of generative AI models	0	766	April 2, 2023
Call ViTMAE Forward Embedding	1	298	March 30, 2023
How the embedding model (x-vectors) trained?	0	921	March 30, 2023
I tired and can't solve this error , ValueError: The model did not return a loss from the inputs, only the following keys: logits. For reference, the inputs it received are input_ids,attention_mask	1	1163	March 29, 2023
GPT-J-6B Model from Transformers GPU Guide contains invalid tensors	0	585	March 29, 2023
Finetuning wav2vec2-large-xlsr-53 only outputs blank labels	6	1227	March 29, 2023
Tabular-regression is not a valid pipeline	0	169	March 25, 2023
How many neurons (units) are there in the BERT model?	0	233	March 25, 2023
[Keras] Fine-Tune Vision Transformer Model?	3	2385	August 9, 2022
TrOCR issues Stop Iteration training	0	392	March 24, 2023
Whisper on SageMaker - how to stop it from translating the result?	0	489	March 24, 2023
The model 'GPTJForCausalLM' is not supported for text2text-generation	0	1016	March 22, 2023
When I use Trainer API to train the GLM Model and save this model，I find memory of the finetuned model is twice the size of the original model. What is the reason for this?	5	323	March 22, 2023