Models

Topic	Replies	Views	Activity
Mt0 models are text2text-generation models, not text-generation	0	434	March 4, 2023
Does anyone know how to setup meta's llama?	0	2161	March 4, 2023
Using the .generate() function with a custom model class	0	683	March 3, 2023
[RAG]AttributeError: 'RagTokenizer' object has no attribute 'convert_tokens_to_ids'	0	586	March 2, 2023
Using geographical distance matrix in transformers	0	315	March 2, 2023
I have problem with teaching GPT-2 large model (774m) with transformers	2	811	March 1, 2023
How to teach a gpt-2 for Q&A?	0	2181	March 1, 2023
How to download current version models only?	1	459	March 1, 2023
Issue with loading model config file for GPT-Neo 2.7B	0	395	March 1, 2023
Recovering input IDs from input embeddings using GPT-2	1	1261	March 1, 2023
How to do generation using encoder_outputs	0	327	February 28, 2023
Asking for the appropriate image-tweet combination for social media geolocation model architecture	0	179	February 28, 2023
How to Resolve batch[label_name] = torch.tensor(batch[label_name], dtype=torch.int64) TypeError: not a sequence	0	908	February 27, 2023
How to use markupLM for QA on HTML text longer than 512 tokens?	0	387	February 26, 2023
How to add additonal attention layer in pretrained U-Net?	0	733	February 25, 2023
AttributeError: 'Wav2Vec2FeatureExtractor' object has no attribute 'decode'	0	585	February 24, 2023
Product brand and model detection	0	436	February 23, 2023
Inference with BLOOMZ on CPU	0	297	February 22, 2023
AutoModelForQuestionAnswering: ValueError: too many values to unpack (expected 2)	0	459	February 22, 2023
Attention weights transfer but different classes	0	223	February 21, 2023
502 Bad Gateway When Accessing a Model via Cloudflare Worker	0	841	February 21, 2023
In wav2vec2 why are the basic learned units are learning basic units are 25ms long?	0	366	February 21, 2023
When using wav2vec2 inference, does the quantitation being used?	0	181	February 21, 2023
Finetunning on a new corpus for Conditional Generation. Should I train from scratch?	0	323	February 21, 2023
Can I fine-tune a Donut model that has once been fine-tuned or use several simultaneously?	0	331	February 19, 2023
Donut base-sized model, pre-trained only for a new language tutorial	2	1076	February 19, 2023
Bart_lfqa: content has to be taken into account and a message is to be uttered if no reply is being found instead of going freestyle	0	229	February 18, 2023
Using Bloom with detailed parameters?	8	2926	February 18, 2023
Encoder-only Transformer (BERT-like) for Token Classification outside NLP	0	434	February 16, 2023
How to save and load the custom Hugging face model including config.json file using pytorch	2	7271	February 16, 2023