Teaching Transformers to Sum Numbers

mab · April 11, 2022, 11:59pm

I am using T-5 base model to add numbers. The model works well with regular addition(100% accuracy). However, when I train it with modular sum where the system has to choose the last number of the sum. (e.g. for 5+5+7 the answer should be 7) it completely fails ( each epoch it predicts the same number for all of the questions) . I am really surprised that the same system fails at this easy problem. Anybody has any idea why or how to solve?

Topic		Replies	Views
Recommended models for sequence classification on toy synthetic datasets 🤗Transformers	1	260	June 15, 2024
Using T5-Base via Inference API Models	1	994	November 17, 2021
T5 model for summarization far from SOTA results Models	0	1343	July 2, 2021
Advice on model type Beginners	6	455	March 3, 2023
Use Pretrained T5 for Summarization Beginners	3	635	July 2, 2021

Teaching Transformers to Sum Numbers

Related topics