The crux of the question: is there a way to train a sequence generation model to have some degree of numerical inference?
Problem setup: a user is creating an advertisement campaign described by a set of attributes (e.g., the campaign should target individuals in San Francisco). The user provides two numerical values as constraints on the campaign, and the model generates a sequence of attributes describing a campaign.
Initial approach: the naive approach is to convert the numerical inputs into string representations (2.0 → two point zero). A custom tokenizer is trained on the dataset. After the number-to-string conversion, the numerical values, along with the target sequence, are used to fine-tune OpenAI's GPT-2 model.
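For concreteness, a minimal sketch of the conversion step, assuming a digit-by-digit spelling scheme (the post only gives the single example 2.0 → two point zero, so the exact scheme is an assumption):

```python
# Assumed digit-by-digit number-to-string conversion; the original post
# only specifies one example (2.0 -> "two point zero").
DIGIT_WORDS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
    ".": "point",
}

def number_to_words(value: float) -> str:
    """Spell out a numeric value digit by digit, e.g. 2.0 -> 'two point zero'."""
    return " ".join(DIGIT_WORDS[ch] for ch in str(value))
```

Note that under this scheme, 2.0 and 2.1 differ only in their final word, yet after tokenization the model has no built-in notion that the two strings denote nearby quantities.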
Results: as expected, the model does very well at generating campaigns that make sense; however, the campaigns do not make sense in the context of the numerical inputs. Numerical inputs of 2.0 and 2.1 yield very different campaigns when, in fact, they should yield similar sequences of attributes.
Has there been work done on passing numerical information through a transformer model?
Example data: Given a budget of 1200 and a CPM target of 2.1, we recommend the following targets. Channel: mobile, connected tv; Location: US; … (and so on).
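To illustrate, one plausible serialization of a training example matching the sample data above (the template, field names, and separator are assumptions; the post does not specify the exact fine-tuning format):

```python
# Hypothetical serialization of one fine-tuning example. The spelled-out
# numbers follow the conversion step described earlier; the "=>" separator
# and attribute ordering are illustrative assumptions.
def build_example(budget: str, cpm: str, targets: dict) -> str:
    attrs = "; ".join(f"{key}: {value}" for key, value in targets.items())
    return f"Budget: {budget}, CPM target: {cpm} => {attrs}"

example = build_example(
    "one two zero zero",  # 1200, spelled digit by digit
    "two point one",      # 2.1
    {"Channel": "mobile, connected tv", "Location": "US"},
)
```

The model is then trained to generate everything after the `=>` conditioned on the text before it, which is where the lack of numerical grounding shows up.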