Maybe not to generate a word every time

but an concept or idea, then use attension to control the length and detailed content with RL to compare the error