Thank you! This is very helpful. Using generator seems the easiest, but the dataset builder offers great flexibility, from what I see. For instance, it would make it easy to have a max_text_length argument or any other argument that changes how the dataset should be created, on-demand. With generator or predefined methods, the datasets would be static.
I’m going with the builder class!