I’m trying Dataset.from_generator() loading function, it is awesome! However, on some runs, it simply skips my changes to the generator function. How to stop it from caching?
I tried:
Hmm can you share more details about your generator ? It’s not supposed to skip you changes.
We haven’t added the argument to from_generator() to ask to always use a new cache yet, feel free to open an issue on GitHub to mention this issue if you’re interested.
Note that every time you modify the gen_kwargs passed to from_generator it will use a new cache, you can leverage this as a workaround.