RAG: Large number of "generate train split"

I used the facebook/rag-token-nq model in my project and set use_dummy_dataset=False. After the download process, it says ‘’Generating train split: | | /21015300. My queries are around 47,000. Is this expected to generate train splits(around 21,015,300) when only use the model to do prediction? Thank you.