My goal is to find the most frequent words in my corpus, or even better, topics.
The goal is to create a question-answer system and to talk about topics the model knows something about.
How do I get the most frequent words / topics of a small corpus (220,000 documents, each about 1000 words, saved in a pickle) quickly in python?
Thank you for nothing