Any advice on LLM inference over a large dataset?

For a research project, I'd like to apply a single prompt to each entry in a text column of a large-ish dataset and collect the results into a new column.

As a first test, I played around in Databricks with the Falcon 7B model and ran a list of 80 prompts; it took a few hours to finish. I'm thinking the 40B model is only going to be slower, even on a larger cluster, but I'll try it anyway as soon as I can get the compute. A simplified sketch of what I ran is below.
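This is roughly what my first test looked like (simplified; the model id, prompt template, and generation settings here are illustrative, not my exact setup):

```python
import pandas as pd
from transformers import pipeline

# Falcon required trust_remote_code when I tried it; settings are illustrative
generator = pipeline(
    "text-generation",
    model="tiiuae/falcon-7b-instruct",
    device_map="auto",       # spread the model across available GPUs
    torch_dtype="auto",
    trust_remote_code=True,
)

# Stand-in for my real dataset's text column
df = pd.DataFrame({"text": ["example entry 1", "example entry 2"]})
prompt_template = "Summarize the following text:\n{text}"

# Naive per-row loop -- this is the part that took hours for 80 prompts
df["result"] = [
    generator(prompt_template.format(text=t), max_new_tokens=100)[0]["generated_text"]
    for t in df["text"]
]
```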

What's the recommended approach here? What kind of runtime/results can I expect in the best case?