Streamlit App Faster Model Loading

An open question for anyone who has used Transformer models in a Streamlit app. I am using:

pipeline("summarization", model="sshleifer/distilbart-cnn-6-6", tokenizer="sshleifer/distilbart-cnn-6-6", framework="pt")

to do summarization in the app. However, it takes about 55 seconds to create the summary, and 35 seconds or more of that time appears to be spent downloading the model. Is there a way to access the model more quickly? Perhaps by pre-loading the model to Streamlit Sharing (via the GitHub repo the app sits in)?

Also, the summary generation part of the app works once or twice, but if it is run any more times than that, the app crashes. Has anyone else had this experience?

No experience with Streamlit itself, but you can always download the model locally. Usage is a bit different in that case: you provide a path to a directory instead of just the model name. So download all of the model files to a directory, and then pass that directory as the model and tokenizer arguments, as sketched below.
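A minimal sketch of that approach, assuming the standard save_pretrained/from_pretrained API in transformers; the directory name local_distilbart is just an example:

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer, pipeline

MODEL_DIR = "local_distilbart"  # example directory, e.g. checked into the app's repo

# One-time step: download the model and tokenizer, then save them locally.
AutoModelForSeq2SeqLM.from_pretrained("sshleifer/distilbart-cnn-6-6").save_pretrained(MODEL_DIR)
AutoTokenizer.from_pretrained("sshleifer/distilbart-cnn-6-6").save_pretrained(MODEL_DIR)

# In the app: load from the local directory, so nothing is downloaded at runtime.
summarizer = pipeline("summarization", model=MODEL_DIR, tokenizer=MODEL_DIR, framework="pt")
```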

You should wrap the loading of the model/pipeline in a function and add a streamlit.cache decorator (see the sketch below the last reply). That way, the loading/downloading part will only be done once.

https://docs.streamlit.io/en/stable/api.html#optimize-performance


Within the streamlit.cache() decorator you'll get better performance if you use allow_output_mutation=True, because then Streamlit reuses the same copy of the model in memory rather than reloading it every time the script re-runs.
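Putting the two replies above together, a minimal sketch, assuming the st.cache API from the docs linked above:

```python
import streamlit as st
from transformers import pipeline

@st.cache(allow_output_mutation=True)  # load once; reuse the same object on reruns
def load_summarizer():
    return pipeline(
        "summarization",
        model="sshleifer/distilbart-cnn-6-6",
        tokenizer="sshleifer/distilbart-cnn-6-6",
        framework="pt",
    )

summarizer = load_summarizer()  # cached after the first call

text = st.text_area("Text to summarize")
if text:
    st.write(summarizer(text, max_length=130, min_length=30)[0]["summary_text"])
```

With this in place, only the first run pays the download/load cost; later summarizations reuse the in-memory pipeline, which should remove the ~35 seconds of per-request download time described in the question.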