How to tell an LLM to stop processing a request if the user is no longer interested?

Today, when I make a call to an LLM and it takes more than 1 minute to respond, I don't want to continue, so I terminate my connection. But does terminating the connection signal the LLM in any way to stop working on my request, so that compute on the LLM side is freed up to serve other requests?

Does this happen today? Or is it even possible to do with LLMs?
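
For illustration, the client-side timeout described above could be sketched roughly like this in Python. This is a minimal sketch under assumptions: `fake_llm_call` is a hypothetical stand-in for a slow request and not any real client library, and the 1-minute limit is shortened so the example runs quickly. Note that the timeout only cancels the local task; whether a real server notices the dropped connection and stops generating is exactly the open question.

```python
import asyncio


async def fake_llm_call(delay_s: float, state: dict) -> str:
    """Hypothetical stand-in for a slow LLM request (not a real client API)."""
    try:
        await asyncio.sleep(delay_s)  # pretend the model is still generating
        return "response"
    except asyncio.CancelledError:
        # Only the local task is cancelled here; nothing in this sketch
        # tells a real server to stop working on the request.
        state["cancelled_locally"] = True
        raise


async def call_with_timeout(delay_s: float, timeout_s: float, state: dict):
    """Give up on the call once timeout_s seconds have passed."""
    try:
        return await asyncio.wait_for(fake_llm_call(delay_s, state), timeout=timeout_s)
    except asyncio.TimeoutError:
        return None  # the caller is no longer interested in the answer


def run_demo(delay_s: float = 0.2, timeout_s: float = 0.05):
    """Run one call and report (result, whether the local task was cancelled)."""
    state = {"cancelled_locally": False}
    result = asyncio.run(call_with_timeout(delay_s, timeout_s, state))
    return result, state["cancelled_locally"]


if __name__ == "__main__":
    print(run_demo())
```

With the default arguments the simulated call outlasts the timeout, so the function returns no result and records that the local task was cancelled.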
