TGI & guidance making a strange behavior

romprr · March 20, 2025, 1:56pm

Hello everyone,

I’m currently working on a project where i had a prototype working, as i’m now working on the production version i’m re adapting some of my formats, one of them is a pydantic class like this :

class Object(BaseModel) :
  some variables here

class TheActualClass(BaseModel) :
  var : Object
  some other variables

With the production there is alot more variables in Object than there was with the prototype, leading to a strange behavior of TGI which i also encoutered when processing long context data (too large for my vram probably ?), TGI does not react at all, my gpu is not doing anything and it is kinda just frozen forcing me to reload the container because even after half an hour of waiting there is absolutely nothing that happens.

I did find a workaround by replacing the Object from a BaseModel to a TypedDict, but i’m kinda curious on why the engine is acting this way of just being frozen, but also because it is the engine i plan to use for the production case and if this happens in use that might become a pretty big problem.

Maybe anyone has an idea about what might be the problem ?

John6666 · March 20, 2025, 2:28pm

Possibly Pydantic with TGI specific issue…?

github.com/pydantic/pydantic-ai

text-generation-inference (TGI) support

opened 08:49PM - 17 Dec 24 UTC

closed 06:28PM - 19 Dec 24 UTC

sadransh

more info

I am having following issue with TGI self-hosted model when trying to use tool c…alling with the following code, while it is fine with `result_type=str` ```python from pydantic import BaseModel from pydantic_ai import Agent from pydantic_ai.models.openai import OpenAIModel class CityLocation(BaseModel): city: str country: str model = OpenAIModel( model_name='Meta-Llama-3.1-70B-Instruct', base_url = 'http://localhost:8000/v1', api_key ="-", ) agent = Agent(model, result_type=CityLocation) result = agent.run_sync('Where were the olympics held in 2012?') print(result.data) #> city='London' country='United Kingdom' print(result.cost()) ``` error: ``` UnprocessableEntityError: Failed to deserialize the JSON body into the target type: messages[1]: missing field `content` at line 1 column 235 ``` To my understanding based on tgi docs, it is openai compatible. https://huggingface.co/docs/text-generation-inference/en/messages_api Does it require similar development to this feature request? https://github.com/pydantic/pydantic-ai/issues/224 environment: pydantic_ai v 0.0.13 python 3.9 pydantic 2.10.3

romprr · March 24, 2025, 3:17pm

I don’t think so (or atleast this is not the only thing) as i had this exact same behavior with no formatting just very very long context, what’s really strange is that there is really nothing happening, no error to debug, no generation (i can hear the gpu when it generates usually), nothing happening on the docker logs, it just doesnt act at all and doesnt receive any query when this happens tgi just become unusable

With the workaround i found for formatting and cutting long context into smaller blocks, i didn’t encounter this again, maybe it is just because i’m at the limit of my hardware but i still find it strange that there is not even an error or anything

John6666 · March 24, 2025, 3:37pm

what’s really strange is that there is really nothing happening, no error to debug, no generation (i can hear the gpu when it generates usually), nothing happening on the docker logs, it just doesnt act at all and doesnt receive any query when this happens tgi just become unusable

Wow… that’s pretty weird. If TGI is crashing for some logical reason, there should be some kind of log or output…
The fact that the GPU fan isn’t spinning means that it didn’t even reach the point of loading the model, and it really did abort at the very beginning.

Well, it’s good that there seems to be a workaround…

Topic		Replies	Views
TGI with guidance generates weird output when asked to answer in a "structured" way Intermediate	3	123	February 17, 2025
Default parameters when querying models with TGI Intermediate	0	346	April 23, 2024
TGI Model Question 🤗Hub	0	370	September 21, 2023
Error Using Pydantic with LangChain and local model by Hugging Face for Structured Output 🤗Transformers	1	1027	October 20, 2024
Exception in ASGI application Spaces	3	2653	January 29, 2025

TGI & guidance making a strange behavior

Related topics