could it be from errors running the model? i have experienced this with oobabooga crashing if you try to load models it did not like - such as the ones using multimodal. if its a single model inferencing endpoint, i would check logs…in any case, t series are pretty low powered…i think you should try g4 ro g5 to start with, also - t series does not have any GPUs…some of the models require it.