Error using function callbacks on custom inference endpoint

Hi guys, I’m running into problems with tool calling while testing it against my custom inference endpoint.

I’m using this tool definition as an example:

    const testTools = [
      {
        "type": "function",
        "function": {
          "name": "get_number",
          "description": "Get the number",
          "parameters": {
            "type": "object",
            "properties": {
              "number": {
                "type": "string",
                "description": "the number to get",
              },
            },
            "required": ["number"],
          },
        },
      },
    ];

I’m not able to make it work. Without the tools, the request works perfectly, but adding them to the request payload makes the response come back as something like this:

"error\\\": \\\"\\\\\\\"Tool error: Failed to parse generated text: expected value at line 1 column 1 \\\\\\\\\\\\\\\"<|python_tag

This is the call I’m making:

    const response = await inferenceClient.chatCompletion({
      endpointUrl: endpointURL,
      messages: messages.map((message) => ({
        role: message.role,
        content: message.content,
      })),
      max_tokens: xxx,
      temperature: xxx,
      tools: testTools,
      tool_choice: 'auto'
    });
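
For context, this is how I expected to read the tool call from a successful response. The field names follow the OpenAI-compatible schema the JS client mirrors, so correct me if I’ve misread the types:

    // Sketch of the response handling I expected to work.
    const message = response.choices[0].message;

    if (message.tool_calls && message.tool_calls.length > 0) {
      for (const call of message.tool_calls) {
        // `arguments` may come back as a JSON string or as an object
        // depending on the backend, so handle both.
        const rawArgs = call.function.arguments;
        const args = typeof rawArgs === "string" ? JSON.parse(rawArgs) : rawArgs;
        if (call.function.name === "get_number") {
          console.log("get_number called with:", args.number);
        }
      }
    } else {
      // No tool call: the model answered in plain text.
      console.log(message.content);
    }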

Am I missing something, or is anyone else having the same problem?
Thanks in advance for your help!


There seems to be some incompatibility between TGI and Llama’s Function Calling, so you may need to use a slightly hacky workaround.
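
The `<|python_tag` fragment in your error suggests the model is emitting Llama’s native tool-call token, which TGI then fails to parse as JSON. One workaround, sketched below under that assumption (the prompt wording and parsing are mine, not a documented TGI interface), is to drop the `tools` parameter, describe the tool in a system prompt, and parse the raw output yourself:

    // Hedged workaround sketch: omit `tools` and ask for JSON directly.
    // The prompt wording here is an assumption, not a documented API.
    const toolSystemPrompt = [
      "You have access to this function:",
      JSON.stringify(testTools[0].function, null, 2),
      'To call it, reply ONLY with JSON: {"name": "...", "arguments": {...}}',
    ].join("\n");

    const workaroundResponse = await inferenceClient.chatCompletion({
      endpointUrl: endpointURL,
      messages: [
        { role: "system", content: toolSystemPrompt },
        ...messages.map((m) => ({ role: m.role, content: m.content })),
      ],
      max_tokens: 512,  // adjust to your setup
      temperature: 0.1, // low temperature keeps the JSON well-formed
    });

    // Strip the Llama tool-call token if present, then try to parse JSON.
    const raw = (workaroundResponse.choices[0].message.content ?? "")
      .replace("<|python_tag|>", "")
      .trim();
    try {
      const toolCall = JSON.parse(raw);
      console.log("tool call:", toolCall.name, toolCall.arguments);
    } catch {
      console.log("plain answer:", raw);
    }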

Resources