I’ve searched around and can’t find any existing repos, so I’m curious whether anyone has managed to integrate HF TGI (Text Generation Inference) with TEI (Text Embeddings Inference).
I was using PrivateGPT to query various text documents; however, it has many limitations for production use, mainly that it is single-tenant.
HF TGI seems like a good fit, especially because it continuously batches incoming requests to increase total throughput. The next step is to add RAG, and TEI seems perfect for that.
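For context, the wiring I have in mind is roughly the following. This is only a minimal sketch, not a tested integration: it assumes TEI is served at `http://localhost:8080` and TGI at `http://localhost:8081` (both URLs are placeholders), uses TEI's `/embed` route and TGI's `/generate` route, and swaps in a naive in-memory cosine-similarity search where a real vector store would go. The helper names (`tei_embed`, `tgi_generate`, `top_k`, `build_prompt`, `answer`) are my own, not part of either project.

```python
import json
import math
import urllib.request

# Assumed local endpoints; adjust to wherever TEI and TGI are actually served.
TEI_URL = "http://localhost:8080"
TGI_URL = "http://localhost:8081"


def post_json(url, payload):
    """POST a JSON payload and return the decoded JSON response."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


def tei_embed(texts):
    """Embed a list of texts via TEI's /embed route (one vector per text)."""
    return post_json(f"{TEI_URL}/embed", {"inputs": texts})


def tgi_generate(prompt, max_new_tokens=256):
    """Generate a completion via TGI's /generate route."""
    out = post_json(
        f"{TGI_URL}/generate",
        {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}},
    )
    return out["generated_text"]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec, doc_vecs, k):
    """Return indices of the k document vectors most similar to the query."""
    ranked = sorted(
        range(len(doc_vecs)),
        key=lambda i: cosine(query_vec, doc_vecs[i]),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, passages):
    """Put the retrieved passages ahead of the question."""
    context = "\n\n".join(passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


def answer(question, docs, k=2, embed=tei_embed, generate=tgi_generate):
    """Embed docs and question, retrieve the top-k docs, then generate."""
    doc_vecs = embed(docs)
    query_vec = embed([question])[0]
    passages = [docs[i] for i in top_k(query_vec, doc_vecs, k)]
    return generate(build_prompt(question, passages))
```

The `embed` and `generate` arguments are injectable so the retrieval logic can be tried without either server running. In practice the document embeddings would be computed once and stored per tenant rather than re-embedded on every question.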
Has anyone integrated them together yet?