Sourcing AI Models and Building a Local Application

We’re exploring embedding open-source models from Hugging Face into our application.

For teams that have done this — how are you building containerized applications around these models?

- Is there a reference workflow you follow (from model pull → packaging within the application → deployment)?

- How do you check the sourced model for vulnerabilities? Is the concept the same as scanning open-source dependencies?

- Do you use an artifact repository like JFrog Artifactory or Sonatype Nexus to store the models?

- What other considerations come with embedding models in the application, compared to making API calls to OpenAI/Anthropic?


> reference workflow

TGI + Docker or vLLM + Docker are recommended for their speed and scalability. Ollama is fast and easy to use for testing, but it handles very long contexts poorly.
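A minimal sketch of that pull → serve → query loop, assuming the official `vllm/vllm-openai` image plus the `huggingface_hub` and `openai` Python packages; the model ID and port are placeholders, not recommendations:

```python
# Sketch: model pull -> local serve -> query; not a definitive pipeline.
from huggingface_hub import snapshot_download
from openai import OpenAI

# 1. Pull the model; pin a revision so container builds are reproducible.
local_dir = snapshot_download(
    repo_id="mistralai/Mistral-7B-Instruct-v0.2",  # placeholder model ID
    revision="main",  # better: pin a specific commit hash
)
print(f"Model snapshot at: {local_dir}")

# 2. Serve it separately, e.g. with the official vLLM image:
#    docker run --gpus all -p 8000:8000 \
#        -v ~/.cache/huggingface:/root/.cache/huggingface \
#        vllm/vllm-openai --model mistralai/Mistral-7B-Instruct-v0.2

# 3. Query the local OpenAI-compatible endpoint from the application.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")
resp = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.2",
    messages=[{"role": "user", "content": "Hello from a local model!"}],
)
print(resp.choices[0].message.content)
```

One nice side effect of this layout: because vLLM (and TGI) expose an OpenAI-compatible API, swapping between a local model and a hosted provider is mostly a matter of changing `base_url` and credentials.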

> vuln

Use safetensors. If a model requires trust_remote_code, vet that code thoroughly beforehand. If you really must load pickle-based checkpoints, use PyTorch 2.6.0 or later, where torch.load defaults to weights_only=True and rejects arbitrary pickled objects.
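A minimal loading sketch under those assumptions (transformers, safetensors, and PyTorch >= 2.6 installed; the model ID and checkpoint path are placeholders):

```python
# Sketch: prefer safetensors and keep remote code off by default.
from pathlib import Path

import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-Instruct-v0.2",  # placeholder model ID
    use_safetensors=True,     # fail instead of falling back to pickle weights
    trust_remote_code=False,  # the default; enable only for code you have vetted
)

# If a raw pickle checkpoint is unavoidable, rely on the PyTorch >= 2.6
# default of weights_only=True, which blocks arbitrary code execution.
ckpt = Path("model.bin")  # placeholder path to a downloaded checkpoint
if ckpt.exists():
    state_dict = torch.load(ckpt, weights_only=True)
```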