Hi, I just need your opinion.
I have a chatbot system that runs on the models below:
- Llama 3.2 90B
- Llama 3.2 3B
- Whisper (large)
- nomic-embed-text (Embedding)
What GPU cluster specification would you recommend for running the models above?