GPU Hardware Requirements

Hi, I'd just like your opinions.

I have a chatbot system that runs on the models below:

  • Llama 3.2 90B
  • Llama 3.2 3B
  • Whisper (large)
  • nomic-embed-text (Embedding)

What GPU cluster specification would you recommend for running the models above?
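
To frame the question, here is a rough back-of-the-envelope sketch of the memory needed for the model weights alone, computed as parameter count times bytes per parameter. The parameter counts for Whisper (large) and nomic-embed-text are approximate assumptions on my part, and the precision options are only illustrative. The KV cache, activations, and framework overhead are not included, so real requirements will be higher.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Counts are in billions of parameters.
MODELS_BILLION_PARAMS = {
    "Llama 3.2 90B": 90.0,
    "Llama 3.2 3B": 3.0,
    "Whisper (large)": 1.55,     # assumption: roughly 1.55B parameters
    "nomic-embed-text": 0.137,   # assumption: roughly 137M parameters
}

# Bytes used to store one parameter at each precision.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_gb(billion_params, precision):
    """Weight memory in GB (1 GB = 1e9 bytes) at the given precision."""
    return billion_params * BYTES_PER_PARAM[precision]


def total_weight_gb(precision):
    """Sum of weight memory across all models at the given precision."""
    return sum(weight_gb(p, precision) for p in MODELS_BILLION_PARAMS.values())


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        total = total_weight_gb(precision)
        print(f"{precision}: ~{total:.1f} GB for weights alone")
```

By this estimate the 90B model dominates everything else, so the cluster sizing mostly comes down to how that one model is quantized and split across GPUs.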
