Hi there,
First off, thank you so much to everybody who has asked questions on these forums, answered those questions, and, of course, to everybody at Hugging Face who has made machine learning accessible to folks like me who are trying to jump into ML. The libraries, the example datasets, the ability to download models so easily, the YouTube channel, the answers to questions on GitHub, everything, it is just invaluable.
Anyways, I am feeling pretty good about the training code setup we now have for a very exciting machine learning task: creating a dental chatbot for dentists and dental professionals, using 1.5 million training inputs comprising forum discussions and podcast transcripts.
My question: what is the best cloud environment?
Obviously this is not a simple question, as there are a lot of factors to consider, so let me narrow it down a bit.
We are currently using an Azure NC24s v3 instance, and it will take us about 2 weeks to train using this configuration.
I am starting to compare prices with SageMaker and other options, but I wanted to see if maybe there is some obscure environment or something I am not aware of that we should consider.
I am also considering testing an Azure ND96asr v4 instance. Even though it’s more expensive per hour, it appears to be significantly more powerful, so it could potentially cut training time by more than the price goes up and give us more bang for our buck overall.
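To make that trade-off concrete, here is a rough sketch of the arithmetic I have in mind. The hourly rates and speedups below are made-up placeholders, not real Azure quotes or benchmarks, and the helper names are just mine for illustration; the only number taken from our setup is the roughly 2 weeks of training on the current instance.

```python
def total_cost(hourly_rate: float, hours: float) -> float:
    """Total bill for a run: hourly price times wall-clock hours."""
    return hourly_rate * hours


def break_even_speedup(current_rate: float, candidate_rate: float) -> float:
    """Speedup the candidate instance needs just to match the current total cost."""
    return candidate_rate / current_rate


# Placeholder prices (NOT real Azure quotes); substitute current list prices.
nc24s_rate = 12.0    # hypothetical $/hour for NC24s v3
nd96asr_rate = 30.0  # hypothetical $/hour for ND96asr v4
baseline_hours = 14 * 24  # about 2 weeks on the current instance

needed = break_even_speedup(nc24s_rate, nd96asr_rate)
print(f"ND96asr v4 must be at least {needed:.2f}x faster to break even")

baseline_cost = total_cost(nc24s_rate, baseline_hours)
for speedup in (2.0, 4.0):  # hypothetical speedups, to be replaced by a real benchmark
    cost = total_cost(nd96asr_rate, baseline_hours / speedup)
    print(f"{speedup}x faster: ${cost:,.2f} vs ${baseline_cost:,.2f} on the current instance")
```

In other words, the bigger instance only wins on cost if the measured speedup is larger than the ratio of the two hourly prices, so a short benchmark run on each instance would probably settle it.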
(Also, I am trying to figure out what Azure’s ML Studio even is: whether it is just an interface or whether it provides VM access as well.)