It seems that the criteria have changed. In other words, when using large models, the cost per request becomes expensive.
Starting in March, usage now takes into account compute time x price of the hardware
It seems that the criteria have changed. In other words, when using large models, the cost per request becomes expensive.
Starting in March, usage now takes into account compute time x price of the hardware