Cannot run large models using API token

Hi,

This makes sense; however, these two models:
zephyr
aisak-assistant
are exactly the same size, but the former can run on the Inference API, whereas the latter cannot. Could you please provide a solution or an explanation?

Thanks