Yeah, so @camenduru based on this I should have easily been able to have 15.4 gb of checkpoints and been just fine on the T4 Medium or A10g small, right?
Once the Amazon EKS Distro is available I’ll have to rebuild my entire space or what?
However, I want to embed the space as an iframe or use it as a webcomponent so making the space private isn’t ideal/feasible.
Using gradio auth/env secrets will allow me to have the organization / space be public and still not allow the space to be started by the public running up a large HF bill.
Now to figure this out with the example from @Omnibus
On T4 small you actually are limited in storage to 50G at runtime and 110G for A10G small, you can find the available disk at runtime by flavor here.
Depending on the size of your image though the available storage could be less (but in any case you cannot have more).
@iamrobotbear yes 15.4 GB checkpoints should be able to fit on A10G small and T4 medium, but the image need to be optimized.
In the current state the following two lines at the end of the Dockerfile:
RUN chown -R user:user /content
RUN chmod -R 777 /content
Effectively both increase your image size by at least the size of your checkpoints (so at least about 30GB)
I found one line of the code no work for me in a10g large and the solution was replace the line 17 that install xformers with RUN pip install xformers @camenduru