Since it includes models close to the latest ones such as Gemma 3, the Transoformers version is likely to be almost the latest. In fact, even older Transoformers models should work with the Llama architecture. This is indeed a strange error. The cause is probably not the code or the model itself.
There seems to be a possibility of errors occurring in hf_transfer related to Jupyter. In other words, there may be an error in the download.