Chameleon LLaMA style architecture with a VQGAN

can some one tell me is it a new image recognation model?

can som one make a gguf ?