Fine-tuning CodeLlama on custom data

Hi, i am attempting to train a CodeLlama-34b-v2 model on a custom dataset of front end code. I have tried to do this using GCPs Vertex AI; however, the integration with huggingface and GCP resources is not that intuitive. Does anyone have any experience with training this model on larger datasets of code?