I am trying to fine-tune llama3 on custom dataset.
My dataset is json documents containing {instuction, input, output} keys.
I want to Convert json dataset to “datasets.arrow_dataset.Dataset” type.
I want to use the code :
from datasets import load_dataset
dataset = load_dataset(“yahma/alpaca-cleaned”, split = “train”)