I posted a line from my dataset a few responses back. The person in this thread looks like they are using the exact same formatting I did: https://discuss.huggingface.co/t/load-dataset-fail-for-custom-json-format/30350/4?u=gokstad
My file is a JSONL file, and each line looks like this:
{"input": "When was 13 Winters formed?", "output": "13 Winters was formed in 2001 in the black woods of southern Maine."}
But looking again, I see their format is like this:
{"input": When was 13 Winters formed?, "output": 13 Winters was formed in 2001 in the black woods of southern Maine.}
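For what it's worth, here is a quick way to compare the two lines. This is only a sketch using Python's standard `json` module, not anything from `datasets` itself, and the helper name `is_valid_json_line` is made up for illustration. It just checks whether each line parses as a JSON object on its own, which is what a JSONL file needs.

```python
import json

# The line from my file, with string values in double quotes.
quoted = '{"input": "When was 13 Winters formed?", "output": "13 Winters was formed in 2001 in the black woods of southern Maine."}'

# The line as it appears in the other thread, with unquoted values.
unquoted = '{"input": When was 13 Winters formed?, "output": 13 Winters was formed in 2001 in the black woods of southern Maine.}'


def is_valid_json_line(line: str) -> bool:
    """Hypothetical helper: True if the line parses as a JSON object."""
    try:
        return isinstance(json.loads(line), dict)
    except json.JSONDecodeError:
        return False


print(is_valid_json_line(quoted))    # True
print(is_valid_json_line(unquoted))  # False
```

So the unquoted version is not valid JSON as written, while my quoted version parses fine on its own.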