How to create dataset from github

i want to train the llm (llama) in git code like i want to create

For example
I want to give him a lot of written snake programs
And then ask him to make me a snake game, but I will ask him what I want
What the llama cannot do today

Iā€™m not trying for him to know how to build anything other than snake

Then I will try to teach him another code for the game (instead of pygame) and then ask him to build it with this codeā€¦

The question is how do I create a dataset from the git I choose

Hi,

you can look at Share a dataset to the Hub, and Structure your repository for more details.

Hey, @yosiNewman did u manage to create a dataset? I am facing similar problem right now,