[Open-to-the-community] One week team-effort to reach v2.0 of HF datasets library

I’m open to contribute. I want to get a lot of Source Code related Data. It would be really helpful for all of the Ml4code community.

1 Like

Anthology of serbian literature:


1 Like

I would like to contribute, open for any dataset. However, would like to contribute for Indian languages

1 Like

@thomwolf I would like to participate to add datasets for 10 Indian Languages.

Paper link: https://arxiv.org/abs/2005.00085

and many datasets for different Indian languages for tasks such as text classification and language translation.

1 Like

This is an excellent initiative. I would like to participate :hugs: @thomwolf

1 Like

I would love to help

1 Like

I’m interested.

1 Like

I would like to contribute for Tamil language

1 Like

I would love to be part of this.

1 Like

Hello!
I’m Saif, from Pakistan. Would love to contribute to HuggingFace, but I’m really new to HugginFace itself. I 'll take the time to learn the exact details of what I can exactly contribute to, but coming from Pakistan, I can think I can help with adding data related to Urdu.
Some links for the Urdu NLP are,

  1. https://www.urdunlp.com/
  2. Some github references, https://github.com/topics/urdu-nlp

And more. I’ll go and look further into the HuggeFace.

1 Like

I am (slowly) working on a dataset for token-level classification. Not sure if in scope, but either way, I’d love to lurk and contribute if possible!

1 Like

I have curated a list of NLP datasets that has Tamil language here:

1 Like

Hi, I would love to contribute. The language I would love to work on is telugu. Will keep posting links in the meanwhile

1 Like

In to contribute

1 Like

I would love to work and contribute pls add me to Slack group

1 Like

I I would like to participate. Please add me.

1 Like

I would love to contribute in developement.

1 Like

I would love to contribute to work on Indic NLP Corpora - specifically Hindi and Bengali.

1 Like

Count me in :v:t3::v:t3::blush:

1 Like

I would like to contribute on this! I am currently working on low resource Indic languages so I think that would be good addition.

1 Like