New dataset added: review for improvement

Abirate · December 9, 2021, 10:34am

I recently created a dataset called “english_quotes”.
The data was collected by web scraping with the BeautifulSoup and Requests libraries. I also added the script I wrote to scrape the data and build the dataset to the dataset card (under "Who are the source Data producers?").
It’s just a start. I wanted to validate (for myself) the “Datasets” chapter of the course with a final self-evaluation on a dataset other than the course’s “github_issues” (I admit, I changed the sections of the dataset card a bit). If you have any remarks or suggestions for improvement, please let me know (I will add more advanced datasets in the future)... and thanks.

Link to the new dataset “english_quotes”: Abirate/english_quotes · Datasets at Hugging Face

lhoestq · December 15, 2021, 1:56pm

This is awesome! Thanks for adding this dataset.

Do you know if there’s a list somewhere of all the possible tags? It would be useful to know how many classes there are when training multi-class classification models.
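
In case no such list exists, the tags could probably be counted directly from the data. Here is a minimal sketch, assuming each record is a dict with a `tags` field holding a list of strings (an assumption about the schema, so check the dataset card). The `count_tags` helper and the sample rows are hypothetical and only stand in for the real dataset.

```python
from collections import Counter


def count_tags(records, tag_field="tags"):
    """Count how often each tag appears across an iterable of dict-like records."""
    counts = Counter()
    for record in records:
        # A missing or None tag field is treated as "no tags".
        counts.update(record.get(tag_field) or [])
    return counts


# Hypothetical sample rows standing in for the real dataset.
sample = [
    {"quote": "q1", "author": "a1", "tags": ["life", "humor"]},
    {"quote": "q2", "author": "a2", "tags": ["life"]},
    {"quote": "q3", "author": "a3", "tags": []},
]

tag_counts = count_tags(sample)
print(len(tag_counts))           # number of distinct tags
print(tag_counts.most_common())  # tags sorted by frequency
```

With the 🤗 Datasets library, the same helper should work on the real data by passing it the result of `load_dataset("Abirate/english_quotes", split="train")`, assuming the split is named `train`, since iterating over a dataset yields one dict per row.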
