I have recently created a dataset called “english-quotes”.
The data collection process is web scraping using BeautifulSoup and Requests libraries. I also added the script that I created to scrape data and create the dataset in the Card Description (under " Who are the source Data producers ?")
It’s just a start. I wanted to validate (for myself) the “Datasets” chapter of the course by a final self-evaluation different from the dataset of the course “github_issues” (I admit, I changed the sections of the data card a bit). If you have any remarks or comments for improvement, please let me know (as I will add more advanced datasets in the future)… and thanks.
- the link of the new dataset added “english_quotes”: Abirate/english_quotes · Datasets at Hugging Face