Chapter 5 questions

juancopi81 · March 30, 2022, 2:43pm

Hello everyone,

I am very new to the topic, so sorry if this question is obvious.

I’d like to start working on this task (Chapter 5 - Time to slice and dice):

Use the techniques from Chapter 3 to train a classifier that can predict the patient condition based on the drug review.

Since this label (patient condition) is also a string (I think there are 819 unique conditions), what would be the best approach? I was thinking about tokenizing this field and then use a seq2seq model. Or maybe assign a number to each unique condition

Thanks for the great course!

Topic		Replies	Views
Correct way to create a Dataset from a csv file Beginners	13	14122	March 25, 2022
Fetching rows of a large Dataset by index 🤗Datasets	10	1641	March 15, 2021
Loading Custom Datasets 🤗Datasets	7	10708	May 25, 2021
Load_dataset('csv', data_files='./imdb.csv') [Errno 2] No such file or directory: './imdb.csv' 🤗Datasets	2	350	November 29, 2023
Chapter 3 questions Course	149	10533	August 29, 2025

Chapter 5 questions

Related topics