What is the proper way of handling multiple features in Huggingface?

Vanofuture · August 13, 2022, 10:40am

Hello.
I am working on a project that labels different political videos based on a certain features like: text-to speech, text in the video, face emotions, metadata and so on.
I am new to Huggingface and ML in general, so my question is: what is the right way to put all these features together in Huggingface? I’ve tried 2 approaches: to put every feature as text in a dataset updating the Features object accordingly and to put everything together separated by comma or separator. Surprisingly, the second approach did better and I’ve got a better accuracy. Why is that so? How am I supposed to add integer data or floating point measures (e.g., level of happiness of a speaker) to my neural network?

Thank you!

Topic		Replies	Views
Merge custom dataset with dataset on Huggingface : problem with features Beginners	0	180	April 20, 2024
How to train a combination model Models	0	19	August 23, 2024
Concatenate non string features to a BERT transformers model Beginners	5	2824	March 27, 2022
Need guidance in selecting the model and the required approach Beginners	0	243	December 21, 2022
Adding additional features to BERT model Models	0	1045	July 18, 2022

What is the proper way of handling multiple features in Huggingface?

Related topics