HF Dataset as a Replay Buffer for RL applications

Hi,

I am interested in using HuggingFace models & datasets for a reinforcement learning use case. For this, I would need to implement a replay buffer.

I considered using HF Datasets because (1) it couples easily with HF models and (2) it is efficient, thanks to zero-copy reads from memory-mapping the whole dataset. However, I do not see any functionality for (efficiently) augmenting the dataset. Does this functionality exist?

Additionally, I need the other standard replay buffer functionality: sampling based on priorities, unloading the buffer, etc.

Do you think I should customize HF Datasets for my use case, or would it be better to couple some other replay buffer (e.g. RLlib, Stable Baselines) with HF models?

Thanks in advance. cc the HF RL team: @ThomasSimonini @edbeeching @natolambert @lvwerra

Hi Blazej – I agree. Are there any structural blockers to using datasets for this? I guess the challenge is how to handle the FIFO/LIFO nature of a replay buffer. I wonder if it’s interesting to just keep all of the data and have a wrapper that exposes only the N most recent items.
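Something like this rough sketch is what I’m picturing (the class and column names are made up, not a real API; it assumes transitions get appended as rows of a `datasets.Dataset`):

```python
# Minimal sketch of the "keep the N most recent" wrapper idea.
# RecentWindowBuffer and `capacity` are illustrative names, not a real API.
from datasets import Dataset, concatenate_datasets

class RecentWindowBuffer:
    def __init__(self, capacity: int):
        self.capacity = capacity
        self.data = None  # underlying datasets.Dataset

    def add_batch(self, batch: dict):
        """Append a batch of transitions, e.g. {"obs": [...], "reward": [...]}."""
        new = Dataset.from_dict(batch)
        self.data = new if self.data is None else concatenate_datasets([self.data, new])
        # FIFO behaviour: keep only the `capacity` most recent rows.
        # select() builds an indices mapping rather than copying the data.
        if len(self.data) > self.capacity:
            start = len(self.data) - self.capacity
            self.data = self.data.select(range(start, len(self.data)))

    def __len__(self):
        return 0 if self.data is None else len(self.data)
```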

What do you mean by augmenting?

I think there are some discussions around this with another collaborator; let me follow up internally on this too.

Hi Nathan, thanks for the response.
Indeed, FIFO/LIFO sampling and removal are functionality that I need. Additionally, sampling proportional to an item’s priority is desired. Would something like this be possible with Datasets while retaining its efficiency?
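Concretely, for the priority part I’m imagining something like the sketch below. It assumes each row carries a numeric `priority` column (an illustrative name, not an existing feature), and materializing that column in NumPy on every draw may itself be a cost at scale:

```python
# Sketch of sampling proportional to a per-item priority.
# Assumes the buffer is a datasets.Dataset with a numeric "priority" column.
import numpy as np
from datasets import Dataset

def sample_by_priority(buffer: Dataset, batch_size: int, seed=None) -> Dataset:
    rng = np.random.default_rng(seed)
    priorities = np.asarray(buffer["priority"], dtype=np.float64)
    probs = priorities / priorities.sum()
    # replace=False assumes batch_size <= len(buffer)
    idx = rng.choice(len(buffer), size=batch_size, replace=False, p=probs)
    return buffer.select(idx)
```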

By augmenting the dataset I mean adding new items to the buffer in an efficient manner (see the sketch below).
Having such functionality would definitely push HF forward as a place for RL experiments.
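The closest existing pieces I’ve found are `Dataset.add_item` and `concatenate_datasets`, roughly as in this sketch (column names are made up; note both calls return a new `Dataset` rather than mutating in place), but I’m unsure how efficient this is for frequent, small appends:

```python
# Sketch of appending to the buffer with the existing datasets API.
from datasets import Dataset, concatenate_datasets

buffer = Dataset.from_dict({"obs": [[0.0, 1.0]], "action": [0], "reward": [1.0]})

# Append a single transition (returns a new Dataset; no in-place mutation).
buffer = buffer.add_item({"obs": [0.5, 0.5], "action": 1, "reward": 0.0})

# Append a batch of transitions at once.
batch = Dataset.from_dict({"obs": [[1.0, 0.0]], "action": [2], "reward": [0.5]})
buffer = concatenate_datasets([buffer, batch])
```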

Yeah, that makes sense. I’ve shared it with the RL & dataset teams.

Thanks. Looking forward to hearing back from you!

Hi Nathan! Any updates on the matter from the RL or Dataset team?

Not really, sadly. I’ve been mostly working on non-RL things, but there’s a note of this internally. Hopefully more can get built on it soon.