Are alpha and beta in torchrl's prioritized experience replay schedulable?

MadChuck · September 20, 2023, 9:05pm

Hi,

I’ve been thinking for a while to switch from openai baselines PER implementation to torchrl.data.PrioritizedReplayBuffer, since it should be faster, considering current implementation needs to load batch to torch format anyway.

In the original paper, the parameters alpha and beta were changing over time. I also am aware that in some cases, fixed alpha and beta may be better (as reported in dopamine/rainbow).

Since I don’t see an official way of scheduling alpha and beta in documentation, I’m curious whether that was done purposefully for simplicity with believe that constant alpha and beta are better or maybe it has to do with something else or is actually possible?

vmoens · September 24, 2023, 6:45am

This is not currently supported but I’d love to give it a look! Can you post a feature request on the GitHub repo?

MadChuck · September 25, 2023, 11:12am

Sure, thanks for the interest
https://github.com/pytorch/rl/issues/1575