Hello @DungNguyen
It appears as if I’m facing the exact same problem as you, implementing multiple A2C agents in a RL gym env. I have encountered the same errors and I suspect your solution might be helpful for my use case.
Can you explain what worker.data
represents conceptually and how you went about resetting it after every episode?
Thanks in advance