Latest reinforcement-learning topics

Topic	Replies	Views	Activity
Custom Vectorized environment for torchrl	3	205	April 3, 2025
Gymnasium FrozenLake - why one-hot encoding for state is required?	0	84	March 27, 2025
Defining a ProbalisticActor with two normal distributions	17	280	March 13, 2025
Feature Request: Add a `torch.range_map` operator for easy value range mapping	1	94	March 3, 2025
TorchRL cpu-only installation	4	347	February 28, 2025
A3C problem with PyTorch versiona >=2.0.0	16	1320	February 14, 2025
Best practice for integrared vs separate optimizers in actor-critic models	2	165	February 7, 2025
Looking for Up-to-Date MAML meta-RL example	0	136	February 3, 2025
Question about gradient calculation in backward() of actor network of DDPG	0	112	January 27, 2025
Training converges on cpu but never on gpu	6	479	January 20, 2025
"input types can't be cast to the desired output type Long"	1	95	January 20, 2025
I have some problems with algorithm realization	0	78	January 18, 2025
MultiDiscrete Observation Causes Shape Mismatch	1	176	January 17, 2025
RuntimeError: Gradient computation modified by an inplace operation	1	259	January 17, 2025
How should the "next" field in TorchRL's tensordict be?	1	92	January 17, 2025
Is this PPO training code wrong?	1	243	January 17, 2025
Multidmensional Actions	4	1687	January 8, 2025
MAML Implementation failing to adapt to new tasks	0	240	December 17, 2024
Correct way of using foreach_worker and foreach_env	0	150	December 10, 2024
Reward averages to 0 instead of increasing	0	165	December 5, 2024
TorchRL Probabilistic Actor Returning Continuous Actions Despite Given Bounded Discrete Action Spec	1	98	December 5, 2024
Soft Actor-Critic Algorithm does not manage to solve slightly more complicated tasks	0	139	December 2, 2024
I am trying to implement a Hierarchical Actor Critic model but my neural network doesn't return value with the correct dimension	2	67	November 18, 2024
Hello , Please anyone can help me about my code , I try to implemnt mean field game theory to find the optimal charging . The code is working but the loss of actor neural netwrok does not learn quickly. Any thought any help I will appreciate alot	0	85	November 13, 2024
Need advice for solving partially observed maze environment	0	49	October 23, 2024
SAC doesn't converge in gym Mountain Car environment	1	335	October 22, 2024
Issue with training policy networks using PPO	3	243	October 22, 2024
Significiant time difference between minor model architecutre change	1	72	October 15, 2024
Gymnasium Single Frame Render with TorchRL	1	121	October 15, 2024
OpenXExperienceReplay fails	1	167	October 15, 2024