Latest reinforcement-learning topics

Topic	Replies	Views	Activity
About the reinforcement-learning category	7	4514	October 18, 2023
Bridging the Gap: Using PyTorch for Intelligent Agent Design in Game Environments	0	5	March 2, 2026
How to instantiate double LSTM + MLP for actor/critic in PPO	0	43	December 20, 2025
Performance Differences in TD3 Training When Switching from NumPy 1.26.0 to NumPy 2.2.6	0	39	December 9, 2025
Several implementations of TruncatedNormal?	0	23	December 1, 2025
PPO learning poorly on LunarLander-v3	2	233	November 18, 2025
Apparent RAM memory leak when converting batch of ndarray states to GPU tensor	0	51	October 29, 2025
Training Machine Learning Model In Browser For Reinforcement Learning	1	151	October 27, 2025
Model Boilerplate for a Simple DQN	3	123	October 22, 2025
Implementation of Hierarchical Actor Critic with PPolicy-on Policy-off Policy Optimization for primitive actions	0	74	October 15, 2025
PyTorch Compatibility with Older CUDA Versions	1	60	September 20, 2025
Agent Masking in Multi-agent environment?	0	48	September 7, 2025
Batching a multicategorical spec	4	130	August 27, 2025
How to pass options to env.reset within a data collector	0	48	August 26, 2025
Environments from scratch with Torchrl	17	1439	August 25, 2025
How to manage done in a batched custom Env?	3	79	August 25, 2025
ClipPPOLoss problem with MaskedCategorical dist	2	61	August 21, 2025
CosTrader Env from scratch... and transform problem	3	66	August 15, 2025
PPO with Categorical Action... help	10	195	August 14, 2025
Question about TorchRL ParallelEnv error on single-gpu device	3	105	August 5, 2025
Help understanding data collectors	1	88	August 4, 2025
Should we split the trajectories prior to calculating the loss for a DQN?	1	49	August 4, 2025
Question About If PPO Training Will Work	1	108	July 29, 2025
RTX 5090 interconnection with pytorch	6	265	July 28, 2025
Model almost instantly produces "nan"	4	258	July 19, 2025
What loss function should the inner loop of MAML use?	2	120	June 27, 2025
TruncatedNormal loc argument	3	99	June 19, 2025
Using buffers in ParallelEnvs / MultiSyncCollectors	2	158	June 16, 2025
Multi-agent RL with different agent action spaces	0	88	June 12, 2025
Policy Gradient For Pong Not Learning	0	64	May 28, 2025