Latest reinforcement-learning topics

Topic	Replies	Views	Activity
Issue with handling invalid moves in reinforcement learning	2	1219	March 25, 2019
Error: void THCudaTensor_gatherKernel() failed	1	1768	March 22, 2019
Trying to understand why output of nn.Linear (for output layer) isn't retaining	0	1109	March 18, 2019
Multiple cpu producers with few gpus not utilize 100% of the gpus	0	417	March 8, 2019
Doesn't official REINFORCE example work?	3	2427	March 3, 2019
Examples for asynchronous RL (IMPALA, Ape-X) with actors sending observations (not gradients) to a learner's replay buffer	0	1814	February 26, 2019
How can I implement an environment run purely on GPU?	11	1031	March 2, 2019
What is the justification for normalizing each episode's reward targets in the policy gradient examples?	1	1841	March 2, 2019
Diagnosing slow backward pass with RL gradient over minibatch	3	1144	February 26, 2019
Unrolling nn.LSTM	6	2731	February 26, 2019
Reinforcement_q_learning.py	4	836	February 19, 2019
Multiprocessing with cuda model	0	995	February 18, 2019
Copying part of the weights	5	11150	February 17, 2019
Backprob policy	0	405	February 11, 2019
Manager.queue get() error	0	478	February 11, 2019
Can you help me adapt the actor-critic example for multi-gpu?	4	937	February 9, 2019
Best practices for exploration/exploitation in Reinforcement Learning	0	657	February 4, 2019
Expected 4-dimensional weight for 4-dimensional input	3	10612	February 1, 2019
Can I backpropagate different distributions at once using Policy Gradient?	1	564	January 31, 2019
Parallel online policy gradient on Module level with torch.multiprocessing	1	658	January 31, 2019
Bad inference performance on some CPUs	2	1612	January 25, 2019
Why is memory being allocated on GPU?	1	564	January 24, 2019
Multi agent deep reinforcement learning to an environment with discrete action space	6	1662	January 22, 2019
Torch.multiprocessing possible alternative to barrier	0	652	January 19, 2019
Unreasonable performances of a simple linear policy	3	946	January 17, 2019
Understanding Enropy	2	1767	January 16, 2019
Torch/RL newbie: Trying to do PPO	0	1583	January 14, 2019
Several questions regarding my implementation of PPO on Pytorch	2	4419	September 19, 2018
Memory leak during backprop in Reinforcement Learning tutorial?	0	871	January 6, 2019
Pytorch categorical distribution, probably a bug?	3	1989	December 30, 2018