| 
            
            
              About the reinforcement-learning category
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
              
                 
              
              
                 
              
           | 
          
            7
           | 
          
            4446
           | 
          
            October 18, 2023
           | 
        
        
          | 
            
            
              Apparent RAM memory leak when converting batch of ndarray states to GPU tensor
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            17
           | 
          
            October 29, 2025
           | 
        
        
          | 
            
            
              Training Machine Learning Model In Browser For Reinforcement Learning
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            1
           | 
          
            97
           | 
          
            October 27, 2025
           | 
        
        
          | 
            
            
              Model Boilerplate for a Simple DQN
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
           | 
          
            3
           | 
          
            38
           | 
          
            October 22, 2025
           | 
        
        
          | 
            
            
              Implementation of Hierarchical Actor Critic with PPolicy-on Policy-off Policy Optimization for primitive actions
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            19
           | 
          
            October 15, 2025
           | 
        
        
          | 
            
            
              PyTorch Compatibility with Older CUDA Versions
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            1
           | 
          
            35
           | 
          
            September 20, 2025
           | 
        
        
          | 
            
            
              Agent Masking in Multi-agent environment?
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            22
           | 
          
            September 7, 2025
           | 
        
        
          | 
            
            
              Batching a multicategorical spec
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            4
           | 
          
            84
           | 
          
            August 27, 2025
           | 
        
        
          | 
            
            
              How to pass options to env.reset within a data collector
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            23
           | 
          
            August 26, 2025
           | 
        
        
          | 
            
            
              Environments from scratch with Torchrl
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
              
                 
              
           | 
          
            17
           | 
          
            1337
           | 
          
            August 25, 2025
           | 
        
        
          | 
            
            
              How to manage done in a batched custom Env?
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            3
           | 
          
            53
           | 
          
            August 25, 2025
           | 
        
        
          | 
            
            
              ClipPPOLoss problem with MaskedCategorical dist
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            2
           | 
          
            39
           | 
          
            August 21, 2025
           | 
        
        
          | 
            
            
              CosTrader Env from scratch... and transform problem
            
            
           | 
          
              
                 
              
           | 
          
            3
           | 
          
            34
           | 
          
            August 15, 2025
           | 
        
        
          | 
            
            
              PPO with Categorical Action... help
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
           | 
          
            10
           | 
          
            96
           | 
          
            August 14, 2025
           | 
        
        
          | 
            
            
              Question about TorchRL ParallelEnv error on single-gpu device
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            3
           | 
          
            53
           | 
          
            August 5, 2025
           | 
        
        
          | 
            
            
              Help understanding data collectors
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            1
           | 
          
            49
           | 
          
            August 4, 2025
           | 
        
        
          | 
            
            
              Should we split the trajectories prior to calculating the loss for a DQN?
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            1
           | 
          
            31
           | 
          
            August 4, 2025
           | 
        
        
          | 
            
            
              Question About If PPO Training Will Work
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            1
           | 
          
            63
           | 
          
            July 29, 2025
           | 
        
        
          | 
            
            
              RTX 5090 interconnection with pytorch
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            6
           | 
          
            171
           | 
          
            July 28, 2025
           | 
        
        
          | 
            
            
              Model almost instantly produces "nan"
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            4
           | 
          
            145
           | 
          
            July 19, 2025
           | 
        
        
          | 
            
            
              What loss function should the inner loop of MAML use?
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
           | 
          
            2
           | 
          
            86
           | 
          
            June 27, 2025
           | 
        
        
          | 
            
            
              TruncatedNormal loc argument
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
              
                 
              
           | 
          
            3
           | 
          
            50
           | 
          
            June 19, 2025
           | 
        
        
          | 
            
            
              Using buffers in ParallelEnvs / MultiSyncCollectors
            
            
           | 
          
              
                 
              
           | 
          
            2
           | 
          
            128
           | 
          
            June 16, 2025
           | 
        
        
          | 
            
            
              Multi-agent RL with different agent action spaces
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            52
           | 
          
            June 12, 2025
           | 
        
        
          | 
            
            
              Policy Gradient For Pong Not Learning
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            29
           | 
          
            May 28, 2025
           | 
        
        
          | 
            
            
              Torchrl kl_div for old and new policy
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            41
           | 
          
            April 7, 2025
           | 
        
        
          | 
            
            
              Custom Vectorized environment for torchrl
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            3
           | 
          
            122
           | 
          
            April 3, 2025
           | 
        
        
          | 
            
            
              Gymnasium FrozenLake - why one-hot encoding for state is required?
            
            
           | 
          
              
                 
              
           | 
          
            0
           | 
          
            63
           | 
          
            March 27, 2025
           | 
        
        
          | 
            
            
              Defining a ProbalisticActor with two normal distributions
            
            
           | 
          
              
                 
              
              
                 
              
              
                 
              
           | 
          
            17
           | 
          
            171
           | 
          
            March 13, 2025
           | 
        
        
          | 
            
            
              Feature Request: Add a `torch.range_map` operator for easy value range mapping
            
            
           | 
          
              
                 
              
              
                 
              
           | 
          
            1
           | 
          
            55
           | 
          
            March 3, 2025
           |