[resolved] Actor Critic with a large amount of possible actions

@mjacar So instead of choosing an action from the distribution, I would find the covariance matrix of the distribution and use that?