Tool for policy search

I wrote this a month ago but didn’t push the repo at the time because I didn’t think it was that useful.

But if someone wants to explore architectures deeper than a simple linear model and run ARS-based (Augmented Random Search) policy search (https://arxiv.org/abs/1803.07055), or even try OpenAI’s evolution strategies (https://arxiv.org/abs/1703.03864), this “optimizer” may save you a fair amount of boilerplate.
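
For anyone unfamiliar with what such an optimizer does, here is a rough sketch of one ARS-style update in NumPy. This is not the repo's code or API: `ars_step`, `toy_reward`, and all hyperparameter values are hypothetical names and numbers picked for illustration, and the sketch leaves out the observation normalization used in the full ARS variants. The point is that the update only ever sees a flat parameter vector `theta`, so the same step should work whether `theta` holds the weights of a linear policy or the flattened weights of a deeper network.

```python
import numpy as np


def ars_step(theta, reward_fn, rng, n_directions=8, n_top=4,
             step_size=0.02, noise_std=0.03):
    """One simplified ARS-style update on a flat parameter vector.

    Hypothetical helper for illustration only; not the repo's API.
    """
    # Sample random search directions, one per row.
    deltas = rng.standard_normal((n_directions, theta.size))

    # Evaluate the reward at theta + noise and theta - noise (antithetic pairs).
    r_plus = np.array([reward_fn(theta + noise_std * d) for d in deltas])
    r_minus = np.array([reward_fn(theta - noise_std * d) for d in deltas])

    # Keep only the directions whose better side scored highest.
    top = np.argsort(np.maximum(r_plus, r_minus))[::-1][:n_top]

    # Scale the step by the standard deviation of the rewards that are used.
    sigma_r = np.concatenate([r_plus[top], r_minus[top]]).std()
    if sigma_r < 1e-8:
        sigma_r = 1.0  # avoid dividing by ~0 when all rewards are equal

    # Move along each kept direction, weighted by its reward difference.
    update = ((r_plus[top] - r_minus[top])[:, None] * deltas[top]).sum(axis=0)
    return theta + step_size / (n_top * sigma_r) * update


# Toy stand-in for an episode return (an assumption, not a real environment):
# the reward is highest when theta equals a fixed target vector.
target = np.ones(5)


def toy_reward(theta):
    return -float(np.sum((theta - target) ** 2))


rng = np.random.default_rng(0)
theta = np.zeros(5)
initial_reward = toy_reward(theta)

for _ in range(200):
    theta = ars_step(theta, toy_reward, rng)

final_reward = toy_reward(theta)
print(f"reward before: {initial_reward:.3f}, after: {final_reward:.3f}")
```

In a real setup, `reward_fn` would unpack `theta` into the policy's weights, run one or more episodes, and return the total reward. An OpenAI-ES-style step would, roughly speaking, use all sampled directions with rank-normalized rewards instead of keeping the top few and dividing by the reward standard deviation.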