Efficiently applying per-neuron activation functions

The activation is going to be a modified Mish with a random per-neuron coefficient in the exponential.
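
To make that concrete, here is a minimal sketch of one way to do it with broadcasting, assuming PyTorch. The exact form of the modification is a guess: it takes standard Mish, x * tanh(ln(1 + e^x)), and puts a coefficient beta_i inside the exponential for neuron i. The class name `PerNeuronMish` and the uniform sampling range are made up for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class PerNeuronMish(nn.Module):
    """Mish variant with one randomly drawn coefficient per neuron.

    Assumed form: y_i = x_i * tanh(ln(1 + exp(beta_i * x_i))).
    """

    def __init__(self, num_features, low=0.5, high=1.5):
        super().__init__()
        # One coefficient per neuron; the range [low, high) is an arbitrary choice.
        beta = torch.empty(num_features).uniform_(low, high)
        # A buffer follows the module across devices but is not trained.
        self.register_buffer("beta", beta)

    def forward(self, x):
        # x has shape (..., num_features); beta broadcasts over the leading dims,
        # so every neuron gets its own coefficient without a Python loop.
        return x * torch.tanh(F.softplus(self.beta * x))


torch.manual_seed(0)
act = PerNeuronMish(num_features=4)
x = torch.randn(3, 4)
y = act(x)  # same shape as x
```

If the coefficients should be learned instead of fixed, wrapping `beta` in `nn.Parameter` in place of `register_buffer` would be the usual change.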

The split-and-concat version I mentioned refers to this forum post: How to apply different activation fuctions to different neurons
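
For reference, a split-and-concat approach generally looks something like the sketch below, assuming PyTorch. This is a generic illustration, not code copied from that post, and the helper name `split_concat_activation` is hypothetical.

```python
import torch


def split_concat_activation(x, fns):
    """Apply fns[i] to feature column i, then stitch the columns back together."""
    # One (..., 1) slice per neuron along the last dimension.
    columns = torch.split(x, 1, dim=-1)
    if len(columns) != len(fns):
        raise ValueError("need exactly one function per feature column")
    return torch.cat([fn(col) for fn, col in zip(fns, columns)], dim=-1)


fns = [torch.relu, torch.tanh, torch.sigmoid]
x = torch.randn(2, 3)
y = split_concat_activation(x, fns)  # same shape as x
```

This runs one small op per neuron plus a concatenation, so the cost grows with layer width, which is why I am looking for something more efficient.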