Small gradient problems

Solved: Using softmax with a large number of layers was leading to overflow. Added a batch normalisation layer before the softmax.

1 Like