I tried some more things:
- disabled eval mode
- removed a non-standard regularization step
- replaced the non-standard activation function with an out-of-the-box ELU
In all cases I see the same memory leak of ~108k pages per epoch.
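One way to narrow this down might be to diff Python heap snapshots across epochs with the standard-library `tracemalloc` module. Below is only a minimal sketch: `run_epoch` is a hypothetical stand-in that leaks on purpose, not my actual training loop, and `top_growth` is just an illustrative helper name. Note that `tracemalloc` only sees allocations made through Python's allocators, so a leak inside native library code would not show up here.

```python
import tracemalloc


def run_epoch(leak):
    # Hypothetical stand-in for one training epoch.
    # It deliberately keeps ~1 MB alive per call to simulate a leak.
    leak.append(bytearray(1_000_000))


def top_growth(epochs=3, limit=5):
    """Return the source lines whose allocations grew most over `epochs`."""
    tracemalloc.start()
    leak = []
    run_epoch(leak)  # warm-up, so one-time allocations are excluded
    before = tracemalloc.take_snapshot()
    for _ in range(epochs):
        run_epoch(leak)
    after = tracemalloc.take_snapshot()
    tracemalloc.stop()
    # Differences are sorted with the largest size change first.
    return after.compare_to(before, "lineno")[:limit]


stats = top_growth()
for stat in stats:
    print(stat)
```

If the top entries stay flat while the process still grows by the same amount every epoch, that would suggest the leak is outside the Python heap.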
Any additional ideas on how to analyze this would be welcome.
Alberto