Segmentation Model failing to learn even on single overtrained example

So one thing to keep in mind is that you’re doing 250.000 (one per per-pixel) classifications here. What is the distribution of the classes in your examples? In my experience, imbalance between your 13 classes can be a problem with that. I sometimes mention a section in our book that describes this.

Best regards

Thomas