How to fine-tuning network for face recognition

The question is how to fine tuning a network for face recognition.
We with triple-loss as metric. It’s somewhat different from other vision task?
I found some torch model here http://www.robots.ox.ac.uk/~albanie/pytorch-models.html