Distance measurement between two activation matrixes

In the case of knowledge transfer, given two activation matrixes between two models of identical dimension, a teacher and a student, Beside L1 and L2 distances, what other distance measurements can I use to best measure the similarity between the two matrixes?