One model that does both translation and classification

Just as the human brain can do both translation and classification, how can I create a single model that accomplishes both of these tasks?
If I give it a sentence in one language as input, it should return the sentence in another language; if I give it an image as input, it should return a label. The same model should handle both.
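
To make the behaviour I am after concrete, here is a toy sketch in plain Python. Everything in it is an assumption for illustration: the names (`MultiTaskModel`, `predict`, the vocabularies and labels) are made up, the weights are random and untrained, and the word-by-word "translation" is only a stand-in for a real sequence model. It only shows the structure I have in mind: two input paths and two output paths around one shared set of weights.

```python
import math
import random


def matvec(vec, mat):
    """Multiply a row vector (length n) by an n x m matrix stored as nested lists."""
    return [sum(vec[i] * mat[i][j] for i in range(len(vec)))
            for j in range(len(mat[0]))]


def argmax(values):
    """Index of the largest entry."""
    return max(range(len(values)), key=values.__getitem__)


class MultiTaskModel:
    """Hypothetical sketch: one model object, one shared layer, two tasks."""

    def __init__(self, vocab_src, vocab_tgt, labels, image_size, hidden=8, seed=0):
        rng = random.Random(seed)

        def mat(rows, cols):
            # Random weights; a real model would learn these from data.
            return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

        self.src_index = {w: i for i, w in enumerate(vocab_src)}
        self.vocab_tgt = list(vocab_tgt)
        self.labels = list(labels)
        self.text_in = mat(len(vocab_src), hidden)     # one embedding row per source word
        self.image_in = mat(image_size, hidden)        # projects flattened pixels
        self.shared = mat(hidden, hidden)              # weights used by BOTH tasks
        self.text_out = mat(hidden, len(vocab_tgt))    # translation head
        self.label_out = mat(hidden, len(labels))      # classification head

    def _shared(self, features):
        # The common part of the model that both tasks pass through.
        return [math.tanh(v) for v in matvec(features, self.shared)]

    def translate(self, sentence):
        # Toy word-by-word mapping; assumes every word is in the source vocabulary.
        out = []
        for word in sentence.split():
            h = self._shared(self.text_in[self.src_index[word]])
            out.append(self.vocab_tgt[argmax(matvec(h, self.text_out))])
        return " ".join(out)

    def classify(self, pixels):
        # pixels: flat list of floats with length image_size.
        h = self._shared(matvec(pixels, self.image_in))
        return self.labels[argmax(matvec(h, self.label_out))]

    def predict(self, x):
        # Route by input type: text goes to translation, anything else to classification.
        return self.translate(x) if isinstance(x, str) else self.classify(x)


model = MultiTaskModel(["hello", "world"], ["bonjour", "monde"], ["cat", "dog"], image_size=4)
print(model.predict("hello world"))          # some sentence over the target vocabulary
print(model.predict([0.1, 0.2, 0.3, 0.4]))   # some label from the label set
```

Since nothing is trained, the outputs are arbitrary. What I am asking is how to build and train a real model with this kind of shared structure.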