In this thread, a cross-entropy loss was implemented with continuous (or soft) targets. Would that work for you?
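
For reference, here is a minimal, framework-agnostic sketch of what a cross-entropy loss with soft targets usually computes: the batch mean of `-sum_k target_k * log_softmax(logits)_k`. This is not the code from the linked thread. The function names `log_softmax` and `soft_cross_entropy` and the example numbers are made up for illustration, and I'm assuming each target row is a probability distribution (non-negative, sums to 1).

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a single list of logits."""
    m = max(logits)  # subtract the max before exponentiating to avoid overflow
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_sum for z in logits]


def soft_cross_entropy(batch_logits, batch_targets):
    """Batch mean of -sum_k target_k * log_softmax(logits)_k.

    Assumes each row of batch_targets is a probability distribution
    (hypothetical helper, written only to illustrate the formula).
    """
    total = 0.0
    for logits, targets in zip(batch_logits, batch_targets):
        log_probs = log_softmax(logits)
        total += -sum(t * lp for t, lp in zip(targets, log_probs))
    return total / len(batch_logits)


# Example: two samples, three classes; the second target happens to be one-hot.
logits = [[2.0, 0.5, -1.0], [0.1, 0.2, 0.3]]
soft_targets = [[0.7, 0.2, 0.1], [0.0, 0.0, 1.0]]
loss = soft_cross_entropy(logits, soft_targets)
print(loss)
```

With a one-hot target this reduces to the usual hard-label cross entropy, so the same function covers both cases. In a tensor library you would vectorize the same expression over the class dimension rather than loop in Python.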