How does the transformer output one word?

Why not stay in one discussion?