Object tracking inquiry

Hi, I am somewhat new to the community. Does Object tracking need their own labeling? or does bbox or object detection is enough? Also, does object tracking need kalman filter, can it just rely on Siamese network to cluster ID together?
Thank you.