Track each object under a consistent ID for the duration of the video.
Is that car signaling for a turn? Is that person sitting down or standing up? Questions like these are hard to answer from a single frame but easy to answer from video.
Annotating on video reduces the errors that arise when sequential frames are annotated as independent images.
Annotating on video produces 15-30x more training data than annotating on images.
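
As a rough illustration of the points above, the sketch below shows one way a video annotation could keep a single ID per object while producing a label for every frame. It is not any particular tool's API: the `Track` class, its fields, and the use of linear interpolation between hand-annotated keyframes are all assumptions made for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple

# (x, y, width, height) in pixels; a hypothetical box format for this sketch.
Box = Tuple[float, float, float, float]


@dataclass
class Track:
    """One object followed through a video under a single ID (hypothetical)."""
    track_id: int
    label: str
    # Frame index -> box drawn by hand on that frame.
    keyframes: Dict[int, Box] = field(default_factory=dict)

    def box_at(self, frame: int) -> Optional[Box]:
        """Return the box at `frame`, interpolating linearly between keyframes."""
        if frame in self.keyframes:
            return self.keyframes[frame]
        earlier = [f for f in self.keyframes if f < frame]
        later = [f for f in self.keyframes if f > frame]
        if not earlier or not later:
            return None  # frame lies outside the annotated span
        f0, f1 = max(earlier), min(later)
        t = (frame - f0) / (f1 - f0)
        b0, b1 = self.keyframes[f0], self.keyframes[f1]
        return tuple(a + t * (b - a) for a, b in zip(b0, b1))


# Two hand-drawn keyframes, 30 frames apart, for the same car.
car = Track(track_id=7, label="car")
car.keyframes[0] = (100.0, 200.0, 50.0, 30.0)
car.keyframes[30] = (160.0, 200.0, 50.0, 30.0)

# One (frame, track_id, box) label per frame, all sharing the same ID.
labels = [(f, car.track_id, car.box_at(f)) for f in range(31)]
```

Here two manual annotations yield 31 per-frame labels that share one track ID, so the frames are tied together rather than labeled as unrelated images. Real tools may use more sophisticated tracking than linear interpolation.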